Genome Channel Computational Biosciences
HomeAbout UsAnalysis ToolsInformation ResourcesProjects and Research
[space] Gene Channel Generation Microbial Gene Prediction System Grail (Microbial Gene Prediction System Internet Link) GrailEXP Genome Analysis Pipeline DomainParser PROSPECT (PROtein Structure Prediction and Evaluation Computer Toolkit) [space]
-

Gene Number Comparison

Creating of a gene number cross reference between versions is complicated any or all of the following reasons:

  1. Two genes become one - As contigs assemble, a partial gene at the end of one contig can assemble with a partial gene at the end of a different contig.
  2. Two genes become one - As the sequence is polished, changes to a stop, insertions, and deletions can cause two genes to be in frame and now form one gene.
  3. One gene becomes two - As the sequence is polished, insertions, deletions, and new stops can cause a gene to be split between two frames and now be two gene calls.
  4. A gene disappears - As the sequence is polished, a frame change or stop occurs so close to the start of a gene that it doesn't get predicted in the new assembly.
  5. A gene gets added - A new sequence may have new opportunities for genes which were not present in the previous build.
A tab-delimited comparison file is available to aid in the development of a cross reference. It is to be used as a starting point and not viewed as a final list. It was generated using a BLAST analysis from old to new and from new to old and has not been reviewed by human eyes. The list has not been sorted.

The columns are:

  1. Gene from the new assembly
  2. Length of gene from new assembly
  3. Gene from the old assembly
  4. Length of gene from old assembly
  5. Type of hit -
  6. Comparison of the two gene lengths (e.g., may be different)
  7. Percent Identity over the length of the alignment

-
SiteMap      Feedback      Life Sciences Division      ORNL     Disclaimer     Webmaster