A clone-based sequence-ready map provided the majority of sequencing substrates, including cosmids and YACs. Analysis and annotation of the finished sequence include identification of potential exons by similarity to EST data and known protein sequences and by gene-prediction programs. Before submission to GenBank, the generated data sets are read into the ACeDB database and reconciled manually with each other and with ancillary C. elegans map data. Thus far the teams have identified 13,747 annotated genes in 71.4 Mb of annotated sequence from the February 1998 ACeDB release. Some 30% match a C. elegans EST, and 55% have some similarity. About half of C. elegans genes lack significant database hits that are likely to provide clues to function, Chissoe noted, so WUSTL investigators are generating C. briggsae comparative sequencing data.
To read pdf files, download the free Acrobat Reader software.
Home * Search * Contacts * Disclaimer
Base URL: www.ornl.gov/hgmis
Site sponsored by the U.S. Department of Energy Office of Science, Office of Biological and Environmental Research, Human Genome Program