

C. elegans Sequencing Project Nears Finish

At the 1997 Santa Fe meeting, NIH-funded researcher Stephanie Chissoe [Washington University, St. Louis (WUSTL)] gave an overview of the final closure phase of the project to sequence the 100-Mb genome of the roundworm Caenorhabditis elegans. Working in equal collaboration with the Sanger Centre (Hinxton, U.K.), the researchers expect to finish by the end of this year, marking another major achievement in the Human Genome Project.

A clone-based, sequence-ready map provided most of the sequencing substrates, including cosmids and YACs. Analysis and annotation of the finished sequence include identifying potential exons by similarity to EST data and known protein sequences and by running gene-prediction programs. Before submission to GenBank, the resulting data sets are read into the ACeDB database and reconciled manually with each other and with ancillary C. elegans map data. In the February 1998 ACeDB release, the teams had annotated 13,747 genes in 71.4 Mb of sequence. Some 30% match a C. elegans EST, and 55% have some similarity. About half of C. elegans genes lack significant database hits likely to provide clues to function, Chissoe noted, so WUSTL investigators are generating comparative sequence data from C. briggsae.
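The figures quoted above imply a rough gene density, which can be worked out with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the whole-genome extrapolation assumes that gene density is uniform across the genome, which the article does not claim.

```python
# Figures quoted in the article (February 1998 ACeDB release).
ANNOTATED_GENES = 13_747
ANNOTATED_MB = 71.4
GENOME_MB = 100.0          # approximate genome size quoted in the article
EST_MATCH_FRACTION = 0.30  # "some 30% match a C. elegans EST"


def genes_per_mb(genes: int, megabases: float) -> float:
    """Average number of annotated genes per megabase."""
    return genes / megabases


def mean_bp_per_gene(genes: int, megabases: float) -> float:
    """Average number of base pairs of sequence per annotated gene."""
    return megabases * 1_000_000 / genes


def extrapolated_gene_count(genes: int, megabases: float, genome_mb: float) -> int:
    """Naive whole-genome estimate.

    Assumption (not from the article): the unannotated sequence has the
    same gene density as the annotated sequence.
    """
    return round(genes_per_mb(genes, megabases) * genome_mb)


if __name__ == "__main__":
    density = genes_per_mb(ANNOTATED_GENES, ANNOTATED_MB)
    spacing = mean_bp_per_gene(ANNOTATED_GENES, ANNOTATED_MB)
    estimate = extrapolated_gene_count(ANNOTATED_GENES, ANNOTATED_MB, GENOME_MB)
    est_matches = round(ANNOTATED_GENES * EST_MATCH_FRACTION)

    print(f"Annotated fraction of genome: {ANNOTATED_MB / GENOME_MB:.1%}")
    print(f"Gene density: {density:.1f} genes per Mb")
    print(f"Mean spacing: {spacing:.0f} bp per gene")
    print(f"Genes with an EST match (approx.): {est_matches}")
    print(f"Naive whole-genome estimate: {estimate} genes")
```

Under these assumptions the density comes to a little over 190 genes per Mb, or roughly one gene every 5 kb. The extrapolated total is only a back-of-the-envelope figure, not a project result.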


Last modified: Wednesday, February 28, 2001


Base URL: www.ornl.gov/hgmis

Site sponsored by the U.S. Department of Energy Office of Science, Office of Biological and Environmental Research, Human Genome Program