Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007;8(1):R13.
doi: 10.1186/gb-2007-8-1-r13.

Creating a honey bee consensus gene set

Affiliations

Creating a honey bee consensus gene set

Christine G Elsik et al. Genome Biol. 2007.

Abstract

Background: We wished to produce a single reference gene set for honey bee (Apis mellifera). Our motivation was twofold. First, we wished to obtain an improved set of gene models with increased coverage of known genes, while maintaining gene model quality. Second, we wished to provide a single official gene list that the research community could further utilize for consistent and comparable analyses and functional annotation.

Results: We created a consensus gene set for honey bee (Apis mellifera) using GLEAN, a new algorithm that uses latent class analysis to automatically combine disparate gene prediction evidence in the absence of known genes. The consensus gene models had increased representation of honey bee genes without sacrificing quality compared with any one of the input gene predictions. When compared with manually annotated gold standards, the consensus set of gene models was similar or superior in quality to each of the input sets.

Conclusion: Most eukaryotic genome projects produce multiple gene sets because of the variety of gene prediction programs. Each of the gene prediction programs has strengths and weaknesses, and so the multiplicity of gene sets offers users a more comprehensive collection of genes to use than is available from a single program. On the other hand, the availability of multiple gene sets is also a cause for uncertainty among users as regards which set they should use. GLEAN proved to be an effective method to combine gene lists into a single reference set.

PubMed Disclaimer

Similar articles

Cited by

References

    1. The Honey Bee Genome Sequencing Consortium Insights into social insects from the genome of the honey bee Apis mellifera. Nature. 2006;443:931–949. doi: 10.1038/nature05260. - DOI - PMC - PubMed
    1. Elsik CG, Worley KC, Zhang L, Milshina NV, Jiang H, Reese JT, Childs KL, Venkatraman A, Dickens CM, Weinstock GM, et al. Community annotation: procedures, protocols and supporting tools. Genome Res. 2006;16:1329–1333. doi: 10.1101/gr.5580606. - DOI - PubMed
    1. FlyBase http://flybase.org
    1. Drysdale RA, Crosby MA, FlyBase Consortium FlyBase: genes and gene models. Nucleic Acids Res. 2005;33:D390–D395. doi: 10.1093/nar/gki046. - DOI - PMC - PubMed
    1. BeeBase http://www.beebase.org

Publication types

LinkOut - more resources

-