BBP: Brucella genome annotation with literature mining and curation
- PMID: 16842628
- PMCID: PMC1539029
- DOI: 10.1186/1471-2105-7-347
BBP: Brucella genome annotation with literature mining and curation
Abstract
Background: Brucella species are Gram-negative, facultative intracellular bacteria that cause brucellosis in humans and animals. Sequences of four Brucella genomes have been published, and various Brucella gene and genome data and analysis resources exist. A web gateway to integrate these resources will greatly facilitate Brucella research. Brucella genome data in current databases is largely derived from computational analysis without experimental validation typically found in peer-reviewed publications. It is partially due to the lack of a literature mining and curation system able to efficiently incorporate the large amount of literature data into genome annotation. It is further hypothesized that literature-based Brucella gene annotation would increase understanding of complicated Brucella pathogenesis mechanisms.
Results: The Brucella Bioinformatics Portal (BBP) is developed to integrate existing Brucella genome data and analysis tools with literature mining and curation. The BBP InterBru database and Brucella Genome Browser allow users to search and analyze genes of 4 currently available Brucella genomes and link to more than 20 existing databases and analysis programs. Brucella literature publications in PubMed are extracted and can be searched by a TextPresso-powered natural language processing method, a MeSH browser, a keywords search, and an automatic literature update service. To efficiently annotate Brucella genes using the large amount of literature publications, a literature mining and curation system coined Limix is developed to integrate computational literature mining methods with a PubSearch-powered manual curation and management system. The Limix system is used to quickly find and confirm 107 Brucella gene mutations including 75 genes shown to be essential for Brucella virulence. The 75 genes are further clustered using COG. In addition, 62 Brucella genetic interactions are extracted from literature publications. These results make possible more comprehensive investigation of Brucella pathogenesis. Other BBP features include publication email alert service, Brucella researchers' contact database, and discussion forum.
Conclusion: BBP is a gateway for Brucella researchers to search, analyze, and curate Brucella genome data originated from public databases and literature. Brucella gene mutations and genetic interactions are annotated using Limix leading to better understanding of Brucella pathogenesis.
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/1539029/bin/1471-2105-7-347-1.gif)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/1539029/bin/1471-2105-7-347-2.gif)
![Figure 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/1539029/bin/1471-2105-7-347-3.gif)
![Figure 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/1539029/bin/1471-2105-7-347-4.gif)
![Figure 5](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/1539029/bin/1471-2105-7-347-5.gif)
Similar articles
-
A versatile computational pipeline for bacterial genome annotation improvement and comparative analysis, with Brucella as a use case.Nucleic Acids Res. 2007;35(12):3953-62. doi: 10.1093/nar/gkm377. Epub 2007 Jun 6. Nucleic Acids Res. 2007. PMID: 17553834 Free PMC article.
-
MILANO--custom annotation of microarray results using automatic literature searches.BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12. BMC Bioinformatics. 2005. PMID: 15661078 Free PMC article.
-
Textpresso: an ontology-based information retrieval and extraction system for biological literature.PLoS Biol. 2004 Nov;2(11):e309. doi: 10.1371/journal.pbio.0020309. Epub 2004 Sep 21. PLoS Biol. 2004. PMID: 15383839 Free PMC article.
-
Annotation, comparison and databases for hundreds of bacterial genomes.Res Microbiol. 2007 Dec;158(10):724-36. doi: 10.1016/j.resmic.2007.09.009. Epub 2007 Oct 6. Res Microbiol. 2007. PMID: 18031997 Review.
-
Facts from text: can text mining help to scale-up high-quality manual curation of gene products with ontologies?Brief Bioinform. 2008 Nov;9(6):466-78. doi: 10.1093/bib/bbn043. Epub 2008 Dec 6. Brief Bioinform. 2008. PMID: 19060303 Review.
Cited by
-
Possible biased virulence attenuation in the Senegal strain of Ehrlichia ruminantium by ntrX gene conversion from an inverted segmental duplication.PLoS One. 2023 Feb 17;18(2):e0266234. doi: 10.1371/journal.pone.0266234. eCollection 2023. PLoS One. 2023. PMID: 36800354 Free PMC article.
-
Alignment of vaccine codes using an ontology of vaccine descriptions.J Biomed Semantics. 2022 Oct 18;13(1):24. doi: 10.1186/s13326-022-00278-0. J Biomed Semantics. 2022. PMID: 36258262 Free PMC article.
-
Mining the Flavoproteome of Brucella ovis, the Brucellosis Causing Agent in Ovis aries.Microbiol Spectr. 2022 Apr 27;10(2):e0229421. doi: 10.1128/spectrum.02294-21. Epub 2022 Mar 22. Microbiol Spectr. 2022. PMID: 35315701 Free PMC article.
-
Vaccine Design by Reverse Vaccinology and Machine Learning.Methods Mol Biol. 2022;2414:1-16. doi: 10.1007/978-1-0716-1900-1_1. Methods Mol Biol. 2022. PMID: 34784028
-
Transcriptomic Analysis of the Brucella melitensis Rev.1 Vaccine Strain in an Acidic Environment: Insights Into Virulence Attenuation.Front Microbiol. 2019 Feb 14;10:250. doi: 10.3389/fmicb.2019.00250. eCollection 2019. Front Microbiol. 2019. PMID: 30837973 Free PMC article.
References
-
- Paulsen IT, Seshadri R, Nelson KE, Eisen JA, Heidelberg JF, Read TD, Dodson RJ, Umayam L, Brinkac LM, Beanan MJ, Daugherty SC, Deboy RT, Durkin AS, Kolonay JF, Madupu R, Nelson WC, Ayodeji B, Kraul M, Shetty J, Malek J, Van Aken SE, Riedmuller S, Tettelin H, Gill SR, White O, Salzberg SL, Hoover DL, Lindler LE, Halling SM, Boyle SM, Fraser CM. The Brucella suis genome reveals fundamental similarities between animal and plant pathogens and symbionts. Proc Natl Acad Sci U S A. 2002;99:13148–13153. doi: 10.1073/pnas.192319099. - DOI - PMC - PubMed
-
- Halling SM, Peterson-Burch BD, Bricker BJ, Zuerner RL, Qing Z, Li LL, Kapur V, Alt DP, Olsen SC. Completion of the genome sequence of Brucella abortus and comparison to the highly similar genomes of Brucella melitensis and Brucella suis. J Bacteriol. 2005;187:2715–2726. doi: 10.1128/JB.187.8.2715-2726.2005. - DOI - PMC - PubMed
-
- DelVecchio VG, Kapatral V, Redkar RJ, Patra G, Mujer C, Los T, Ivanova N, Anderson I, Bhattacharyya A, Lykidis A, Reznik G, Jablonski L, Larsen N, D'Souza M, Bernal A, Mazur M, Goltsman E, Selkov E, Elzer PH, Hagius S, O'Callaghan D, Letesson JJ, Haselkorn R, Kyrpides N, Overbeek R. The genome sequence of the facultative intracellular pathogen Brucella melitensis. Proc Natl Acad Sci U S A. 2002;99:443–448. doi: 10.1073/pnas.221575398. - DOI - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources