FOCUS: an alignment-free model to identify organisms in metagenomes using non-negative least squares
- PMID: 24949242
- PMCID: PMC4060023
- DOI: 10.7717/peerj.425
Abstract
One of the major goals in metagenomics is to identify the organisms present in a microbial community from unannotated shotgun sequencing reads. Taxonomic profiling has valuable applications in biological and medical research, including disease diagnostics. Most currently available approaches do not scale well with increasing data volumes, which is important because both the number and lengths of the reads provided by sequencing platforms keep increasing. Here we introduce FOCUS, an agile composition-based approach using non-negative least squares (NNLS) to report the organisms present in metagenomic samples and profile their abundances. FOCUS was tested with simulated and real metagenomes, and the results show that our approach accurately predicts the organisms present in microbial communities. FOCUS was implemented in Python. The source code and web server are freely available at http://edwards.sdsu.edu/FOCUS.
Keywords: Metagenomes; Modeling; k-mer.
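The general idea of composition-based profiling with NNLS can be illustrated with a minimal sketch. It is not the FOCUS source code: the toy sequences, the choice of k = 2, and the helper names `kmer_profile` and `estimate_abundances` are assumptions made for illustration. Each reference organism is represented by its k-mer frequency vector, and the sample's k-mer frequency vector is modeled as a non-negative combination of those reference vectors, solved here with `scipy.optimize.nnls`.

```python
from itertools import product

import numpy as np
from scipy.optimize import nnls


def kmer_profile(seq, k=2):
    """Return the relative frequency of every DNA k-mer in seq (hypothetical helper)."""
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    index = {kmer: i for i, kmer in enumerate(kmers)}
    counts = np.zeros(len(kmers))
    for i in range(len(seq) - k + 1):
        j = index.get(seq[i:i + k])
        if j is not None:  # skip windows containing ambiguous bases such as N
            counts[j] += 1
    total = counts.sum()
    return counts / total if total else counts


def estimate_abundances(reference_profiles, sample_profile):
    """Solve min ||A x - b|| subject to x >= 0, then rescale x to sum to 1."""
    A = np.column_stack(reference_profiles)  # one column per reference organism
    x, residual = nnls(A, sample_profile)
    total = x.sum()
    return (x / total if total else x), residual


# Toy reference "genomes" (made-up sequences, for illustration only).
references = {
    "organism_A": "AACCGGTT" * 50,
    "organism_B": "ACGT" * 100,
    "organism_C": "AAATTTGCGC" * 40,
}
names = list(references)
profiles = [kmer_profile(references[name]) for name in names]

# Toy "metagenome": a 70/30 mixture of organisms A and B, with C absent.
true_mix = np.array([0.7, 0.3, 0.0])
sample_profile = np.column_stack(profiles) @ true_mix

abundances, residual = estimate_abundances(profiles, sample_profile)
for name, value in zip(names, abundances):
    print(f"{name}: {value:.3f}")
```

Because the toy sample is an exact mixture of the reference profiles, the solver recovers the mixing proportions up to numerical tolerance. Real reads add noise and contain organisms missing from the reference set, so estimates on real data would be approximate.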