TagCleaner: Identification and removal of tag sequences from genomic and metagenomic datasets
- PMID: 20573248
- PMCID: PMC2910026
- DOI: 10.1186/1471-2105-11-341
TagCleaner: Identification and removal of tag sequences from genomic and metagenomic datasets
Abstract
Background: Sequencing metagenomes that were pre-amplified with primer-based methods requires the removal of the additional tag sequences from the datasets. The sequenced reads can contain deletions or insertions due to sequencing limitations, and the primer sequence may contain ambiguous bases. Furthermore, the tag sequence may be unavailable or incorrectly reported. Because of the potential for downstream inaccuracies introduced by unwanted sequence contaminations, it is important to use reliable tools for pre-processing sequence data.
Results: TagCleaner is a web application developed to automatically identify and remove known or unknown tag sequences allowing insertions and deletions in the dataset. TagCleaner is designed to filter the trimmed reads for duplicates, short reads, and reads with high rates of ambiguous sequences. An additional screening for and splitting of fragment-to-fragment concatenations that gave rise to artificial concatenated sequences can increase the quality of the dataset. Users may modify the different filter parameters according to their own preferences.
Conclusions: TagCleaner is a publicly available web application that is able to automatically detect and efficiently remove tag sequences from metagenomic datasets. It is easily configurable and provides a user-friendly interface. The interactive web interface facilitates export functionality for subsequent data processing, and is available at http://edwards.sdsu.edu/tagcleaner.
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2910026/bin/1471-2105-11-341-1.gif)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2910026/bin/1471-2105-11-341-2.gif)
![Figure 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2910026/bin/1471-2105-11-341-3.gif)
![Figure 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2910026/bin/1471-2105-11-341-4.gif)
![Figure 5](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2910026/bin/1471-2105-11-341-5.gif)
Similar articles
-
Assessment of metagenomic assemblers based on hybrid reads of real and simulated metagenomic sequences.Brief Bioinform. 2020 May 21;21(3):777-790. doi: 10.1093/bib/bbz025. Brief Bioinform. 2020. PMID: 30860572 Free PMC article. Review.
-
COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.PLoS One. 2015 Nov 11;10(11):e0142102. doi: 10.1371/journal.pone.0142102. eCollection 2015. PLoS One. 2015. PMID: 26561344 Free PMC article.
-
Fast identification and removal of sequence contamination from genomic and metagenomic datasets.PLoS One. 2011 Mar 9;6(3):e17288. doi: 10.1371/journal.pone.0017288. PLoS One. 2011. PMID: 21408061 Free PMC article.
-
Artificial and natural duplicates in pyrosequencing reads of metagenomic data.BMC Bioinformatics. 2010 Apr 13;11:187. doi: 10.1186/1471-2105-11-187. BMC Bioinformatics. 2010. PMID: 20388221 Free PMC article.
-
Genome-scale probe and primer design with PRIMEGENS.Methods Mol Biol. 2007;402:159-76. doi: 10.1007/978-1-59745-528-2_8. Methods Mol Biol. 2007. PMID: 17951795 Review.
Cited by
-
Conserved signatures of the canine faecal microbiome are associated with metronidazole treatment and recovery.Sci Rep. 2024 Mar 4;14(1):5277. doi: 10.1038/s41598-024-51338-7. Sci Rep. 2024. PMID: 38438389 Free PMC article.
-
Association of blood cell-based inflammatory markers with gut microbiota and cancer incidence in the Rotterdam study.Cancer Med. 2024 Feb;13(3):e6860. doi: 10.1002/cam4.6860. Epub 2024 Feb 17. Cancer Med. 2024. PMID: 38366800 Free PMC article.
-
Glaucoma Patients Have a Lower Abundance of Butyrate-Producing Taxa in the Gut.Invest Ophthalmol Vis Sci. 2024 Feb 1;65(2):7. doi: 10.1167/iovs.65.2.7. Invest Ophthalmol Vis Sci. 2024. PMID: 38315494 Free PMC article.
-
Elevated A-to-I RNA editing in COVID-19 infected individuals.NAR Genom Bioinform. 2023 Oct 18;5(4):lqad092. doi: 10.1093/nargab/lqad092. eCollection 2023 Dec. NAR Genom Bioinform. 2023. PMID: 37859800 Free PMC article.
-
Advanced Glycation End Products (AGEs) in Diet and Skin in Relation to Stool Microbiota: The Rotterdam Study.Nutrients. 2023 May 30;15(11):2567. doi: 10.3390/nu15112567. Nutrients. 2023. PMID: 37299529 Free PMC article.
References
-
- Dinsdale EA, Edwards RA, Hall D, Angly F, Breitbart M, Brulc JM, Furlan M, Desnues C, Haynes M, Li L, McDaniel L, Moran MA, Nelson KE, Nilsson C, Olson R, Paul J, Brito BR, Ruan Y, Swan BK, Stevens R, Valentine DL, Thurber RV, Wegley L, White BA, Rohwer F. Functional metagenomic profiling of nine biomes. Nature. 2008;452(7187):629–632. doi: 10.1038/nature06810. - DOI - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources