Global genomic similarity and core genome sequence diversity of the Streptococcus genus as a toolkit to identify closely related bacterial species in complex environments
- PMID: 30656069
- PMCID: PMC6336011
- DOI: 10.7717/peerj.6233
Abstract
Background: The Streptococcus genus is relevant to both public health and food safety because of its ability to cause infections. It is well represented (>100 genomes) in publicly available databases. Streptococci are ubiquitous and have been isolated from many sources, ranging from human infections to dairy products. The Streptococcus genus has traditionally been classified by morphology, serum types, the 16S ribosomal RNA (rRNA) gene, and multi-locus sequence types subject to in-depth comparative genomic analysis.
Methods: Core and pan-genomes were used to describe the genomic diversity of 108 strains belonging to 16 Streptococcus species. The core genome nucleotide diversity was calculated and compared to phylogenomic distances within the genus Streptococcus. The core genome was also used as a resource to recruit metagenomic fragment reads from streptococci-dominated environments. A conventional 16S rRNA gene phylogeny reconstruction was used as a reference against which the average nucleotide identity (ANI) and genome similarity score (GSS) dendrograms were compared.
Results: The core genome, in this work, consists of 404 proteins that are shared by all 108 Streptococcus genomes. Across species, the average identity of the pairwise compared core proteins decreases in proportion to decreasing GSS. The GSS dendrogram recovers most of the clades in the 16S rRNA gene phylogeny while also resolving 16S polytomies (unresolved nodes). The GSS is a distance metric that can reflect evolutionary history by comparing orthologous proteins. Additionally, GSS proved to be the most useful metric for genus and species comparisons, where ANI metrics failed due to false positives when comparing different species.
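As an illustration of the core and pan-genome concepts used above, the following is a minimal sketch, not the authors' pipeline. It assumes that every protein has already been assigned to an ortholog family by some upstream clustering step; the function name `core_and_pan_genome`, the strain labels, and the family IDs are all hypothetical.

```python
from typing import Dict, Set, Tuple


def core_and_pan_genome(
    families_by_genome: Dict[str, Set[str]]
) -> Tuple[Set[str], Set[str]]:
    """Return (core, pan) sets of ortholog family IDs.

    core: families present in every genome (set intersection).
    pan:  families present in at least one genome (set union).
    """
    if not families_by_genome:
        return set(), set()
    family_sets = list(families_by_genome.values())
    core = set.intersection(*family_sets)
    pan = set.union(*family_sets)
    return core, pan


# Toy input (hypothetical): three strains, five ortholog families in total.
toy = {
    "strain_A": {"fam1", "fam2", "fam3"},
    "strain_B": {"fam1", "fam2", "fam4"},
    "strain_C": {"fam1", "fam2", "fam5"},
}

core, pan = core_and_pan_genome(toy)
print(sorted(core))  # ['fam1', 'fam2']
print(len(pan))      # 5
```

In the study, the same idea applied to 108 genomes yields the 404 shared proteins reported as the core genome; how orthologs were called is not described in this abstract.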
Discussion: Understanding genomic variability and species relatedness is the goal of tools like GSS, which uses the maximum pairwise shared orthologous sequences for its calculation. Because it relies on amino acid alignment scores rather than nucleotides and normalizes by positive matches, it can accommodate long evolutionary distances (above the species level). Newly sequenced species and strains could be easily placed into GSS dendrograms to infer overall genomic relatedness. The GSS is not restricted to ubiquitously conserved gene features; thus, it reflects the mosaic structure and dynamism of gene acquisition and loss in bacterial genomes.
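The idea of a normalized, ortholog-based similarity score can be sketched as follows. This is a hedged illustration only: the abstract does not give the GSS formula, so the normalization used here (summed ortholog alignment scores divided by the mean of the two genomes' summed self-alignment scores over the shared families) is an assumption, and the function name `similarity_score` and all scores are hypothetical.

```python
from typing import Dict


def similarity_score(
    pair_scores: Dict[str, float],
    self_scores_a: Dict[str, float],
    self_scores_b: Dict[str, float],
) -> float:
    """Assumed normalization, not necessarily the published GSS.

    pair_scores:   alignment score between the A and B proteins of each
                   ortholog family the two genomes share.
    self_scores_*: score of each protein aligned against itself,
                   keyed by the same family IDs.
    """
    # Only families with a pairwise score and both self-scores are used.
    shared = pair_scores.keys() & self_scores_a.keys() & self_scores_b.keys()
    if not shared:
        return 0.0
    numerator = sum(pair_scores[f] for f in shared)
    total_a = sum(self_scores_a[f] for f in shared)
    total_b = sum(self_scores_b[f] for f in shared)
    denominator = 0.5 * (total_a + total_b)
    return numerator / denominator if denominator > 0 else 0.0


# Toy scores (hypothetical): two shared families; fam4 exists only in B.
pair = {"fam1": 180.0, "fam2": 90.0}
self_a = {"fam1": 200.0, "fam2": 100.0}
self_b = {"fam1": 200.0, "fam2": 100.0, "fam4": 50.0}

score = similarity_score(pair, self_a, self_b)
distance = 1.0 - score  # a dendrogram would be built from such distances
print(round(score, 3), round(distance, 3))
```

Because the score is computed per genome pair over whatever orthologs that pair shares, it does not require a gene to be present in every genome, which matches the point above about not being restricted to ubiquitously conserved features.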
Keywords: Comparative genomics; Core genome; Genomic similarity score; Streptococcus.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/6336011/bin/peerj-07-6233-g001.gif)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/6336011/bin/peerj-07-6233-g002.gif)