INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes
- PMID: 36159887
- PMCID: PMC9041510
- DOI: 10.1089/phage.2021.0007
INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes
Abstract
Background: With advances in sequencing technology and decreasing costs, the number of phage genomes that have been sequenced has increased markedly in the past decade. Materials and Methods: We developed an automated retrieval and analysis system for phage genomes (https://github.com/RyanCook94/inphared) to produce the INfrastructure for a PHAge REference Database (INPHARED) of phage genomes and associated metadata. Results: As of January 2021, 14,244 complete phage genomes have been sequenced. The INPHARED data set is dominated by phages that infect a small number of bacterial genera, with 75% of phages isolated on only 30 bacterial genera. There is further bias, with significantly more lytic phage genomes (∼70%) than temperate (∼30%) within our database. Collectively, this results in ∼54% of temperate phage genomes originating from just three host genera. With much debate on the carriage of antibiotic resistance genes and their potential safety in phage therapy, we searched for putative antibiotic resistance genes. Frequency of antibiotic resistance gene carriage was found to be higher in temperate phages than in lytic phages and again varied with host. Conclusions: Given the bias of currently sequenced phage genomes, we suggest to fully understand phage diversity, efforts should be made to isolate and sequence a larger number of phages, in particular temperate phages, from a greater diversity of hosts.
Keywords: antibiotic resistance genes; jumbo phages; phage genomes; virulence genes.
Copyright 2021, Mary Ann Liebert, Inc., publishers.
Conflict of interest statement
No competing financial interests exist.
Figures
![FIG. 1.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/9041510/bin/phage.2021.0007_figure1.gif)
![FIG. 2.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/9041510/bin/phage.2021.0007_figure2.gif)
![FIG. 3.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/9041510/bin/phage.2021.0007_figure3.gif)
![FIG. 4.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/9041510/bin/phage.2021.0007_figure4.gif)
Similar articles
-
Genetic characteristics and integration specificity of Salmonella enterica temperate phages.Front Microbiol. 2023 Aug 1;14:1199843. doi: 10.3389/fmicb.2023.1199843. eCollection 2023. Front Microbiol. 2023. PMID: 37593543 Free PMC article.
-
The Role of Temperate Phages in Bacterial Pathogenicity.Microorganisms. 2023 Feb 21;11(3):541. doi: 10.3390/microorganisms11030541. Microorganisms. 2023. PMID: 36985115 Free PMC article. Review.
-
Mining bacterial NGS data vastly expands the complete genomes of temperate phages.NAR Genom Bioinform. 2022 Aug 3;4(3):lqac057. doi: 10.1093/nargab/lqac057. eCollection 2022 Sep. NAR Genom Bioinform. 2022. PMID: 35937545 Free PMC article.
-
The Isolation and Genome Sequencing of Five Novel Bacteriophages From the Rumen Active Against Butyrivibrio fibrisolvens.Front Microbiol. 2020 Jul 14;11:1588. doi: 10.3389/fmicb.2020.01588. eCollection 2020. Front Microbiol. 2020. PMID: 32760371 Free PMC article.
-
Enterococcal Bacteriophages and Genome Defense.2014 Feb 11. In: Gilmore MS, Clewell DB, Ike Y, Shankar N, editors. Enterococci: From Commensals to Leading Causes of Drug Resistant Infection [Internet]. Boston: Massachusetts Eye and Ear Infirmary; 2014–. 2014 Feb 11. In: Gilmore MS, Clewell DB, Ike Y, Shankar N, editors. Enterococci: From Commensals to Leading Causes of Drug Resistant Infection [Internet]. Boston: Massachusetts Eye and Ear Infirmary; 2014–. PMID: 24649501 Free Books & Documents. Review.
Cited by
-
Genomic and taxonomic evaluation of 38 Treponema prophage sequences.BMC Genomics. 2024 Jun 1;25(1):549. doi: 10.1186/s12864-024-10461-5. BMC Genomics. 2024. PMID: 38824509 Free PMC article.
-
Nanopore and Illumina sequencing reveal different viral populations from human gut samples.Microb Genom. 2024 Apr;10(4):001236. doi: 10.1099/mgen.0.001236. Microb Genom. 2024. PMID: 38683195 Free PMC article.
-
Large-scale genomic survey with deep learning-based method reveals strain-level phage specificity determinants.Gigascience. 2024 Jan 2;13:giae017. doi: 10.1093/gigascience/giae017. Gigascience. 2024. PMID: 38649301 Free PMC article.
-
Discovery and description of novel phage genomes from urban microbiomes sampled by the MetaSUB consortium.Sci Rep. 2024 Apr 4;14(1):7913. doi: 10.1038/s41598-024-58226-0. Sci Rep. 2024. PMID: 38575625 Free PMC article.
-
A microbial knowledge graph-based deep learning model for predicting candidate microbes for target hosts.Brief Bioinform. 2024 Mar 27;25(3):bbae119. doi: 10.1093/bib/bbae119. Brief Bioinform. 2024. PMID: 38555472 Free PMC article.
References
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous