Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
- PMID: 31483244
- PMCID: PMC6807382
- DOI: 10.1099/mgen.0.000294
Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
Abstract
Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (hybrid assembly). However, it is not clear how different long-read sequencing methods affect hybrid assembly accuracy. Relative automation of the assembly process is also crucial to facilitating high-throughput complete bacterial genome reconstruction, avoiding multiple bespoke filtering and data manipulation steps. In this study, we compared hybrid assemblies for 20 bacterial isolates, including two reference strains, using Illumina sequencing and long reads from either Oxford Nanopore Technologies (ONT) or SMRT Pacific Biosciences (PacBio) sequencing platforms. We chose isolates from the family Enterobacteriaceae, as these frequently have highly plastic, repetitive genetic structures, and complete genome reconstruction for these species is relevant for a precise understanding of the epidemiology of antimicrobial resistance. We de novo assembled genomes using the hybrid assembler Unicycler and compared different read processing strategies, as well as comparing to long-read-only assembly with Flye followed by short-read polishing with Pilon. Hybrid assembly with either PacBio or ONT reads facilitated high-quality genome reconstruction, and was superior to the long-read assembly and polishing approach evaluated with respect to accuracy and completeness. Combining ONT and Illumina reads fully resolved most genomes without additional manual steps, and at a lower consumables cost per isolate in our setting. Automated hybrid assembly is a powerful tool for complete and accurate bacterial genome assembly.
Keywords: Enterobacteriaceae; bacterial genomics; hybrid assembly; long-read sequencing; plasmid assembly.
Conflict of interest statement
The authors declare that there are no conflicts of interest.
Figures
Similar articles
-
Advancements in long-read genome sequencing technologies and algorithms.Genomics. 2024 May;116(3):110842. doi: 10.1016/j.ygeno.2024.110842. Epub 2024 Apr 11. Genomics. 2024. PMID: 38608738 Review.
-
Comparison of De Novo Assembly Strategies for Bacterial Genomes.Int J Mol Sci. 2021 Jul 17;22(14):7668. doi: 10.3390/ijms22147668. Int J Mol Sci. 2021. PMID: 34299288 Free PMC article.
-
Polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads to improve genomic analyses.Genomics. 2021 May;113(3):1366-1377. doi: 10.1016/j.ygeno.2021.03.018. Epub 2021 Mar 11. Genomics. 2021. PMID: 33716184
-
Completion of draft bacterial genomes by long-read sequencing of synthetic genomic pools.BMC Genomics. 2020 Jul 29;21(1):519. doi: 10.1186/s12864-020-06910-6. BMC Genomics. 2020. PMID: 32727443 Free PMC article.
-
Oxford Nanopore MinION Sequencing and Genome Assembly.Genomics Proteomics Bioinformatics. 2016 Oct;14(5):265-279. doi: 10.1016/j.gpb.2016.05.004. Epub 2016 Sep 17. Genomics Proteomics Bioinformatics. 2016. PMID: 27646134 Free PMC article. Review.
Cited by
-
Biosynthetic gene clusters with biotechnological applications in novel Antarctic isolates from Actinomycetota.Appl Microbiol Biotechnol. 2024 May 8;108(1):325. doi: 10.1007/s00253-024-13154-x. Appl Microbiol Biotechnol. 2024. PMID: 38717668 Free PMC article.
-
PlasCAT: Plasmid Cloud Assembly Tool.Bioinformatics. 2024 May 2;40(5):btae299. doi: 10.1093/bioinformatics/btae299. Bioinformatics. 2024. PMID: 38696761 Free PMC article.
-
Sequencing Strategy to Ensure Accurate Plasmid Assembly.bioRxiv [Preprint]. 2024 Jun 10:2024.03.25.586694. doi: 10.1101/2024.03.25.586694. bioRxiv. 2024. PMID: 38585828 Free PMC article. Preprint.
-
Comparative whole genome analysis of face-derived Streptococcus infantis CX-4 unravels the functions related to skin barrier.Genes Genomics. 2024 Apr;46(4):499-510. doi: 10.1007/s13258-024-01495-w. Epub 2024 Mar 7. Genes Genomics. 2024. PMID: 38453815
-
The Genome of Arsenophonus sp. and Its Potential Contribution in the Corn Planthopper, Peregrinus maidis.Insects. 2024 Feb 5;15(2):113. doi: 10.3390/insects15020113. Insects. 2024. PMID: 38392531 Free PMC article.
References
-
- George S, Pankhurst L, Hubbard A, Votintseva A, Stoesser N, et al. Resolving plasmid structures in Enterobacteriaceae using the MinION nanopore sequencer: assessment of MinION and MinION/Illumina hybrid data assembly approaches. Microb Genom. 2017;3:e000118. doi: 10.1099/mgen.0.000118. - DOI - PMC - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources