Benchmarking Long-Read Assemblers for Genomic Analyses of Bacterial Pathogens Using Oxford Nanopore Sequencing
- PMID: 33271875
- PMCID: PMC7730629
- DOI: 10.3390/ijms21239161
Benchmarking Long-Read Assemblers for Genomic Analyses of Bacterial Pathogens Using Oxford Nanopore Sequencing
Abstract
Oxford Nanopore sequencing can be used to achieve complete bacterial genomes. However, the error rates of Oxford Nanopore long reads are greater compared to Illumina short reads. Long-read assemblers using a variety of assembly algorithms have been developed to overcome this deficiency, which have not been benchmarked for genomic analyses of bacterial pathogens using Oxford Nanopore long reads. In this study, long-read assemblers, namely Canu, Flye, Miniasm/Racon, Raven, Redbean, and Shasta, were thus benchmarked using Oxford Nanopore long reads of bacterial pathogens. Ten species were tested for mediocre- and low-quality simulated reads, and 10 species were tested for real reads. Raven was the most robust assembler, obtaining complete and accurate genomes. All Miniasm/Racon and Raven assemblies of mediocre-quality reads provided accurate antimicrobial resistance (AMR) profiles, while the Raven assembly of Klebsiella variicola with low-quality reads was the only assembly with an accurate AMR profile among all assemblers and species. All assemblers functioned well for predicting virulence genes using mediocre-quality and real reads, whereas only the Raven assemblies of low-quality reads had accurate numbers of virulence genes. Regarding multilocus sequence typing (MLST), Miniasm/Racon was the most effective assembler for mediocre-quality reads, while only the Raven assemblies of Escherichia coli O157:H7 and K. variicola with low-quality reads showed positive MLST results. Miniasm/Racon and Raven were the best performers for MLST using real reads. The Miniasm/Racon and Raven assemblies showed accurate phylogenetic inference. For the pan-genome analyses, Raven was the strongest assembler for simulated reads, whereas Miniasm/Racon and Raven performed the best for real reads. Overall, the most robust and accurate assembler was Raven, closely followed by Miniasm/Racon.
Keywords: Oxford Nanopore sequencing; bacterial pathogen; benchmarking; genome assembly; genomic analysis; long-read assembler; long-read sequencing; whole-genome sequencing.
Conflict of interest statement
The authors declare no conflict of interest. The funder had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7730629/bin/ijms-21-09161-g001.gif)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7730629/bin/ijms-21-09161-g002.gif)
![Figure 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7730629/bin/ijms-21-09161-g003.gif)
![Figure 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7730629/bin/ijms-21-09161-g004.gif)
![Figure 5](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7730629/bin/ijms-21-09161-g005.gif)
![Figure 6](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7730629/bin/ijms-21-09161-g006.gif)
![Figure 7](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7730629/bin/ijms-21-09161-g007.gif)
![Figure 8](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7730629/bin/ijms-21-09161-g008.gif)
Similar articles
-
Perspectives and Benefits of High-Throughput Long-Read Sequencing in Microbial Ecology.Appl Environ Microbiol. 2021 Aug 11;87(17):e0062621. doi: 10.1128/AEM.00626-21. Epub 2021 Aug 11. Appl Environ Microbiol. 2021. PMID: 34132589 Free PMC article. Review.
-
Polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads to improve genomic analyses.Genomics. 2021 May;113(3):1366-1377. doi: 10.1016/j.ygeno.2021.03.018. Epub 2021 Mar 11. Genomics. 2021. PMID: 33716184
-
Benchmarking hybrid assembly approaches for genomic analyses of bacterial pathogens using Illumina and Oxford Nanopore sequencing.BMC Genomics. 2020 Sep 14;21(1):631. doi: 10.1186/s12864-020-07041-8. BMC Genomics. 2020. PMID: 32928108 Free PMC article.
-
Benchmarking of long-read assemblers for prokaryote whole genome sequencing.F1000Res. 2019 Dec 23;8:2138. doi: 10.12688/f1000research.21782.4. eCollection 2019. F1000Res. 2019. PMID: 31984131 Free PMC article.
-
Nanopore sequencing technology and tools for genome assembly: computational analysis of the current state, bottlenecks and future directions.Brief Bioinform. 2019 Jul 19;20(4):1542-1559. doi: 10.1093/bib/bby017. Brief Bioinform. 2019. PMID: 29617724 Free PMC article. Review.
Cited by
-
Integrating multi-platform assembly to recover MAGs from hot spring biofilms: insights into microbial diversity, biofilm formation, and carbohydrate degradation.Environ Microbiome. 2024 May 6;19(1):29. doi: 10.1186/s40793-024-00572-7. Environ Microbiome. 2024. PMID: 38706006 Free PMC article.
-
Three Rounds of Read Correction Significantly Improve Eukaryotic Protein Detection in ONT Reads.Microorganisms. 2024 Jan 24;12(2):247. doi: 10.3390/microorganisms12020247. Microorganisms. 2024. PMID: 38399651 Free PMC article.
-
Whole-genome sequencing and evolutionary analysis of the wild edible mushroom, Morchella eohespera.Front Microbiol. 2024 Feb 1;14:1309703. doi: 10.3389/fmicb.2023.1309703. eCollection 2023. Front Microbiol. 2024. PMID: 38361578 Free PMC article.
-
Evaluating long-read de novo assembly tools for eukaryotic genomes: insights and considerations.Gigascience. 2022 Dec 28;12:giad100. doi: 10.1093/gigascience/giad100. Epub 2023 Nov 24. Gigascience. 2022. PMID: 38000912 Free PMC article.
-
Chromosome-level, nanopore-only genome and allele-specific DNA methylation of Pallas's cat, Otocolobus manul.NAR Genom Bioinform. 2023 Apr 4;5(2):lqad033. doi: 10.1093/nargab/lqad033. eCollection 2023 Jun. NAR Genom Bioinform. 2023. PMID: 37025970 Free PMC article.
References
-
- De Maio N., Shaw L.P., Hubbard A., George S., Sanderson N.D., Swann J., Wick R., AbuOun M., Stubberfield E., Hoosdally S.J., et al. Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes. Microb. Genom. 2019;5:e000294. doi: 10.1099/mgen.0.000294. - DOI - PMC - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources