Nanopore sequencing and full genome de novo assembly of human cytomegalovirus TB40/E reveals clonal diversity and structural variations
- PMID: 30068288
- PMCID: PMC6090854
- DOI: 10.1186/s12864-018-4949-6
Abstract
Background: Human cytomegalovirus (HCMV) has a double-stranded DNA genome of approximately 235 kbp that is structurally complex and includes extended GC-rich repeated regions. Genomic recombination events are frequent in HCMV cultures and have also been observed in vivo. The assembly of whole HCMV genomes from technologies that produce reads shorter than 500 bp is therefore technically challenging. Here we improved the reconstruction of full HCMV genomes by applying a hybrid, de novo genome-assembly bioinformatics pipeline to data generated by the recently released MinION MkI B sequencer from Oxford Nanopore Technologies.
Results: The MinION run of the HCMV (strain TB40/E) library yielded ~47,000 reads from a single R9 flowcell and ~100× average read depth across the virus genome. We developed a novel, self-correcting bioinformatics algorithm to assemble the pooled HCMV genomes in three stages. In the first stage, long contigs (N50 = 21,892) of lower accuracy were reconstructed. In the second stage, short contigs (N50 = 5,686) of higher accuracy were assembled. In the final stage, the high-quality short contigs served as templates for correcting the longer contigs, resulting in a high-accuracy, full genome assembly (N50 = 41,056). We were able to reconstruct a single representative haplotype without employing any scaffolding steps. The majority (98.8%) of the genomic features from the reference strain were accurately annotated on this full genome construct. Our method also allowed the detection of multiple alternative sub-genomic fragments and non-canonical structures, suggesting rearrangement events between the unique (UL/US) and the repeated (T/IRL/S) genomic regions.
Conclusions: Third-generation high-throughput sequencing technologies can accurately reconstruct full-length HCMV genomes, including their low-complexity and highly repetitive regions. Full-length HCMV genomes could prove crucial in understanding the genetic determinants and viral evolution underpinning drug resistance, virulence and pathogenesis.
Keywords: Human cytomegalovirus; MinION; Mutation; Nanopore; Quasi-species; Recombination; Variable number tandem repeats; de novo assembly.
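The Results summarise each assembly stage by its N50 and the run by its average read depth. As a reminder of what those two statistics measure, here is a minimal Python sketch. It is not the authors' pipeline: the function names `n50` and `mean_depth` and the toy contig lengths are assumptions made for illustration, and only the ~235 kbp genome size and ~100× depth come from the abstract.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths (in bp).

    N50 is the length L such that contigs of length >= L together
    contain at least half of all assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("n50() needs at least one contig length")


def mean_depth(mapped_bases, genome_length):
    """Average read depth: mapped bases divided by genome length."""
    return mapped_bases / genome_length


# Hypothetical contig lengths (bp), NOT data from the paper.
toy_contigs = [80_000, 60_000, 40_000, 30_000, 25_000]
print(n50(toy_contigs))

# With a genome of about 235 kbp, a depth of about 100x corresponds
# to roughly 23.5 Mbp of bases mapped to the virus genome.
print(mean_depth(23_500_000, 235_000))
```

A larger N50 means that fewer, longer contigs account for half of the assembly, which is why the rise from the first-stage to the final-stage N50 reported above indicates a more contiguous assembly.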
Conflict of interest statement
The authors declare that they have no competing interests.
Figures
![Fig. 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/6090854/bin/12864_2018_4949_Fig1_HTML.gif)
![Fig. 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/6090854/bin/12864_2018_4949_Fig2_HTML.gif)
![Fig. 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/6090854/bin/12864_2018_4949_Fig3_HTML.gif)
![Fig. 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/6090854/bin/12864_2018_4949_Fig4_HTML.gif)
![Fig. 5](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/6090854/bin/12864_2018_4949_Fig5_HTML.gif)
![Fig. 6](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/6090854/bin/12864_2018_4949_Fig6_HTML.gif)
Similar articles
- Nanopore sequencing technology, bioinformatics and applications. Nat Biotechnol. 2021 Nov;39(11):1348-1365. doi: 10.1038/s41587-021-01108-x. Epub 2021 Nov 8. PMID: 34750572. Free PMC article. Review.
- de novo assembly and population genomic survey of natural yeast isolates with the Oxford Nanopore MinION sequencer. Gigascience. 2017 Feb 1;6(2):1-13. doi: 10.1093/gigascience/giw018. PMID: 28369459. Free PMC article.
- Oxford Nanopore MinION Sequencing and Genome Assembly. Genomics Proteomics Bioinformatics. 2016 Oct;14(5):265-279. doi: 10.1016/j.gpb.2016.05.004. Epub 2016 Sep 17. PMID: 27646134. Free PMC article. Review.
- Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches. BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8. PMID: 27556636. Free PMC article.
- De Novo Assembly of Human Herpes Virus Type 1 (HHV-1) Genome, Mining of Non-Canonical Structures and Detection of Novel Drug-Resistance Mutations Using Short- and Long-Read Next Generation Sequencing Technologies. PLoS One. 2016 Jun 16;11(6):e0157600. doi: 10.1371/journal.pone.0157600. eCollection 2016. PMID: 27309375. Free PMC article.
Cited by
- Direct Nanopore Sequencing of Human Cytomegalovirus Genomes from High-Viral-Load Clinical Samples. Viruses. 2023 May 26;15(6):1248. doi: 10.3390/v15061248. PMID: 37376548. Free PMC article.
- Strain-Dependent Restriction of Human Cytomegalovirus by Zinc Finger Antiviral Proteins. J Virol. 2023 Mar 30;97(3):e0184622. doi: 10.1128/jvi.01846-22. Epub 2023 Mar 14. PMID: 36916924. Free PMC article.
- Evaluation of tangential flow filtration coupled to long-read sequencing for ostreid herpesvirus type 1 genome assembly. Microb Genom. 2022 Nov;8(11):mgen000895. doi: 10.1099/mgen.0.000895. PMID: 36355418. Free PMC article.
- Targeted Virome Sequencing Enhances Unbiased Detection and Genome Assembly of Known and Emerging Viruses-The Example of SARS-CoV-2. Viruses. 2022 Jun 11;14(6):1272. doi: 10.3390/v14061272. PMID: 35746743. Free PMC article.
- Viral gene drive in herpesviruses. Nat Commun. 2020 Sep 28;11(1):4884. doi: 10.1038/s41467-020-18678-0. PMID: 32985507. Free PMC article.