Skip to main content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Microbiol Resour Announc. 2020 Feb; 9(6): e01439-19.
Published online 2020 Feb 6. doi: 10.1128/MRA.01439-19
PMCID: PMC7005117
PMID: 32029569

Five Complete Genome Sequences Spanning the Dutch Streptococcus suis Serotype 2 and Serotype 9 Populations

Julie C. Dunning Hotopp, Editor
Julie C. Dunning Hotopp, University of Maryland School of Medicine;

Associated Data

Data Availability Statement

The zoonotic pathogen Streptococcus suis can cause septicemia and meningitis in humans. We report five complete genomes of Streptococcus suis serotype 2 and serotype 9, covering the complete phylogeny of serotype 9 Dutch porcine isolates and zoonotic isolates. The isolates include the model strain S10 and the Dutch emerging zoonotic lineage.

ABSTRACT

The zoonotic pathogen Streptococcus suis can cause septicemia and meningitis in humans. We report five complete genomes of Streptococcus suis serotype 2 and serotype 9, covering the complete phylogeny of serotype 9 Dutch porcine isolates and zoonotic isolates. The isolates include the model strain S10 and the Dutch emerging zoonotic lineage.

ANNOUNCEMENT

Streptococcus suis is an opportunistic pathogen in pigs which can cause zoonotic infections. Human infections are predominantly caused by S. suis serotype 2 (1) and can lead to septicemia and meningitis (2). We recently identified a zoonotic S. suis serotype 2 clone belonging to clonal complex 20 (CC20), which emerged from a nonzoonotic serotype 9 CC16 clone (3) in the Netherlands. To facilitate further research on the zoonotic potential of S. suis, we sequenced the genomes of S. suis serotype 9 CC16 and CC20 strains, isolated from diseased pigs, and three serotype 2 strains, including strain S10 (CC1, pig) and two CC20 strains, one each from human and porcine infections (Table 1). Data were generated using Illumina and Nanopore MinION sequencing technologies.

TABLE 1

Isolate details, genome information, and accession numbers

IsolateIsolation sourceSerotypeClonal complexGenome length (bp)GC content (%)No. of total CDSsaNanopore read N50 (bp)No. of Nanopore readsNanopore coverage (×)Nanopore run accession no.Illumina run accession no.Assembly accession no.
861160Human CSFb2202,148,82441.102,02913,5897,55723ERR3664732ERR1055554GCA_902702745
GD-0001Diseased pig2202,125,46841.242,01425,83110,49054ERR3664733ERR1055586GCA_902702785
9401240Diseased pig9202,195,21541.432,03611,19230,41860ERR3664735ERR1055578GCA_902702775
GD-0088Diseased pig9162,298,01241.202,2137,65715,32127ERR3664734ERR1055627GCA_902702765
S10Diseased pig212,048,27541.321,95215,20820,25172ERR3664731ERR1055646GCA_902702755
aCDSs, coding sequences.
bCSF, cerebrospinal fluid.

S. suis was grown overnight in Todd-Hewitt broth supplemented with yeast extract (THY), and genomic DNA was isolated using the Qiagen MagAttract high-molecular-weight (HMW) DNA extraction kit. The sequence library was constructed using the native barcoding (catalog number EXP-NBD114) and ligation sequencing (catalog number SQK-LSK109) kits (Oxford Nanopore). DNA was repaired and A tailed using NEBNext formalin-fixed, paraffin-embedded (FFPE) DNA repair mix and the NEBNext Ultra II end repair/dA-tailing module (New England BioLabs). A barcode was ligated to the A-tailed DNA using blunt/TA ligase master mix (New England Biolabs). Sequence adapters were ligated to barcoded samples pooled by equal mass with Quick T4 DNA ligase (New England BioLabs). The library was loaded on the flow cell (FLO-MIN106D [R9]) and sequenced using MinKNOW fast base calling version 3.5.5. Default parameters were used for all tools except where noted otherwise. Illumina data were available from our previous study (Table 1) (3).

Illumina read filtering was performed using fastp version 0.20.0 (4). MinION reads were filtered for quality and length using Filtlong version 0.2.0 (5), using the filtered Illumina reads as reference. FastQC version 0.11.8 was used for quality control (6). Illumina and MinION reads were used in a hybrid assembly using Unicycler version 0.4.8, which also performs assembly trimming, circularizing, and rotating (7). Assembly statistics were collected using Quast version 4.6.3 (8). Coverage was assessed using Minimap2 version 2.17 (9), SAMtools version 1.9 (10), and BEDTools version 2.29.0 (11). The complete genomes were annotated using Prokka version 1.14.0 (12). Multilocus sequence typing (MLST) was performed using mlst version 2.17.6 (13). For workflow management, Snakemake version 5.7.1 (14) was used. The pipeline is freely available from https://github.com/boasvdp/MRA_Streptococcus_suis.

Genomes of all five strains consisted of a single chromosome ranging from 2,042,889 to 2,292,626 bp with a GC content of 41.10 to 41.43% and a coverage of 23 to 72×, determined using Nanopore data (Table 1).

Draft assemblies of the five strains were 46 to 74 kbp smaller than the complete genomes. Mapping the draft genomes to the complete genomes revealed no missing regions in the draft genomes. The draft genomes are likely smaller than the complete genomes due to the collapse of repeats, which has been described before (15).

Data availability.

Nanopore, fastq, and fast5 data, as well as the assembled genome sequences, have been deposited in ENA under the accession numbers listed in Table 1 and study number PRJEB35407.

ACKNOWLEDGMENTS

This study was funded through EU-Horizon2020 grant 727966 (PIGSs) and an Amsterdam UMC Ph.D. grant. The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

REFERENCES

1. Huong VTL, Ha N, Huy NT, Horby P, Nghia HDT, Thiem VD, Zhu X, Hoa NT, Hien TT, Zamora J, Schultsz C, Wertheim HFL, Hirayama K. 2014. Epidemiology, clinical manifestations, and outcomes of Streptococcus suis infection in humans. Emerg Infect Dis 20:1105–1114. doi: 10.3201/eid2007.131594. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
2. Wertheim HFL, Nghia HDT, Taylor W, Schultsz C. 2009. Streptococcus suis: an emerging human pathogen. Clin Infect Dis 48:617–625. doi: 10.1086/596763. [PubMed] [CrossRef] [Google Scholar]
3. Willemse N, Howell KJ, Weinert LA, Heuvelink A, Pannekoek Y, Wagenaar JA, Smith HE, van der Ende A, Schultsz C. 2016. An emerging zoonotic clone in the Netherlands provides clues to virulence and zoonotic potential of Streptococcus suis. Sci Rep 6:28984. doi: 10.1038/srep28984. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
4. Chen S, Zhou Y, Chen Y, Gu J. 2018. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34:i884–i890. doi: 10.1093/bioinformatics/bty560. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
5. Wick RR. 2019. Filtlong. https://github.com/rrwick/Filtlong.
6. Andrew S. 2010. FastQC: a quality control tool for high throughput sequence data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc.
7. Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
8. Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
9. Li H. 2018. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34:3094–3100. doi: 10.1093/bioinformatics/bty191. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
10. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup . 2009. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25:2078–2079. doi: 10.1093/bioinformatics/btp352. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
11. Quinlan AR, Hall IM. 2010. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26:841–842. doi: 10.1093/bioinformatics/btq033. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
12. Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. doi: 10.1093/bioinformatics/btu153. [PubMed] [CrossRef] [Google Scholar]
13. Seemann T. 2019. mlst. https://github.com/tseemann/mlst.
14. Köster J, Rahmann S. 2012. Snakemake—a scalable bioinformatics workflow engine. Bioinformatics 28:2520–2522. doi: 10.1093/bioinformatics/bts480. [PubMed] [CrossRef] [Google Scholar]
15. Treangen TJ, Salzberg SL. 2011. Repetitive DNA and next-generation sequencing: computational challenges and solutions. Nat Rev Genet 13:36–46. doi: 10.1038/nrg3117. [PMC free article] [PubMed] [CrossRef] [Google Scholar]

Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

-