Skip to main content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Stand Genomic Sci. 2017; 12: 31.
Published online 2017 May 8. doi: 10.1186/s40793-017-0242-6
PMCID: PMC5422911
PMID: 28491240

High quality permanent draft genome sequence of Chryseobacterium bovis DSM 19482T, isolated from raw cow milk

Abstract

Chryseobacterium bovis DSM 19482T (Hantsis-Zacharov et al., Int J Syst Evol Microbiol 58:1024-1028, 2008) is a Gram-negative, rod shaped, non-motile, facultative anaerobe, chemoorganotroph bacterium. C. bovis is a member of the Flavobacteriaceae, a family within the phylum Bacteroidetes. It was isolated when psychrotolerant bacterial communities in raw milk and their proteolytic and lipolytic traits were studied. Here we describe the features of this organism, together with the draft genome sequence and annotation. The DNA G + C content is 38.19%. The chromosome length is 3,346,045 bp. It encodes 3236 proteins and 105 RNA genes. The C. bovis genome is part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes study.

Keywords: Flavobacteriaceae, Psychrotolerant, Proteolysis, Lipolysis, Beta-carotene

Introduction

Chryseobacterium bovis DSM 19482 T (=LMG 24227 T; CIP 110170 T), was isolated by Hantsis-Zacharov and Halpern [1] from raw cow milk when psychrotolerant bacterial communities in raw milk, and their proteolytic and lipolytic traits, were studied. This study revealed that 5% out of the culturable psychrotolerant bacterial communities belonged to the genus Chryseobacterium. Chryseobacterium bovis proliferates at low temperatures and produce heat-stable proteolytic and lipolytic enzymes which remain active after the milk pasteurization process. This may be a limiting factor in maintaining the flavor quality of fluid milk and its products [1]. Strain C. bovis H9T DSM 19482 T was isolated in April 2004 from a modern farm equipped with automated milking facilities in northern Israel [2]. Three novel psychrotolerant Chryseobacterium species were isolated and identified from raw milk in the same study [1]: C. bovis, C. haifense and C. oranimense [24]. The genus Chryseobacterium [5] is a member of the family Flavobacteriaceae and currently consists of about 100 species with Chryseobacterium gleum as the type species. Species belonging to this genus exist in diverse environments such as milk, water, sludge, soil, animals, insects, plants and human samples [2, 6].

Here we describe a summary classification and a set of the features of the species C. bovis, together with the permanent draft genome sequence description and annotation of the type strain (DSM 19482 T).

Organism information

Classification and features

C. bovis strain DSM 19482 T shares typical characteristics of Chryseobacterium such as Gram-negative staining, occurrence as chemoheterotrophic rods and positive catalase and oxidase reactions. The strain contains flexirubin-type pigments, which are also typical for Chryseobacterium [2] (Table 1). The phylogenetic tree based on the 16S rRNA, also supports the fact that strain DSM 19482 T belongs to Chryseobacterium genus (Fig. 1).

Table 1

Classification and general features of Chryseobacterium bovis DSM 19482T according to the MIGS recommendations [25], published by the Genome Standards Consortium [26] and the Names for Life database [27]

MIGS IDPropertyTermEvidence codea
Current classificationDomain Bacteria TAS [28]
Phylum Bacteroidetes TAS [29]
Class Flavobacteriia TAS [30]
Order Flavobacteriales TAS [31]
Family Flavobacteriaceae TAS [32]
Genus Chryseobacterium TAS [5]
Species Chryseobacterium bovis TAS [2]
Type strain DSM 19482T TAS [2]
Gram stainNegativeTAS [2]
Cell shapeRodTAS [2]
MotilityNon-motileTAS [2]
SporulationNon-sporulatingIDS
Temperature range7–37 °CTAS [2]
Optimum Temperature30–32 °CTAS [2]
pH range, Optimum5.0–9.8; 6.5–8.5NAS
Carbon sourceGlucose, lactose, MaltoseTAS [2]
MIGS-6HabitatCow milkTAS [2]
MIGS-6.3Salinity, Optimum0–2.5%; 0–1.75%NAS
MIGS-22Oxygen requirementFacultative anaerobeTAS [2]
MIGS-15Biotic relationshipUnknownTAS [2]
MIGS-14PathogenicityUnknownTAS [2]
MIGS-4Geographic locationNorthern IsraelTAS [2]
MIGS-5Sample collection2004TAS [2]
MIGS-4.1Latitude32.635149NAS
MIGS-4.2Longitude35.362050NAS
MIGS-4.4AltitudeNot reported-

aEvidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (ie, a direct report exists in the literature), NAS Non-traceable Author Statement (ie, not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [33]

An external file that holds a picture, illustration, etc.
Object name is 40793_2017_242_Fig1_HTML.jpg

Phylogenetic tree highlighting the position of Chryseobacterium bovis relative to type species within the genus Chryseobacterium. Maximum likelihood (ML) tree inferred under the GTR + CAT model and rooted with Elizabethkingia. The branches are scaled in terms of the expected number of substitutions per site. The numbers above the branches are support values when larger than 60% from ML (left) and maximum parsimony (MP, right) bootstrapping. Phylogenies were inferred by the GGDC web server [34] available at (http://ggdc.dsmz.de) using the DSMZ phylogenomics pipeline [35] adapted to single genes. A multiple sequence alignment was created with MUSCLE [36]. ML and MP trees were inferred from the alignment with RAxML [37] and TNT [38], respectively. For ML, rapid bootstrapping in conjunction with the autoMRE bootstopping criterion [39] and subsequent search for the best tree was used; for MP, 1000 bootstrapping replicates were used in conjunction with tree-bisection-and-reconnection branch swapping and ten random sequence addition replicates

Cells of C. bovis strain DSM 19482 T are non-motile rods, measuring 0.5–0.9 μm in width and 1.1–2.3 μm in length (Fig. 2). After 48 h incubation on standard plate-count agar (SPC) at 30 °C in the dark, colonies are circular with entire edges, opaque, smooth and cream-colored. When light is provided during growth, colonies are yellow-colored because of the production of carotenoid-type pigments (absorbance peaks at 454 and 481 nm). They also contain small amounts of flexirubin-type pigments [24].

An external file that holds a picture, illustration, etc.
Object name is 40793_2017_242_Fig2_HTML.jpg

Electron micrograph of negatively stained cells of Chryseobacterium bovis strain DSM 19482T. Cells are nonflagellated rods. Bar, 2 μm

Growth is observed under anaerobic conditions on SPC agar containing 0.1% (w/v) potassium nitrate but not on SPC agar with the addition of 0.5% glucose (indicating that glucose is not fermented) [2]. The strain grows at 7–37 °C (optimum, 30–32 °C), with 0–2.5% NaCl (optimum, 0–1.75%) and at pH 5.0–9.8 (optimum, pH 6.5–8.5) (Table 1). C. bovis does not grow on MacConkey or cetrimide agar. Casein, aesculin and tributyrin are hydrolysed. Glucose, mannose, maltose, arabinose, mannitol, N-acetylglucosamine, gluconate and adipic and malic acids are assimilated. Acid is produced from D-glucose, maltose, D-lactose and D-mannose. Acetoin is produced; gelatin is hydrolyzed; H2S and indole are not produced; urea is not hydrolyzed; citrate is not utilized; and arginine dihydrolase, lysine and ornithine decarboxylases and tryptophan deaminase activities are absent. Alkaline and acid phosphatases, esterase (C4), esterase lipase (C8), leucine arylamidase, valine arylamidase, naphthol-AS-BI-phosphohydrolase, α-glucosidase, ß-galactosidase and cystine arylamidase activities are present [2].

Chemotaxonomic data

The major fatty acids of the type strains are: iso-C15:0; antesio-C15:0 and iso-C17:0 3OH. Some strains in this species also possess iso-C17:0 ω9c as a major fatty acid [2].

Genome sequencing information

Genome project history

This organism was selected for sequencing based on its phylogenetic position [7] and is part of the study Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes project [8]. The goal of the KMG-I study is to increase the coverage of sequenced reference microbial genomes [9]. The project is registered in the Genomes OnLine Database [10] and the permanent draft genome sequence is deposited in GenBank. Draft sequencing and assembly were performed at the DOE Joint Genome Institute (http://jgi.doe.gov/) using state of the art sequencing technology [11]. A summary of the project information is shown in Table 2.

Table 2

Genome sequencing project information

MIGS IDPropertyTerm
MIGS 31.1Finishing qualityLevel 2: High-Quality Draft
MIGS-28Libraries usedIllumina Std. shotgun library
MIGS 29Sequencing platformsIllumina HiSeq 2500, Illumina HiSeq 2500-1TB
MIGS 31.2Fold coverage230.5X
MIGS 30AssemblersVelvet (v. 1.2.07), ALLPATHS –LG
(v. r46652)
MIGS 32Gene calling methodProdigal 2.5
Locus TagLX71
Genbank IDFTPU01000000
Genbank date of release19-JAN-2017
GOLD IDGp0103631
BIOPROJECTPRJNA262259
MIGS-13Source Material IdentifierDSM 19482T
Project relevanceGEBA-KMG, Tree of Life

Growth conditions and genomic DNA preparation

A culture of DSM 19482 T was grown aerobically in DSMZ medium 381 [12] at 28 °C. Genomic DNA was isolated using a Jetflex Genomic DNA Purification Kit (GENOMED 600100) following the standard protocol provided by the manufacturer. DNA is available from the DSMZ through the DNA Bank Network [13].

Genome sequencing and assembly

The draft genome was generated at the DOE Joint genome Institute (JGI) using the Illumina technology [14]. An Illumina std shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 7,888,518 reads totaling 1183.3 Mb. All general aspects of library construction and sequencing performed at the JGI can be found at (http://www.jgi.doe.gov). All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts [15]. Following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet (version 1.2.07) [16], (2) 1–3 kb simulated paired end reads were created from Velvet contigs using wgsim (https://github.com/lh3/wgsim), (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r46652) [17]. Parameters for assembly steps were: (1) Velvet (velveth: 63 –shortPaired and velvetg: –very clean yes –exportFiltered yes –min contig lgth 500 –scaffolding no –cov cutoff 10) (2) wgsim (–e 0 –1 100 –2 100 –r 0 –R 0 –X 0) (3) Allpaths–LG (PrepareAllpathsInputs: PHRED 64 = 0 PLOIDY = 1 FRAG COVERAGE = 125 JUMP COVERAGE = 25 LONG JUMP COV = 50, RunAllpathsLG: THREADS = 8 RUN = std shredpairs TARGETS = standard VAPI WARN ONLY = True OVERWRITE = True). The final draft assembly contained 101 contigs in 96 scaffolds, totalling 3.3 Mb in size. The final assembly was based on 1152.3 Mb of Illumina data. 230.5X input read coverage was used for the final assembly.

Genome annotation

Genes were identified using Prodigal [18], as part of the DOE-JGI genome annotation pipeline [19]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) nonredundant database, UniProt, TIGRFam, Pfam, KEGG, COG and InterPro databases. The tRNAScanSE tool [20] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [21]. Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [22]. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes (IMG) platform [23] developed by the Joint Genome Institute, Walnut Creek, CA, USA.

Genome properties

The assembly of the draft genome sequence consists of 96 scaffolds amounting to 3,346,045 bp, and the G + C content is 38.19% (Table 3). Of the 3341 genes predicted, 3236 were protein-coding genes, and 105 RNAs. The majority of the protein-coding genes (69.95%) were assigned a putative function while the remaining ones were annotated as hypothetical proteins. The distribution of genes into COGs functional categories is presented in Table 4.

Table 3

Genome statistics

AttributeValue% of Total
Genome size (bp)3,346,045100.00
DNA coding (bp)2,970,60888.78
DNA G + C (bp)1,277,77838.19
DNA scaffolds96100.00
Total genes3341100.00
Protein coding genes323696.86
RNA genes1053.14
Pseudo genes00.00
Genes in internal clusters252375.52
Genes with function prediction233769.95
Genes assigned to COGs168850.52
Genes with Pfam domains242572.58
Genes with signal peptides3109.28
Genes with transmembrane helices69620.83
CRISPR repeats2

Table 4

Number of genes associated with the general COG functional categories

CodeValue% ageDescription
J1679.12Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K1156.28Transcription
L1106.01Replication, recombination and repair
B00.00Chromatin structure and dynamics
D241.31Cell cycle control, cell division, chromosome partitioning
V784.26Defense mechanisms
T563.06Signal transduction mechanisms
M18910.32Cell wall/membrane biogenesis
N160.87Cell motility
U180.98Intracellular trafficking, secretion and vesicular transport
O884.81Posttranslational modification, protein turnover, chaperones
C1035.63Energy production and conversion
G794.31Carbohydrate transport and metabolism
E1478.03Amino acid transport and metabolism
F553.00Nucleotide transport and metabolism
H1085.90Coenzyme transport and metabolism
I764.15Lipid transport and metabolism
P1206.55Inorganic ion transport and metabolism
Q321.75Secondary metabolites biosynthesis, transport and catabolism
R1488.08General function prediction only
S864.70Function unknown
-165349.49Not in COGs

Insights from the genome sequence

C. bovis DSM 19482 T showed the ability to hydrolyze casein and tributyrin [2] and these traits can also be observed in its genome. The following protease genes were detected: Membrane-associated serine protease, rhomboid family; ATP-dependent Clp protease ATP-binding subunit ClpB; Do/DeqQ family serine protease; ATP-dependent Clp protease ATP-binding subunit ClpX and transglutaminase-like enzyme, putative cysteine protease; ATP-dependent Lon protease (Lon functions in the cytosol) and cell division protease FtsH. The lipolytic properties of C. bovis DSM 19482 T are evident from the presence of the following genes: phospholipase/carboxylesterase; esterase/lipase superfamily enzyme and GDSL-like lipase/acylhydrolase.

C. bovis DSM 19482 T is producing carotenoid-type pigments under light conditions. Indeed, genes which are part of the carotenoid biosynthesis are found in its genome: phytoene desaturase (lycopene-forming), phytoene desaturase (neurosporene-forming), phytoene desaturase (zeta-carotene-forming), all-trans-zeta-carotene desaturase and beta-carotene 3-hydroxylase.

C. bovis DSM 19482 T was able to grow under anaerobic conditions when nitrate was provided. This ability is supported by the presence of the following genes: MFS transporter, NNP family, nitrate/nitrite transporter (two genes) and assimilatory nitrate reductase catalytic subunit.

Gliding motility properties are reflected by the presence of the genes that are exclusive to the Bacteroidetes phylum such as gliding motility-associated lipoprotein GldK and gliding motility-associated lipoprotein GldH. Another gene that supports the motility feature is the chemotaxis protein MotB gene.

Among the genes found in C. bovis DSM 19482 T genome are genes for resistance to different components. For example a gene for multidrug resistance protein, MATE family. Members of the Multi-Antimicrobial Extrusion (MATE) family function as drug/sodium antiporters. These proteins mediate resistance to a wide range of cationic dyes, fluroquinolones, aminoglycosides and other structurally diverse antibodies and drugs. These proteins are predicted to have twelve alpha-helical transmembrane regions. The Strain DSM 19482 T genome, also possesses a gene for cobalt-zinc-cadmium resistance protein CzcA. CzcA has a low cation-transport activity for cobalt and is essential for the expression of cobalt, zinc and cadmium resistance. Another gene found in the genome is a tellurite resistance protein TerC. TerC has been implicated in resistance to tellurium, and may be involved in efflux of tellurium ions. The quaternary ammonium compound-resistance protein SugE gene that is found in C. bovis DSM 19482 T genome encodes an efflux pump which confers resistance to cetylpyridinium, cetyldimethylethyl ammonium and cetrimide cations.

Resistance to antibiotics is revealed by the following genes: glycopeptide antibiotics resistance protein (plays a role in resistance to glycopeptide antibiotics such as vancomycin); MFS transporter, DHA1 family; tetracycline resistance protein gene; and Fusaric acid resistance protein-like gene, which is involved in the resistance (detoxification) of the fungal toxin Fusaric acid.

A gene for putative auto-transporter adhesin head GIN domain demonstrates the function of cell adhesion. Two genes indicate the possibility of C. bovis DSM 19482 T to produce a capsule, capsular exopolysaccharide family protein and polysaccharide export outer membrane protein.

Conclusions

In the current study we characterized the genome of C. bovis strain DSM 19482 T that was isolated from raw cow milk [2]. C. bovis is a psychrotolerant bacterium which can grow at 7 °C, although its optimal growth temperature is higher (30–32 °C). After milk collection, the milk is kept in cold storage, and psychrotolerants dominate the bacterial flora. These bacteria possess extracellular enzymes, mainly proteases and lipases which contribute to the spoilage of dairy products, as their enzymes can resist pasteurization [1]. The C. bovis DSM 19482 T genome demonstrates that indeed, this genome encodes proteases and lipases which may play a role in milk products spoilage.

C. bovis strain DSM 19482 T produces a carotenoid pigment, a feature that was also observed for C. haifense [3], but not for other species in this genus. This trait could be used for the commercial production of carotene.

C. bovis DSM 19482 T genome demonstrated the strains' potential to produce a multidrug-resistance protein, resistance to cobalt, zinc, cadmium, tellurite, cetylpyridinium, cetyldimethylethyl ammonium and cetrimide cations as well as resistance to glycopeptide antibiotics, tetracycline and resistance to the fungal toxin fusaric acid. The whole-genome sequence of C. oranimense G311, a strain that was isolated from a cystic fibrosis patient, also demonstrated multi-drug resistance [24]. Indication for a capsule-forming ability was apparent in both C. bovis DSM 19482 T and C. oranimense G311. Sharma et al. [24] suggested that the resistance of C. oranimense G311 to colistin maybe due to the production of capsular polysaccharides.

Acknowledgements

This project has been supported by the Community Sequencing Program of the U.S. Department of Energy’s Joint Genome Institute. The sequencing, assembly and automated genome analysis work at the DOE-JGI was supported by the Office of Science of the U.S. Department of Energy under contract no. DE-AC02-05CH11231. This work was also supported in part by a grant from the German Research Foundation (DFG, the Deutsche Forschungsgemeinschaft, GZ: HO 930/5-1 and 930/5-2.; Prof. Malka Halpern). We are grateful to Andrea Schütze for growing cells and to Meike Döppner for preparing gDNA (both at DSMZ).

Authors’ contributions

MH isolated and characterized strain DSM 19482T SL, MG, NCK, HPK and MH drafted the manuscript. MG, MH, AC, MP, KP, NV, NM, DS, TBK, CD, NS, VM, NI, TW and NCK sequenced, assembled and annotated the genome. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Abbreviations

GEBAGenomic encyclopedia of Bacteria and Archaea
KMGOne thousand microbial genomes
MIGSMinimum information about a genome sequence
NASNon-traceable
TASTraceable

References

1. Hantsis-Zacharov E, Halpern M. Psychrotrophic bacterial communities in raw milk and their proteolytic and lipolytic traits. Appl Environ Microbiol. 2007;73:7162–7168. doi: 10.1128/AEM.00866-07. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
2. Hantsis-Zacharov E, Senderovich Y, Halpern M. Chryseobacterium bovis sp. nov. isolated from raw cow’s milk. Int J Syst Evol Microbiol. 2008;58:1024–1028. doi: 10.1099/ijs.0.65500-0. [PubMed] [CrossRef] [Google Scholar]
3. Hantsis-Zacharov E, Halpern M. Chryseobacterium haifense sp. nov., a psychrotolerant bacterium isolated from raw milk. Int J Syst Evol Microbiol. 2007;57:2344–2348. doi: 10.1099/ijs.0.65115-0. [PubMed] [CrossRef] [Google Scholar]
4. Hantsis-Zacharov E, Shakéd T, Senderovich Y, Halpern M. (2008b) Chryseobacterium oranimense sp. nov., a psychrotolerant, proteolytic and lipolytic bacterium isolated from raw cow’s milk. Int J Syst Evol Microbiol. 2008;58:2635–2639. doi: 10.1099/ijs.0.65819-0. [PubMed] [CrossRef] [Google Scholar]
5. Vandanmme P, Bernardet JF, Segers P, Kersters K, Holmes B. New perspectives in the classification of the flavobacteria: description of Chryseobacterium gen. nov., Bergeyella gen. nov., and Empedobacter nom. rev. Int J Syst Bacteriol. 1994;44:827–831. doi: 10.1099/00207713-44-4-827. [CrossRef] [Google Scholar]
6. Bernardet JF, Hugo C, Bruun B. The genera Cryseobacterium and Elizabethkingia. In: Dworkin M, Falkow S, Rosenberg E, Schleifer K-H, Stackebrandt E, editors. The Prokaryotes A handbook on the biology of Bacteria. New York: Springer; 2006. pp. 628–676. [Google Scholar]
7. Göker M, Klenk HP. Phylogeny-driven target selection for large-scale genome-sequencing (and other) projects. Stand Genomic Sci. 2013;8:360–374. doi: 10.4056/sigs.3446951. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
8. Kyrpides NC, Woyke T, Eisen JA, Garrity G, Lilburn TG, Beck BJ, et al. Genomicencyclopedia of type strains, phase I: the one thousand microbial genomes (KMG-I) project. Stand Genomic Sci. 2013;9:628–6234. doi: 10.4056/sigs.5068949. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
9. Kyrpides NC, Hugenholtz P, Eisen JA, Woyke T, Göker M, Parker CT, et al. GenomicEncyclopedia of Bacteria and Archaea: sequencing a myriad of type strains. PLoS Biol. 2014;8:e1001920. doi: 10.1371/journal.pbio.1001920. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
10. Reddy TBK, Thomas AD, Stamatis D, Bertsch J, Isbandi M, Jansson J, et al. The Genomes OnLine Database (GOLD) v. 5: a metadata management system based on a four level (meta)genome project classification. Nucleic Acids Res. 2015;43:D1099–D1106. doi: 10.1093/nar/gku950. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
11. Mavromatis K, Land ML, Brettin TS, Quest DJ, Copeland A, Clum A, et al. The fast changing landscape of sequencing technologies and their impact on microbial assemblies and annotations. PLoS One. 2012;7:e48837. doi: 10.1371/journal.pone.0048837. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
12. http://www.dsmz.de (DSMZ list of growth media).
13. Gemeinholzer B, Dröge G, Zetzsche H, Haszprunar G, Klenk H-P, Güntsch A, et al. The DNA bank network: the start from a German initiative. Biopreserv Biobank. 2011;9:51–55. doi: 10.1089/bio.2010.0029. [PubMed] [CrossRef] [Google Scholar]
14. Bennett S. Solexa Ltd. Pharmacogenomics. 2004;5:433–438. doi: 10.1517/14622416.5.4.433. [PubMed] [CrossRef] [Google Scholar]
15. Mingkun L, Copeland A, Han J. DUK - A Fast and Efficient Kmer Matching Tool. 2011; Report Number: LBNL-4516E-Abs. https://pubarchive.lbl.gov/islandora/object/ir%3A155199/datastream/PDF/view.
16. Zerbino D, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18:821–829. doi: 10.1101/gr.074492.107. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
17. Gnerre S, MacCallum I. High–quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci U S A. 2011;108:1513–1518. doi: 10.1073/pnas.1017351108. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
18. Hyatt D, Chen GL, Lacascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119. doi: 10.1186/1471-2105-11-119. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
19. Huntemann M, Ivanova NN, Mavromatis K, Tripp HJ, Paez-Espino D, Palaniappan K, et al. The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4) Stand Genomic Sci. 2015;10:86. doi: 10.1186/s40793-015-0077-y. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
20. Lowe TM, Eddy SR. tRNAscan–SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–964. doi: 10.1093/nar/25.5.0955. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
21. Pruesse E, Quast C, Knittel K, Fuchs B, Ludwig W, Peplies J, et al. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nuc Acids Res. 2007;35:2188–7196. doi: 10.1093/nar/gkm864. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
22. INFERNAL. Inference of RNA alignments. http://infernal.janelia.org.
23. Markowitz VM, Chen IM, Palaniappan K, Chu K, Szeto E, Pillay M, Ratner A, et al. IMG 4 version of the integrated microbial genomes comparative analysis system. Nucleic Acids Res. 2014;42(Database issue):D560–7. doi: 10.1093/nar/gkt963. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
24. Sharma P, Gupta SK, Diene SM, Rolain J-M. Whole-genome sequence of Chryseobacterium oranimense, a colistin-resistant bacterium isolated from a cystic fibrosis patient in France. Antimicrob Agents Chemother. 2015;59:1696–1706. doi: 10.1128/AAC.02417-14. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
25. Field D, Garrity GM, Gray T, Morrison N, Selengut J, Sterk P, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26:541–547. doi: 10.1038/nbt1360. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
26. Field D, Amaral-Zettler L, Cochrane G, Cole JR, Dawyndt P, Garrity GM, et al. The genomic standards consortium. PLoS Biol. 2011;9:e1001088. doi: 10.1371/journal.pbio.1001088. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
27. Garrity GM. Names for Life Browser Tool takes expertise out of the database and puts it right in the browser. Microbiol Today. 2010;7:9. [Google Scholar]
28. Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci U S A. 1990;87:4576–4579. doi: 10.1073/pnas.87.12.4576. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
29. Krieg NR, Ludwig W, Euzéby J, Whitman WB, et al. Phylum XIV. Bacteroidetes phyl. nov. In: Krieg NR, Staley JT, Brown DR, Hedlund BP, Paster BJ, Ward NL, et al., editors. Bergey’s manual of systematic bacteriology. 2. New York: Springer; 2011. pp. 4–25. [Google Scholar]
30. Bernardet JF, Class II, et al. Flavobacteriia class. nov. In: Krieg NR, Staley JT, Brown DR, Hedlund BP, Paster BJ, Ward NL, et al., editors. Bergey’s manual of systematic bacteriology. 2. New York: Springer; 2011. pp. 4–105. [Google Scholar]
31. Garrity GM, Holt JG, et al. Taxonomic outline of the Archaea and Bacteria. In: Krieg NR, Staley JT, Brown DR, Hedlund BP, Paster BJ, Ward NL, et al., editors. Bergey’s manual of systematic bacteriology. 2. New York: Springer; 2011. pp. 155–66. [Google Scholar]
32. Bernardet JF, Nakagawa Y, Holmes B. Proposed minimal standards for describing new taxa of the family Flavobacteriaceae, and emended description of the family. Int J Syst Evol Microbiol. 2002;52:1049–70. [PubMed] [Google Scholar]
33. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25:25–29. doi: 10.1038/75556. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
34. Meier-Kolthoff JP, Auch AF, Klenk HP, Göker M. Genome sequence-based species delimitation with confidence intervals and improved distance functions. BMC Bioinformatics. 2013;4:60. doi: 10.1186/1471-2105-14-60. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
35. Meier-Kolthoff JP, Hahnke RL, Petersen J, Scheuner C, Michael V, Fiebig A, et al. Complete genome sequence of DSM 30083T, the type strain (U5/41T) of Escherichia coli, and a proposal for delineating subspecies in microbial taxonomy. Stand Genomic Sci. 2014;10:2. doi: 10.1186/1944-3277-9-2. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
36. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
37. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [PMC free article] [PubMed] [CrossRef] [Google Scholar]
38. Goloboff PA, Farris JS, Nixon KC. TNT, a free program for phylogenetic analysis. Cladistics. 2008;24:774–786. doi: 10.1111/j.1096-0031.2008.00217.x. [CrossRef] [Google Scholar]
39. Pattengale ND, Alipour M, Bininda-Emonds ORP, Moret BME, Stamatakis A. How many bootstrap replicates are necessary? J Comput Biol. 2010;17:337–354. doi: 10.1089/cmb.2009.0179. [PubMed] [CrossRef] [Google Scholar]

Articles from Standards in Genomic Sciences are provided here courtesy of BMC

-