Genome re-sequencing and reannotation of the Escherichia coli ER2566 strain and transcriptome sequencing under overexpression conditions
- PMID: 32546194
- PMCID: PMC7296898
- DOI: 10.1186/s12864-020-06818-1
Abstract
Background: The Escherichia coli ER2566 strain (NC_CP014268.2) was developed as a BL21 (DE3) derivative and has been widely used for recombinant protein expression. However, like many other current RefSeq annotations, the annotation of the ER2566 strain was incomplete, with missing gene names and miscellaneous RNAs, as well as uncorrected annotations of some pseudogenes. Here, we performed a systematic reannotation of the ER2566 genome by combining multiple annotation tools with manual revision to provide a comprehensive understanding of the E. coli ER2566 strain, and used high-throughput sequencing to explore how the strain adapts under external pressure.
Results: The reannotation included noteworthy corrections to all protein-coding genes, led to the exclusion of 190 hypothetical genes or pseudogenes, and resulted in the addition of 237 coding sequences, 230 miscellaneous noncoding RNAs, and 2 tRNAs. In addition, we manually examined all 194 pseudogenes in the RefSeq annotation and directly identified 123 (63%) as coding genes. We then used whole-genome sequencing and high-throughput RNA sequencing to assess mutational adaptations under consecutive subculture or overexpression burden. Whereas no mutations were detected in response to consecutive subculture, overexpression of the human papillomavirus type 16 capsid led to the identification of a mutation (position 1,094,824, within the 3' non-coding region) located 19 bp from the lacI gene in the transcribed RNA, which was not detected at the genomic level by Sanger sequencing.
Conclusion: The ER2566 strain is used by both the general scientific community and the biotechnology industry. Reannotation of the E. coli ER2566 strain not only improved the RefSeq data but also uncovered a key site that might be involved in the transcription and translation of genes encoding the lactose operon repressor. We propose that our pipeline might offer a universal method for the rapid and accurate reannotation of other bacterial genomes. This study might facilitate a better understanding of gene function in the ER2566 strain under external burden and provides more clues for engineering bacteria for biotechnological applications.
Keywords: Engineer bacteria; Escherichia coli ER2566; Genome reannotation; Transcriptome sequencing.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures
![Fig. 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7298944/bin/12864_2020_6818_Fig1_HTML.gif)
![Fig. 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7298944/bin/12864_2020_6818_Fig2_HTML.gif)
![Fig. 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7298944/bin/12864_2020_6818_Fig3_HTML.gif)
![Fig. 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7298944/bin/12864_2020_6818_Fig4_HTML.gif)
![Fig. 5](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/7298944/bin/12864_2020_6818_Fig5_HTML.gif)