Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jan 20;90(7):3627-39.
doi: 10.1128/JVI.02988-15.

Extensive Positive Selection Drives the Evolution of Nonstructural Proteins in Lineage C Betacoronaviruses

Affiliations

Extensive Positive Selection Drives the Evolution of Nonstructural Proteins in Lineage C Betacoronaviruses

Diego Forni et al. J Virol. .

Abstract

Middle East respiratory syndrome-related coronavirus (MERS-CoV) spreads to humans via zoonotic transmission from camels. MERS-CoV belongs to lineage C of betacoronaviruses (betaCoVs), which also includes viruses isolated from bats and hedgehogs. A large portion of the betaCoV genome consists of two open reading frames (ORF1a and ORF1b) that are translated into polyproteins. These are cleaved by viral proteases to generate 16 nonstructural proteins (nsp1 to nsp16) which compose the viral replication-transcription complex. We investigated the evolution of ORF1a and ORF1b in lineage C betaCoVs. Results indicated widespread positive selection, acting mostly on ORF1a. The proportion of positively selected sites in ORF1a was much higher than that previously reported for the surface-exposed spike protein. Selected sites were unevenly distributed, with nsp3 representing the preferential target. Several pairs of coevolving sites were also detected, possibly indicating epistatic interactions; most of these were located in nsp3. Adaptive evolution at nsp3 is ongoing in MERS-CoV strains, and two selected sites (G720 and R911) were detected in the protease domain. While position 720 is variable in camel-derived viruses, suggesting that the selective event does not represent a specific adaptation to humans, the R911C substitution was observed only in human-derived MERS-CoV isolates, including the viral strain responsible for the recent South Korean outbreak. It will be extremely important to assess whether these changes affect host range or other viral phenotypes. More generally, data herein indicate that CoV nsp3 represents a major selection target and that nsp3 sequencing should be envisaged in monitoring programs and field surveys.

Importance: Both severe acute respiratory syndrome coronavirus (SARS-CoV) and MERS-CoV originated in bats and spread to humans via an intermediate host. This clearly highlights the potential for coronavirus host shifting and the relevance of understanding the molecular events underlying the adaptation to new host species. We investigated the evolution of ORF1a and ORF1b in lineage C betaCoVs and in 87 sequenced MERS-CoV isolates. Results indicated widespread positive selection, stronger in ORF1a than in ORF1b. Several selected sites were found to be located in functionally relevant protein regions, and some of them corresponded to functional mutations in other coronaviruses. The proportion of selected sites we identified in ORF1a is much higher than that for the surface-exposed spike protein. This observation suggests that adaptive evolution in ORF1a might contribute to host shifts or immune evasion. Data herein also indicate that genetic diversity at nonstructural proteins should be taken into account when antiviral compounds are developed.

PubMed Disclaimer

Figures

Fig 1
Fig 1
(A) Schematic representation of ORF1a and ORF1b and their nsp products. nsps are colored in hues of blue depending on the percentage of negatively selected sites. nsp11 is shown in gray because it was not analyzed (because it is too short). Positively selected sites are represented by triangles, with colors corresponding to the selected branch in the phylogeny (see panel B). Coevolving sites are shown below the nsp structure, with different symbols indicating each pair of coevolving sites. (B) Maximum likelihood phylogenies for ORF1a (left) and ORF1b (right) in lineage C betaCoVs. Branches set as foreground lineages in independent branch site tests are highlighted with different colors and numbered. Thick branches yielded statistically significant evidence of positive selection. Branch length is proportional to synonymous substitution rate (dS). Coevolving sites are also reported, with different symbols as indicated in panel A. Positions are relative to each nsp; see also Table S3 in the supplemental material.
FIG 2
FIG 2
Maximum likelihood phylogeny for nsp3 sequences in a subset of isolates representing MERS-CoV major groups. The amino acid alignment of the region surrounding the two positively selected sites (magenta) in MERS-CoV isolates is also shown. Asterisks indicate viruses isolated from dromedary camels.
FIG 3
FIG 3
(A) Representation of nsp3 domain architecture. Positively selected sites are indicated by triangles, coevolving sites with symbols (see Fig. 1 legend). In the enlargements, positively selected sites were mapped onto known domain 3D structures of MERS-CoV or SARS-CoV (PDB codes 2GRI, 3EWR, 4RNA, and 2K87). The acidic domain is shown in gray because it was not analyzed (see the text). (B and C) Topology maps and probability diagrams of transmembrane helices for MERS-CoV nsp3 transmembrane domain (B) and MERS-CoV nsp4 (C). The conserved cysteine residues and the predicted N-glycosylation sites are mapped onto the luminal loops. Color codes are as in Fig. 1; yellow indicates protein regions or sites known to be functional and mentioned in the text.
FIG 4
FIG 4
(A) Structure of the dimeric form of MERS-CoV nsp5 (PDB code 4YLU). Positively selected sites affecting the dimerization process are labeled. Color codes are as in Fig. 1. Catalytic residues are in yellow. (B) Amino acid alignment of the nsp5 regions surrounding selected sites. Residues that confer a temperature-sensitive phenotype when mutated in MHV are underlined (see the text).
FIG 5
FIG 5
Ribbon representation of SARS-CoV nsp10-nsp16 complex (PDB code 2XYQ) (A) and SARS-CoV nsp10-nsp14 complex (PDB code 5C8U) (B). Positively selected sites in lineage C betaCoVs are shown in green (see Fig. 1), and residues involved in inter- or intraprotein interactions are shown in yellow. An amino acid alignment of the nsp10 exposed loop containing S61 and Q65 is also shown. Functional residues in MHV-CoV and SARS-CoV are underlined (see the text).

Similar articles

Cited by

References

    1. Zaki AM, van Boheemen S, Bestebroer TM, Osterhaus AD, Fouchier RA. 2012. Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia. N Engl J Med 367:1814–1820. doi:10.1056/NEJMoa1211721. - DOI - PubMed
    1. Cotten M, Watson SJ, Zumla AI, Makhdoom HQ, Palser AL, Ong SH, Al Rabeeah AA, Alhakeem RF, Assiri A, Al-Tawfiq JA, Albarrak A, Barry M, Shibl A, Alrabiah FA, Hajjar S, Balkhy HH, Flemban H, Rambaut A, Kellam P, Memish ZA. 2014. Spread, circulation, and evolution of the Middle East respiratory syndrome coronavirus. mBio 5(1):e01062-13. doi:10.1128/mBio.01062-13. - DOI - PMC - PubMed
    1. Al-Tawfiq JA, Memish ZA. 2014. Middle East respiratory syndrome coronavirus: transmission and phylogenetic evolution. Trends Microbiol 22:573–579. doi:10.1016/j.tim.2014.08.001. - DOI - PMC - PubMed
    1. de Groot RJ, Baker SC, Baric RS, Brown CS, Drosten C, Enjuanes L, Fouchier RA, Galiano M, Gorbalenya AE, Memish ZA, Perlman S, Poon LL, Snijder EJ, Stephens GM, Woo PC, Zaki AM, Zambon M, Ziebuhr J. 2013. Middle East respiratory syndrome coronavirus (MERS-CoV): announcement of the Coronavirus Study Group. J Virol 87:7790–7792. doi:10.1128/JVI.01244-13. - DOI - PMC - PubMed
    1. Lauber C, Goeman JJ, Parquet Mdel C, Nga PT, Snijder EJ, Morita K, Gorbalenya AE. 2013. The footprint of genome architecture in the largest genome expansion in RNA viruses. PLoS Pathog 9:e1003500. doi:10.1371/journal.ppat.1003500. - DOI - PMC - PubMed

Publication types

Substances

-