Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Oct 20:2020:7234961.
doi: 10.1155/2020/7234961. eCollection 2020.

Sequence Analysis and Structure Prediction of SARS-CoV-2 Accessory Proteins 9b and ORF14: Evolutionary Analysis Indicates Close Relatedness to Bat Coronavirus

Affiliations

Sequence Analysis and Structure Prediction of SARS-CoV-2 Accessory Proteins 9b and ORF14: Evolutionary Analysis Indicates Close Relatedness to Bat Coronavirus

Chittaranjan Baruah et al. Biomed Res Int. .

Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has a single-stranded RNA genome that encodes 14 open reading frames (ORFs), eight of which encode accessory proteins that allow the virus to infect the host and promote virulence. The genome expresses around 29 structural and nonstructural protein products. The accessory proteins of SARS-CoV-2 are not essential for virus replication but do affect viral release, stability, and pathogenesis and finally contribute to virulence. This paper has attempted the structure prediction and functional analysis of two such accessory proteins, 9b and ORF14, in the absence of experimental structures. Sequence analysis, structure prediction, functional characterization, and evolutionary analysis based on the UniProtKB reviewed the amino acid sequences of SARS-CoV-2 9b (P0DTD2) and ORF14 (P0DTD3) proteins. Modeling has been presented with the introduction of hybrid comparative and ab initio modeling. QMEANDisCo 4.0.0 and ProQ3 for global and local (per residue) quality estimates verified the structures as high quality, which may be attributed to structure-based drug design targets. Tunnel analysis revealed the presence of 1-2 highly active tunneling sites, perhaps which will able to provide certain inputs for advanced structure-based drug design or to formulate potential vaccines in the absence of a complete experimental structure. The evolutionary analysis of both proteins of human SARS-CoV-2 indicates close relatedness to the bat coronavirus. The whole-genome phylogeny indicates that only the new bat coronavirus followed by pangolin coronaviruses has a close evolutionary relationship with the novel SARS-CoV-2.

PubMed Disclaimer

Conflict of interest statement

The authors declare that there are no conflicts of interest.

Figures

Figure 1
Figure 1
A conceptual framework of the present study for analysis of SARS-CoV-2 accessory proteins ORF9b and ORF14.
Figure 2
Figure 2
Amino acid composition of SARS-CoV-2 9b protein (leucine and valine-rich).
Figure 3
Figure 3
Amino acid composition of SARS-CoV-2 ORF14 protein (leucine and valine-rich).
Figure 4
Figure 4
Structure of SARS-CoV-2 9b protein along with major active sites.
Figure 5
Figure 5
Structure of SARS-CoV-2 9b protein and its two high-throughput tunnels. Tunnels are colored on the basis of preferences in throughput values, i.e., tunnel-1 (blue) and tunnel-2 (green). The high relevance pocket is shown (yellow).
Figure 6
Figure 6
Structure of SARS-CoV-2 ORF14 protein along with major active sites.
Figure 7
Figure 7
Structure of SARS-CoV-2 ORF14 protein along with its throughput tunnels (blue).
Figure 8
Figure 8
Protein family membership of SARS-CoV-2 protein 9b resembles the SARS-like protein (IPR018542) and the 9b SARS InterPro homologous superfamily (9-97; IPR037223).
Figure 9
Figure 9
Protein family membership of SARS-CoV-2 protein 14 resembles SARS-like protein (IPR035113). The protein is with three domains: (i) noncytoplasmic domain (1-51), (ii) transmembrane region (52-72), and (iii) cytoplasmic domain (73-73).
Figure 10
Figure 10
Evolutionary analysis of 9b protein of SARS-CoV-2 by Maximum Likelihood method and JTT matrix-based model (Jones and Taylor, 1992). The tree with the highest log likelihood (-1113.75) is shown. This analysis involved 9 amino acid sequences. There were a total of 141 positions in the final dataset. Evolutionary analyses were conducted in MEGA X.
Figure 11
Figure 11
Evolutionary analysis of ORF14 protein of SARS-CoV-2 by Maximum Likelihood method and JTT matrix-based model (Jones and Taylor, 1992). The tree with the highest log likelihood (-497.56) is shown. The percentage of trees, in which the associated taxa clustered together, is shown next to the branches. This analysis involved 14 amino acid sequences. There were a total of 73 positions in the final dataset. Evolutionary analyses were conducted in MEGA X.
Figure 12
Figure 12
Evolutionary relationships of different coronaviruses based on whole-genome bootstrap phylogenetic analysis (ML tree).

Similar articles

Cited by

References

    1. Yang H., Bartlam M., Rao Z. Drug design targeting the main protease, the Achilles’ heel of coronaviruses. Current Pharmaceutical Design. 2006;12(35):4573–4590. doi: 10.2174/138161206779010369. - DOI - PubMed
    1. Baruah C., Devi P., Sharma D. K. BioRxiv. 2020. In silico proteome analysis of severe acute respiratory syndrome coronavirus 2(SARS-CoV-2) - DOI
    1. Lu G., Wang Q., Gao G. F. Bat-to-human: spike features determining 'host jump' of coronaviruses SARS-CoV, MERS-CoV, and beyond. Trends in microbiology. 2015;23(8):468–478. doi: 10.1016/j.tim.2015.06.003. - DOI - PMC - PubMed
    1. Lu G., Liu D. SARS-like virus in the Middle East: a truly bat-related coronavirus causing human diseases. Protein & Cell. 2012;3(11):803–805. doi: 10.1007/s13238-012-2811-1. - DOI - PMC - PubMed
    1. Paules C. I., Marston H. D., Fauci A. S. Coronavirus Infections—More than just the common cold. JAMA. 2020;323(8):707–708. doi: 10.1001/jama.2020.0757. - DOI - PubMed

MeSH terms

-