Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 14:13:891418.
doi: 10.3389/fgene.2022.891418. eCollection 2022.

A Bioinformatics Approach to Investigate Structural and Non-Structural Proteins in Human Coronaviruses

Affiliations

A Bioinformatics Approach to Investigate Structural and Non-Structural Proteins in Human Coronaviruses

Vittoria Cicaloni et al. Front Genet. .

Abstract

Recent studies confirmed that people unexposed to SARS-CoV-2 have preexisting reactivity, probably due to previous exposure to widely circulating common cold coronaviruses. Such preexistent reactivity against SARS-CoV-2 comes from memory T cells that can specifically recognize a SARS-CoV-2 epitope of structural and non-structural proteins and the homologous epitopes from common cold coronaviruses. Therefore, it is important to understand the SARS-CoV-2 cross-reactivity by investigating these protein sequence similarities with those of different circulating coronaviruses. In addition, the emerging SARS-CoV-2 variants lead to an intense interest in whether mutations in proteins (especially in the spike) could potentially compromise vaccine effectiveness. Since it is not clear that the differences in clinical outcomes are caused by common cold coronaviruses, a deeper investigation on cross-reactive T-cell immunity to SARS-CoV-2 is crucial to examine the differential COVID-19 symptoms and vaccine performance. Therefore, the present study can be a starting point for further research on cross-reactive T cell recognition between circulating common cold coronaviruses and SARS-CoV-2, including the most recent variants Delta and Omicron. In the end, a deep learning approach, based on Siamese networks, is proposed to accurately and efficiently calculate a BLAST-like similarity score between protein sequences.

Keywords: SARS-CoV-2; Siamese networks; cross-reactivity; long short-term memories; similarity score.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Basic features of seven HCoVs. The two blue panels list alpha-HCoVs and the five orange panels list beta-HCoVs. All the boxes include the year of identification, receptor, and major proteins. Two beta-HCoVs (HCoV-OC43 and HCoV-HKU1) originate in mice and contain hemagglutinin-esterase (HE) structural protein along with S, E, M, and N. Other five HCoVs originated in bats and have four major structural proteins S, E, M, and N.
FIGURE 2
FIGURE 2
Distribution of similarity between pairs of examples in the data set. The number of pairs is proportional to similarity.
FIGURE 3
FIGURE 3
Distribution of protein sequence length in the data set.
FIGURE 4
FIGURE 4
Percent identity matrix. In (A), the percent identity matrix is reported, which shows the % of identities among the HCoVs. In (B), the similarity phylogram tree is shown. The branch length is proportional to the number of nucleotide substitutions, that is, to the number of evolutionary events that took place after the branching point.
FIGURE 5
FIGURE 5
Percent identity matrix. In A, the percent identity matrix is reported, which shows the % of identities among the S proteins of HCoVs.

Similar articles

Cited by

References

    1. Ahsan M. A., Liu Y., Feng C., Hofestädt R., Chen M. (2021). OverCOVID: an Integrative Web Portal for SARS-CoV-2 Bioinformatics Resources. J. Integr. Bioinforma. 18 (1), 9–17. 10.1515/jib-2020-0046 - DOI - PMC - PubMed
    1. Bisht H., Roberts A., Vogel L., Bukreyev A., Collins P. L., Murphy B. R., et al. (2004). Severe Acute Respiratory Syndrome Coronavirus Spike Protein Expressed by Attenuated Vaccinia Virus Protectively Immunizes Mice. Proc. Natl. Acad. Sci. 101 (17), 6641–6646. 10.1073/pnas.0401939101 - DOI - PMC - PubMed
    1. Bromley J., Bentz J. W., Bottou L., Guyon I., LeCun Y., Moore C., et al. (1993). Signature Verification Using a “Siamese” Time Delay Neural Network. Int. J. Pattern Recognit. Artif. Intell. 7 (04), 669–688. 10.1142/S0218001493000339 - DOI
    1. Butler D., Mozsary C., Meydan C., Foox J., Rosiene J., Shaiber A., et al. (2021). Shotgun Transcriptome and Isothermal Profiling of SARS-CoV-2 Infection Reveals Unique Host Responses, Viral Diversification, and Drug Interactions. Nat. Commun. 12 (1), 1–7. 10.1038/s41467-021-21361-7 - DOI - PMC - PubMed
    1. Cao Z., Liu L., Du L., Zhang C., Jiang S., Li T., et al. (2010). Potent and Persistent Antibody Responses against the Receptor-Binding Domain of SARS-CoV Spike Protein in Recovered Patients. Virology J. 7 (1), 1–6. 10.1186/1743-422X-7-299 - DOI - PMC - PubMed
-