Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Sep 29:19:5510-5524.
doi: 10.1016/j.csbj.2021.09.025. eCollection 2021.

Machine learning applications in RNA modification sites prediction

Affiliations
Review

Machine learning applications in RNA modification sites prediction

A El Allali et al. Comput Struct Biotechnol J. .

Abstract

Ribonucleic acid (RNA) modifications are post-transcriptional chemical composition changes that have a fundamental role in regulating the main aspect of RNA function. Recently, large datasets have become available thanks to the recent development in deep sequencing and large-scale profiling. This availability of transcriptomic datasets has led to increased use of machine learning based approaches in epitranscriptomics, particularly in identifying RNA modifications. In this review, we comprehensively explore machine learning based approaches used for the prediction of 11 RNA modification types, namely, m 1 A , m 6 A , m 5 C , 5 hmC , ψ , 2 ' - O - Me , ac 4 C , m 7 G , A - to - I , m 2 G , and D . This review covers the life cycle of machine learning methods to predict RNA modification sites including available benchmark datasets, feature extraction, and classification algorithms. We compare available methods in terms of datasets, target species, approach, and accuracy for each RNA modification type. Finally, we discuss the advantages and limitations of the reviewed approaches and suggest future perspectives.

Keywords: Deep learning; Feature extraction; Machine learning; Prediction; RNA modification.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

None
Graphical abstract
Fig. 1
Fig. 1
Chemical structures of popular RNA modifications.
Fig. 2
Fig. 2
Supervised learning workflow.

Similar articles

Cited by

References

    1. Xuan J.J., Sun W.J., Lin P.H., Zhou K.R., Liu S., Zheng L.L., Qu L.H., Yang J.H. Rmbase v2.0: Deciphering the map of rna modifications from epitranscriptome sequencing data. Nucleic Acids Res. 2018;46:D327–D334. doi: 10.1093/nar/gkx934. - DOI - PMC - PubMed
    1. W.A. Cantara, P.F. Crain, J. Rozenski, J.A. Mccloskey, K.A. Harris, X. Zhang, F.A.P. Vendeix, D. Fabris, P.F. Agris, The rna modification database, rnamdb: 2011 update, Nucleic Acids Research doi:10.1093/nar/gkq1028. - PMC - PubMed
    1. Linder B., Grozhik A.V., Olarerin-George A.O., Meydan C., Mason C.E., Jaffrey S.R. Single-nucleotide-resolution mapping of m6a and m6am throughout the transcriptome. Nat Methods. 2015;12:767–772. doi: 10.1038/nmeth.3453. - DOI - PMC - PubMed
    1. Hauenschild R., Tserovski L., Schmid K., Thüring K., Winz M.L., Sharma S., Entian K.D., Wacheul L., Lafontaine D.L., Anderson J., Alfonzo J., Hildebrandt A., Jäschke A., Motorin Y., Helm M. The reverse transcription signature of n-1-methyladenosine in rna-seq is sequence dependent. Nucleic Acids Res. 2015;43:9950–9964. doi: 10.1093/nar/gkv895. - DOI - PMC - PubMed
    1. Tserovski L., Marchand V., Hauenschild R., Blanloeil-Oillo F., Helm M., Motorin Y. High-throughput sequencing for 1-methyladenosine (m1a) mapping in rna. Methods. 2016;107:110–121. doi: 10.1016/j.ymeth.2016.02.012. - DOI - PubMed

LinkOut - more resources

-