PEPred-Suite: improved and robust prediction of therapeutic peptides using adaptive feature representation learning
- PMID: 30994882
- DOI: 10.1093/bioinformatics/btz246
PEPred-Suite: improved and robust prediction of therapeutic peptides using adaptive feature representation learning
Abstract
Motivation: Prediction of therapeutic peptides is critical for the discovery of novel and efficient peptide-based therapeutics. Computational methods, especially machine learning based methods, have been developed for addressing this need. However, most of existing methods are peptide-specific; currently, there is no generic predictor for multiple peptide types. Moreover, it is still challenging to extract informative feature representations from the perspective of primary sequences.
Results: In this study, we have developed PEPred-Suite, a bioinformatics tool for the generic prediction of therapeutic peptides. In PEPred-Suite, we introduce an adaptive feature representation strategy that can learn the most representative features for different peptide types. To be specific, we train diverse sequence-based feature descriptors, integrate the learnt class information into our features, and utilize a two-step feature optimization strategy based on the area under receiver operating characteristic curve to extract the most discriminative features. Using the learnt representative features, we trained eight random forest models for eight different types of functional peptides, respectively. Benchmarking results showed that as compared with existing predictors, PEPred-Suite achieves better and robust performance for different peptides. As far as we know, PEPred-Suite is currently the first tool that is capable of predicting so many peptide types simultaneously. In addition, our work demonstrates that the learnt features can reliably predict different peptides.
Availability and implementation: The user-friendly webserver implementing the proposed PEPred-Suite is freely accessible at http://server.malab.cn/PEPred-Suite.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Similar articles
-
Computational tools for exploring peptide-membrane interactions in gram-positive bacteria.Comput Struct Biotechnol J. 2023 Mar 2;21:1995-2008. doi: 10.1016/j.csbj.2023.02.051. eCollection 2023. Comput Struct Biotechnol J. 2023. PMID: 36950221 Free PMC article. Review.
-
Large-scale comparative review and assessment of computational methods for anti-cancer peptide identification.Brief Bioinform. 2021 Jul 20;22(4):bbaa312. doi: 10.1093/bib/bbaa312. Brief Bioinform. 2021. PMID: 33316035 Free PMC article. Review.
-
Iterative feature representations improve N4-methylcytosine site prediction.Bioinformatics. 2019 Dec 1;35(23):4930-4937. doi: 10.1093/bioinformatics/btz408. Bioinformatics. 2019. PMID: 31099381
-
Exploring sequence-based features for the improved prediction of DNA N4-methylcytosine sites in multiple species.Bioinformatics. 2019 Apr 15;35(8):1326-1333. doi: 10.1093/bioinformatics/bty824. Bioinformatics. 2019. PMID: 30239627
-
ACPred-FL: a sequence-based predictor using effective feature representation to improve the prediction of anti-cancer peptides.Bioinformatics. 2018 Dec 1;34(23):4007-4016. doi: 10.1093/bioinformatics/bty451. Bioinformatics. 2018. PMID: 29868903 Free PMC article.
Cited by
-
Contrastive learning for enhancing feature extraction in anticancer peptides.Brief Bioinform. 2024 Mar 27;25(3):bbae220. doi: 10.1093/bib/bbae220. Brief Bioinform. 2024. PMID: 38725157 Free PMC article.
-
ACP-DRL: an anticancer peptides recognition method based on deep representation learning.Front Genet. 2024 Apr 9;15:1376486. doi: 10.3389/fgene.2024.1376486. eCollection 2024. Front Genet. 2024. PMID: 38655048 Free PMC article.
-
ACPPfel: Explainable deep ensemble learning for anticancer peptides prediction based on feature optimization.Front Genet. 2024 Feb 29;15:1352504. doi: 10.3389/fgene.2024.1352504. eCollection 2024. Front Genet. 2024. PMID: 38487252 Free PMC article.
-
Deepstacked-AVPs: predicting antiviral peptides using tri-segment evolutionary profile and word embedding based multi-perspective features with deep stacking model.BMC Bioinformatics. 2024 Mar 7;25(1):102. doi: 10.1186/s12859-024-05726-5. BMC Bioinformatics. 2024. PMID: 38454333 Free PMC article.
-
TPpred-LE: therapeutic peptide function prediction based on label embedding.BMC Biol. 2023 Oct 31;21(1):238. doi: 10.1186/s12915-023-01740-w. BMC Biol. 2023. PMID: 37904157 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials