VirulentPred: a SVM based prediction method for virulent proteins in bacterial pathogens
- PMID: 18226234
- PMCID: PMC2254373
- DOI: 10.1186/1471-2105-9-62
VirulentPred: a SVM based prediction method for virulent proteins in bacterial pathogens
Abstract
Background: Prediction of bacterial virulent protein sequences has implications for identification and characterization of novel virulence-associated factors, finding novel drug/vaccine targets against proteins indispensable to pathogenicity, and understanding the complex virulence mechanism in pathogens.
Results: In the present study we propose a bacterial virulent protein prediction method based on bi-layer cascade Support Vector Machine (SVM). The first layer SVM classifiers were trained and optimized with different individual protein sequence features like amino acid composition, dipeptide composition (occurrences of the possible pairs of ith and i+1th amino acid residues), higher order dipeptide composition (pairs of ith and i+2nd residues) and Position Specific Iterated BLAST (PSI-BLAST) generated Position Specific Scoring Matrices (PSSM). In addition, a similarity-search based module was also developed using a dataset of virulent and non-virulent proteins as BLAST database. A five-fold cross-validation technique was used for the evaluation of various prediction strategies in this study. The results from the first layer (SVM scores and PSI-BLAST result) were cascaded to the second layer SVM classifier to train and generate the final classifier. The cascade SVM classifier was able to accomplish an accuracy of 81.8%, covering 86% area in the Receiver Operator Characteristic (ROC) plot, better than that of either of the layer one SVM classifiers based on single or multiple sequence features.
Conclusion: VirulentPred is a SVM based method to predict bacterial virulent proteins sequences, which can be used to screen virulent proteins in proteomes. Together with experimentally verified virulent proteins, several putative, non annotated and hypothetical protein sequences have been predicted to be high scoring virulent proteins by the prediction method. VirulentPred is available as a freely accessible World Wide Web server - VirulentPred, at http://bioinfo.icgeb.res.in/virulent/.
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2254373/bin/1471-2105-9-62-1.gif)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2254373/bin/1471-2105-9-62-2.gif)
![Figure 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2254373/bin/1471-2105-9-62-3.gif)
![Figure 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2254373/bin/1471-2105-9-62-4.gif)
![Figure 5](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2254373/bin/1471-2105-9-62-5.gif)
Similar articles
-
VirulentPred 2.0: An improved method for prediction of virulent proteins in bacterial pathogens.Protein Sci. 2023 Dec;32(12):e4808. doi: 10.1002/pro.4808. Protein Sci. 2023. PMID: 37872744 Free PMC article.
-
FaaPred: a SVM-based prediction method for fungal adhesins and adhesin-like proteins.PLoS One. 2010 Mar 15;5(3):e9695. doi: 10.1371/journal.pone.0009695. PLoS One. 2010. PMID: 20300572 Free PMC article.
-
A machine learning based method for the prediction of secretory proteins using amino acid composition, their order and similarity-search.In Silico Biol. 2008;8(2):129-40. In Silico Biol. 2008. PMID: 18928201
-
CyclinPred: a SVM-based method for predicting cyclin protein sequences.PLoS One. 2008 Jul 2;3(7):e2605. doi: 10.1371/journal.pone.0002605. PLoS One. 2008. PMID: 18596929 Free PMC article.
-
ESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST.Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W414-9. doi: 10.1093/nar/gkh350. Nucleic Acids Res. 2004. PMID: 15215421 Free PMC article.
Cited by
-
Secretome analysis and virulence assessment in Abiotrophia defectiva.J Oral Microbiol. 2024 Feb 12;16(1):2307067. doi: 10.1080/20002297.2024.2307067. eCollection 2024. J Oral Microbiol. 2024. PMID: 38352067 Free PMC article.
-
In silico exploration of hypothetical proteins in Neisseria gonorrhoeae for identification of therapeutic targets.In Silico Pharmacol. 2024 Feb 5;12(1):10. doi: 10.1007/s40203-023-00186-w. eCollection 2024. In Silico Pharmacol. 2024. PMID: 38327876
-
Computational design of experimentally validated multi-epitopes vaccine against hepatitis E virus: An immunological approach.PLoS One. 2023 Dec 14;18(12):e0294663. doi: 10.1371/journal.pone.0294663. eCollection 2023. PLoS One. 2023. PMID: 38096182 Free PMC article.
-
Immunoinformatics and reverse vaccinology approach in designing a novel highly immunogenic multivalent peptide-based vaccine against the human monkeypox virus.Front Mol Biosci. 2023 Nov 22;10:1295817. doi: 10.3389/fmolb.2023.1295817. eCollection 2023. Front Mol Biosci. 2023. PMID: 38074091 Free PMC article.
-
A comprehensive investigation of protein expression profiles in L. monocytogenes exposed to thermal abuse, mild acid, and salt stress conditions.Front Microbiol. 2023 Oct 9;14:1271787. doi: 10.3389/fmicb.2023.1271787. eCollection 2023. Front Microbiol. 2023. PMID: 37876777 Free PMC article.
References
-
- Brogden KA, Roth JA, Stanton TB, Bolin CA, Minion FC, Wannemuehler MJ. Virulence Mechanisms of Bacterial Pathogens. 3. ASM Press, Washington DC; 2000.
-
- Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM, McKenney K, Sutton GG, FitzHugh W, Fields CA, Gocayne JD, Scott JD, Shirley R, Liu LI, Glodek A, Kelley JM, Weidman JF, Phillips CA, Spriggs T, Hedblom E, Cotton MD, Utterback TR, Hanna MC, Nguyen DT, Saudek DM, Brandon RC, Fine LD, Fritchman JL, Fuhrmann JL, Geoghagen NSM, Gnehm CL, McDonald LA, Small KV, Fraser CM, Smith HO, Venter JC. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995;269:496–512. doi: 10.1126/science.7542800. - DOI - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials