TOUCHSTONE II: a new approach to ab initio protein structure prediction
- PMID: 12885659
- PMCID: PMC1303233
- DOI: 10.1016/S0006-3495(03)74551-2
Abstract
We have developed a new combined approach for ab initio protein structure prediction. The protein conformation is described as a lattice chain connecting Cα atoms, with attached Cβ atoms and side-chain centers of mass. The model force field includes various short-range and long-range knowledge-based potentials derived from a statistical analysis of the regularities of protein structures. The combination of these energy terms is optimized by maximizing, over 30 × 60,000 decoys, the correlation between the root mean square deviation (RMSD) to native and the energy, as well as the energy gap between the native structure and the decoy ensemble. To accelerate the conformational search, a newly developed parallel hyperbolic sampling algorithm with a composite movement set is used in the Monte Carlo simulations. Using this strategy, we successfully fold 41/100 small proteins (36 to 120 residues), with predicted structures having an RMSD from native below 6.5 Å in the top five cluster centroids. To fold larger proteins and to improve the folding yield of small proteins, we incorporate into the basic force field side-chain contact predictions from our threading program PROSPECTOR, with homologous proteins excluded from the database. With these threading-based restraints, the program can fold 83/125 test proteins (36 to 174 residues), with structures having an RMSD to native below 6.5 Å in the top five cluster centroids. This shows that predicted tertiary restraints significantly improve folding, especially when the accuracy of side-chain contact prediction is >20%. For native fold selection, we introduce quantities that depend on the cluster density and on the combination of energy and free energy. These show higher discriminative power in selecting the native structure than the previously used cluster energy or cluster size, and they can be used for native structure identification in blind simulations.
These procedures are readily automated and are being implemented on a genomic scale.
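The weight optimization described above can be illustrated with a minimal sketch. It assumes that the force field is a weighted sum of energy terms and scores one candidate weight vector by two quantities: the Pearson correlation between decoy energy and RMSD to native, and the native-to-decoy energy gap in units of the decoy standard deviation. The function names, the use of Pearson correlation, and the gap normalization are assumptions made for illustration; the abstract does not specify how the two criteria are computed or combined.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def combined_energy(weights, terms):
    """Weighted sum of energy terms for one conformation (assumed linear form)."""
    return sum(w * t for w, t in zip(weights, terms))


def score_weights(weights, decoy_terms, decoy_rmsds, native_terms):
    """Return (correlation, gap) for one weight vector.

    correlation: Pearson r between decoy energy and RMSD to native;
                 higher means low-RMSD decoys tend to have lower energy.
    gap:         (mean decoy energy - native energy) / decoy energy std;
                 higher means the native sits further below the decoys.
    """
    energies = [combined_energy(weights, t) for t in decoy_terms]
    correlation = pearson(energies, decoy_rmsds)
    mean_e = sum(energies) / len(energies)
    std_e = math.sqrt(sum((e - mean_e) ** 2 for e in energies) / len(energies))
    gap = (mean_e - combined_energy(weights, native_terms)) / std_e
    return correlation, gap
```

An optimizer would call `score_weights` for many candidate weight vectors and keep those that raise both values. With toy data in which only the first energy term tracks RMSD, the weights `(1.0, 0.0)` give a higher correlation than `(0.0, 1.0)`.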