Computational protein design as a tool for fold recognition.

Abstract : Computationally designed protein sequences have been proposed as a basis to perform fold recognition and homology searching. To investigate this possibility, an automated procedure is used to completely redesign 24 SH3 proteins and 22 SH2 proteins. We use the experimental backbone coordinates as fixed templates in the folded state and a molecular mechanics model to compute the pairwise interaction energies between all sidechain types and conformations. Energy calculations are done with the Proteins@Home volunteer computing platform. A heuristic algorithm is then used to scan the sequence and conformational space for optimal solutions. We produced 200,000-450,000 sequences for each backbone template. The designed sequences ressemble moderately-distant, natural homologues of the initial templates, according to their identity scores and their similarity with respect to the Pfam sets of SH2 and SH3 domains. Standard homology detection tools document their native-like character: the Conserved Domain Database recognizes 61% (52%) of our low-energy sequences as SH3 (SH2) domains; the SUPERFAMILY, Hidden-Markov Model library recognizes 81% (84%). Conversely, position specific scoring matrices (PSSMs) derived from our designed sequences can be used to detect natural homologues in sequence databases. Within SwissProt, a set of natural SH3 PSSMs detects 772 SH3 domains, for example; our designed PSSMs detect 67% of these, plus one additional sequence and two false positives. If six amino acids involved in substrate binding (a selective pressure not accounted for in our design) are reset to their experimental types, then 77% of the experimental SH3 domains are detected. Results for the SH2 domains are similar. Several directions to improve the method further are discussed.
Document type :
Journal articles
Complete list of metadatas

https://hal-polytechnique.archives-ouvertes.fr/hal-00488182
Contributor : Thomas Simonson <>
Submitted on : Tuesday, June 1, 2010 - 1:47:45 PM
Last modification on : Wednesday, March 27, 2019 - 3:56:02 PM

Identifiers

Collections

Citation

Marcel Schmidt Am Busch, David Mignon, Thomas Simonson. Computational protein design as a tool for fold recognition.. Proteins - Structure, Function and Bioinformatics, Wiley, 2009, 77 (1), pp.139-58. ⟨10.1002/prot.22426⟩. ⟨hal-00488182⟩

Share

Metrics

Record views

203