Correlated mutations distinguish misfolded and properly folded proteins release_xui43uzbjzhfddyyxilgq4pa64

by Paweł Woźniak, Malgorzata Kotulska, Gert Vriend

Abstract

Knowledge about the three dimensional structure of proteins is crucial in order to learn about their behavior, stability, or role as a target in drug design. Unfortunately, traditional experimental methods used in structure determination such as X-ray crystallography and NMR are costly and time-consuming. Therefore, computational methods that allow for protein structure reconstruction from sequence only are greatly desired. One of these is the recently developed direct coupling analysis (DCA) method [1, 2] which achieves the best results in residue-residue contact prediction from multiple sequence alignments only. Predicted contacts are used as restraints in the reconstruction of the three-dimensional structure of a protein. Unfortunately, the accuracy of DCA methods is on the order of 40% among the 100 strongest predicted contacts, which is insufficient for ab initio protein structure reconstruction. However, the results of DCA can support protein structure reconstruction in a different way. Our results show that DCA can indicate the best protein structure among its structural variants by the prediction of residue-residue contacts [3]. We counted the number of correctly predicted contacts within the strongest 100 DCA predictions for a set of obsolete PDB entries and their successors and for 22 proteins for which the Decoys 'R' Us database [4] provided properly folded and misfolded structures. These numbers were related to structure similarity scores, such as RMSD or TM-score [5]. DCA correctly predicts significantly more contacts for properly folded structures than for misfolded ones. Our method works much better for structures determined with X-ray crystallography than with the NMR spectroscopy [3]. The method will not detect misfolded proteins per se, but when a protein structure experimentalist needs to choose between alternative folds for the same protein, DCA can help. [1] F. Morcos et al., Direct-coupling analysis of residue coevolution captures native contacts across many protein families, 2011, Proc Natl Acad Sci U S A 108(49):E1293-301. [2] C. Feinauer et al., Improving contact prediction along three dimensions, 2014, PLoS Comput Biol., 10(10):e1003847. [3] P.P. Wozniak, G. Vriend, M. Kotulska, Correlated mutations select misfolded from properly folded proteins, 2016, Bioinformatics, (article accepted). [4] R. Samudrala, M. Levitt, Decoys 'R' Us: A database of incorrect protein conformations to improve protein structure prediction, 2000, Protein Science 9: 1399-1401. [5] Y. Zhang, J. Skolnick, TM-align: A protein structure alignment algorithm based on TM-score, 2005, Nucleic Acids Research, 33: 2302-2309.
In application/xml+jats format

Released as a post by PeerJ
Release Date 2017-01-31
Publisher PeerJ

Known Files and URLs

application/pdf  121.6 kB
sha1:295285760655dcaa30c8...
web.archive.org (webarchive)
peerj.com (web)
Read Full Text
Type  post
Stage   unknown
Date   2017-01-31
Work Entity
grouping other versions (eg, pre-print) and variants of this release
Cite This Release
Fatcat Bits

State is "active". Revision:
e841faea-1cd8-4ebc-943b-d4dc9f3cba94
As JSON object via API