View Single Post
  #1  
Unread 03-08-2015, 01:23 PM
nmrlearner's Avatar
nmrlearner nmrlearner is offline
Senior Member
 
Join Date: Jan 2005
Posts: 23,175
Points: 193,617, Level: 100
Points: 193,617, Level: 100 Points: 193,617, Level: 100 Points: 193,617, Level: 100
Level up: 0%, 0 Points needed
Level up: 0% Level up: 0% Level up: 0%
Activity: 50.7%
Activity: 50.7% Activity: 50.7% Activity: 50.7%
Last Achievements
Award-Showcase
NMR Credits: 0
NMR Points: 0
Downloads: 0
Uploads: 0
Default Application of Data Mining Tools for Classification of Protein Structural Class from Residue Based Averaged NMR Chemical Shifts

Application of Data Mining Tools for Classification of Protein Structural Class from Residue Based Averaged NMR Chemical Shifts

Publication date: Available online 7 March 2015
Source:Biochimica et Biophysica Acta (BBA) - Proteins and Proteomics

Author(s): Arun.V. Kumar , Rehana F.M. Ali , Yu Cao , V.V. Krishnan

The number of protein sequences deriving from genome sequencing projects is outpacing our knowledge about the function of these proteins. With the gap between experimentally characterized and uncharacterized proteins continuing to widen, it is necessary to develop new computational methods and tools for protein structural information that is directly related to function. Nuclear magnetic resonance (NMR) provides powerful means to determine three-dimensional structures of proteins in the solution state. However, translation of the NMR spectral parameters to even low-resolution structural information such as protein class requires multiple time consuming steps. In this paper, we present an unorthodox method to predict the protein structural class directly by using the residue’s averaged chemical shifts (ACS) based on machine learning algorithms. Experimental chemical shift information from 1491 proteins obtained from Biological Magnetic Resonance Bank (BMRB) and their respective protein structural classes derived from structural classification of proteins (SCOP) were used to construct a data set with 119 attributes and 5 different classes. Twenty four different classification schemes were evaluated using several performance measures. Overall the residue based ACS values can predict the protein structural classes with 80 % accuracy measured by Matthew Correlation coefficient. Specifically protein classes defined by mixed ?? or small proteins are classified with > 90% correlation. Our results indicate that this NMR-based method can be utilized as a low-resolution tool for protein structural class identification without any prior chemical shift assignments.
Graphical abstract








More...
Reply With Quote


Did you find this post helpful? Yes | No