Title
Prediction of mitochondrial matrix protein structures based on feature selection and fragment assembly
Date Issued
03 April 2012
Access level
open access
Resource Type
conference paper
Author(s)
Asencio-Cortés G.
Márquez Chamorro A.
Ruiz R.
Pablo de Olavide University
University of Ciego de Ávila, Cuba
Abstract
Protein structure prediction consists in determining the thre-e-dimensional conformation of a protein based only on its amino acid sequence. This is currently a difficult and significant challenge in structural bioinformatics because these structures are necessary for drug designing. This work proposes a method that reconstructs protein structures from protein fragments assembled according to their physico-chemical similarities, using information extracted from known protein structures. Our prediction system produces distance maps to represent protein structures, which provides more information than contact maps, which are predicted by many proposals in the literature. Most commonly used amino acid physico-chemical properties are hydrophobicity, polarity and charge. In our method, we performed a feature selection on the 544 properties of the AAindex repository, resulting in 16 properties which were used to predictions. We tested our proposal on 74 mitochondrial matrix proteins with a maximum sequence identity of 30% obtained from the Protein Data Bank. We achieved a recall of 0.80 and a precision of 0.79 with an 8-angstrom cut-off and a minimum sequence separation of 7 amino acids. Finally, we compared our system with other relevant proposal on the same benchmark and we achieved a recall improvement of 50.82%. Therefore, for the studied proteins, our method provides a notable improvement in terms of recall. © 2012 Springer-Verlag.
Start page
156
End page
167
Volume
7246 LNCS
Language
English
OCDE Knowledge area
Bioinformática
Subjects
Scopus EID
2-s2.0-84859141632
Source
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Resource of which it is part
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISSN of the container
03029743
ISBN of the container
978-364229065-7
Conference
10th European Conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, EvoBIO 2012
Sponsor(s)
University of Malaga, School of Computer Science, University of Malaga, School of Telecommunications, Malaga Convention Bureau, Edinburgh Napier Univ., Inst. Informatics Digit. Innovating
Sources of information:
Directorio de Producción Científica
Scopus