Title
A Gaussian Model for Feature Selection in Protein Fold Recognition
Date Issued
27 December 2018
Access level
metadata only access
Resource Type
conference paper
Author(s)
Shiguihara-Juárez P.
University of Pittsburgh
Publisher(s)
Institute of Electrical and Electronics Engineers Inc.
Abstract
Protein fold recognition is an important task to discover new biological functions of proteins. In this context, machine learning techniques have been used to protein fold recognition, stating this task as a classification problem. However, in many cases, the similarity of patterns to protein fold recognition becomes this process in a complex task, limiting the performance of the machine learning techniques. In this paper, we propose a feature selection method to support machine learning methods for protein fold recognition, using gaussian distributions in the process of features analysis. We cluster features by gaussian distributions. These clusters give information to reduce the dimensionality of the features. After that, we use baselines classifiers to protein fold recognition, using a well-known dataset for this task. The results suggest that the clustering and reduction of dimensionality of features using gaussian distribution can help to improve the accuracy of machine learning techniques on this task.
Language
English
OCDE Knowledge area
Ciencias de la computación
Scopus EID
2-s2.0-85061483159
ISBN of the container
9781538683743
Conference
Proceedings of the 2018 IEEE Sciences and Humanities International Research Conference, SHIRCON 2018
Sources of information:
Directorio de Producción Científica
Scopus