Title
GO-based Functional Dissimilarity of Gene Sets
Date Issued
01 September 2011
Access level
open access
Resource Type
journal article
Author(s)
Pablo de Olavide University
Publisher(s)
BioMed Central Ltd.
Abstract
Background: The Gene Ontology (GO) provides a controlled vocabulary for describing the functions of genes and can be used to evaluate the functional coherence of gene sets. Many functional coherence measures consider each pair of gene functions in a set and produce an output based on all pairwise distances. A single gene can encode multiple proteins that may differ in function. For each functionality, other proteins that exhibit the same activity may also participate. Therefore, identifying the most common function for all of the genes involved in a biological process is important in evaluating the functional similarity of groups of genes, and quantifying functional coherence can help to clarify the role of a group of genes working together. Results: To implement this approach to functional assessment, we present GFD (GO-based Functional Dissimilarity), a novel dissimilarity measure for evaluating groups of genes based on the most relevant functions of the whole set. The measure assigns a numerical value to the gene set for each of the three GO sub-ontologies. Conclusions: Results show that GFD performs robustly when applied to gene sets of known functionality (extracted from KEGG). It performs particularly well on randomly generated gene sets. An ROC analysis reveals that the performance of GFD in evaluating the functional dissimilarity of gene sets is very satisfactory. A comparative analysis against other functional measures, such as GS2 and those presented by Resnik and Wang, also demonstrates the robustness of GFD. © 2011 Díaz-Díaz and Aguilar-Ruiz; licensee BioMed Central Ltd.
Volume
12
Language
English
OECD Knowledge area
Genetics, Heredity
Scopus EID
2-s2.0-80052151544
PubMed ID
Source
BMC Bioinformatics
Sponsor(s)
This research was partially supported by the Ministry of Science and Innovation, projects TIN2007-68084-C02-00 and PCI2006-A7-0575, and by the Junta de Andalucía, projects P07-TIC-02611 and TIC-200. ND thanks Jeffrey Chuang and Kourosh Zarringhalam for helpful discussions.
Sources of information: Directorio de Producción Científica Scopus