Title
Natural encoding for evolutionary supervised learning
Date Issued
01 August 2007
Access level
open access
Resource Type
journal article
Author(s)
Giráldez R.
Riquelme J.C.
Universidad Pablo de Olavide
Abstract
Some of the most influential factors in the quality of the solutions found by an evolutionary algorithm (EA) are a correct coding of the search space and an appropriate evaluation function of the potential solutions. EAs are often used to learn decision rules from datasets, which are encoded as individuals in the genetic population. In this paper, the coding of the search space for the obtaining of those decision rules is approached, i.e., the representation of the individuals of the genetic population and also the design of specific genetic operators. Our approach, called "natural coding,"uses one gene per feature in the dataset (continuous or discrete). The examples from the datasets are also encoded into the search space, where the genetic population evolves, and therefore the evaluation process is improved substantially. Genetic operators for the natural coding are formally defined as algebraic expressions. Experiments with several datasets from the University of California at Irvine (UCI) machine learning repository show that as the genetic operators are better guided through the search space, the number of rules decreases considerably while maintaining the accuracy, similar to that of hybrid coding, which joins the well-known binary and real representations to encode discrete and continuous attributes, respectively. The computational cost associated with the natural coding is also reduced with regard to the hybrid representation. Our algorithm, HIDER*, has been statistically tested against C4.5 and C4.5 Rules, and performed well. The knowledge models obtained are simpler, with very few decision rules, and therefore easier to understand, which is an advantage in many domains. The experiments with high-dimensional datasets showed the same good behavior, maintaining the quality of the knowledge model with respect to prediction accuracy. © 2006 IEEE.
Start page
466
End page
479
Volume
11
Issue
4
Language
English
OCDE Knowledge area
Otras ingenierías y tecnologías Educación general (incluye capacitación, pedadogía)
Scopus EID
2-s2.0-34547906519
Source
IEEE Transactions on Evolutionary Computation
ISSN of the container
1089778X
Sponsor(s)
Manuscript received December 18, 2005; revised April 22, 2006. This work was supported in part by the Spanish Research Agency CICYT under Grant TIN2004-00159 and in part by Junta de Andalucía (III Research Plan). J. S. Aguilar-Ruiz and R. Giráldez are with the School of Engineering, Pablo de Olavide University, 41013 Seville, Spain (e-mail: jsagurui@upo.es; rgirroj@upo.es). J. C. Riquelme is with the Department of Computer Science, University of Seville, 41004 Seville, Spain (e-mail: riquelme@lsi.us.es). Digital Object Identifier 10.1109/TEVC.2006.883466
Sources of information: Directorio de Producción Científica Scopus