Title
Data set editing by ordered projection
Date Issued
01 January 2001
Access level
open access
Resource Type
journal article
Author(s)
Universidad de Sevilla
Publisher(s)
IOS Press
Abstract
This paper presents a new approach to data set editing. The algorithm (EOP: Editing by Ordered Projection) has some interesting characteristics: important reduction of the number of examples from the database; lower computational cost (O(mn \log n)) with respect to other typical algorithms due to the absence of distance calculations; conservation of the decision boundaries, especially from the point of view of the application of axis-parallel classifiers. The performance of EOP is analysed in two ways: percentage of reduction and classification. EOP has been compared to IB2, ENN and SHRINK concerning the percentage of reduction and the computational cost. In addition, we have analysed the accuracy of k-NN and C4.5 after applying the reduction techniques. An extensive empirical study using databases with continuous attributes from the UCI repository shows that EOP is a valuable preprocessing method for the later application of any axis-parallel learning algorithm. © 2001-IOS Press. All rights reserved.
Start page
405
End page
417
Volume
5
Issue
5
Language
English
OCDE Knowledge area
Ciencias de la computación
Ciencias de la Información
Subjects
Scopus EID
2-s2.0-33845969697
Source
Intelligent Data Analysis
ISSN of the container
1088467X
Sources of information:
Directorio de Producción Científica
Scopus