Title
Soft constraints for pattern mining
Date Issued
01 April 2015
Access level
open access
Resource Type
journal article
Author(s)
University of Caen, Campus II Côte de Nacre
Publisher(s)
Kluwer Academic Publishers
Abstract
Constraint-based pattern discovery is at the core of numerous data mining tasks. Patterns are extracted with respect to a given set of constraints (frequency, closedness, size, etc). In practice, many constraints require threshold values whose choice is often arbitrary. This difficulty is even harder when several thresholds are required and have to be combined. Moreover, patterns barely missing a threshold will not be extracted even if they may be relevant. The paper advocates the introduction of softness into the pattern discovery process. By using Constraint Programming, we propose efficient methods to relax threshold constraints as well as constraints involved in patterns such as the top-k patterns and the skypatterns. We show the relevance and the efficiency of our approach through a case study in chemoinformatics for discovering toxicophores.
Start page
193
End page
221
Volume
44
Issue
2
Language
English
OCDE Knowledge area
Ingeniería ambiental
Subjects
Scopus EID
2-s2.0-84925292002
Source
Journal of Intelligent Information Systems
ISSN of the container
09259902
Sources of information:
Directorio de Producción Científica
Scopus