Title
COPPER - Constraint optimized prefixspan for epidemiological research
Date Issued
01 January 2015
Access level
open access
Resource Type
conference paper
Author(s)
Publisher(s)
Elsevier B.V.
Abstract
Sequential pattern mining, is a data mining technique used to study the temporal evolution of events describing a complex phenomenon. This technique has a limited application due to the high number of common sequences generated by dense datasets. To tackle this problem, we propose COP, an extension of the PrefixSpan algorithm oriented towards optimizing the relevance of the results obtained in the sequential patterns mining process. Indeed, we use multiple and simultaneous constraints that represent the expertise of researchers in a specific domain. Experiments conducted on datasets associated to dengue epidemic monitoring show an improve in result relevance from an expert's point of view, as well as, a considerable speed gains for mining dense datasets.
Start page
433
End page
438
Volume
63
Language
English
OCDE Knowledge area
Ciencias de la información
Salud pública, Salud ambiental
Epidemiología
Subjects
Scopus EID
2-s2.0-84954116180
Source
Procedia Computer Science
ISSN of the container
18770509
Conference
6th International Conference on Emerging Ubiquitous Systems and Pervasive Networks, EUSPN 2015
Sponsor(s)
We would like to acknowledge and thank the Regional epidemiology unit of the French Institute for Public Health Surveillance for the data provided that was used in the experimental portion of this work. Additionally, this paper was written in the context of project financed by FONDECYT.
Sources of information:
Directorio de Producción Científica
Scopus