Title
Discovery of motifs to forecast outlier occurrence in time series
Date Issued
01 September 2011
Access level
open access
Resource Type
journal article
Author(s)
Universidad de Pablo de Olavide
Abstract
The forecasting process of real-world time series has to deal with especially unexpected values, commonly known as outliers. Outliers in time series can lead to unreliable modeling and poor forecasts. Therefore, the identification of future outlier occurrence is an essential task in time series analysis to reduce the average forecasting error. The main goal of this work is to predict the occurrence of outliers in time series, based on the discovery of motifs. In this sense, motifs will be those pattern sequences preceding certain data marked as anomalous by the proposed metaheuristic in a training set. Once the motifs are discovered, if data to be predicted are preceded by any of them, such data are identified as outliers, and treated separately from the rest of regular data. The forecasting of outlier occurrence has been added as an additional step in an existing time series forecasting algorithm (PSF), which was based on pattern sequence similarities. Robust statistical methods have been used to evaluate the accuracy of the proposed approach regarding the forecasting of both occurrence of outliers and their corresponding values. Finally, the methodology has been tested on six electricity-related time series, in which most of the outliers were properly found and forecasted. © 2011 Elsevier B.V. All rights reserved.
Start page
1652
End page
1665
Volume
32
Issue
12
Language
English
OCDE Knowledge area
Ciencias de la computación
Ingeniería de sistemas y comunicaciones
Subjects
Scopus EID
2-s2.0-79959216499
Source
Pattern Recognition Letters
ISSN of the container
01678655
Source funding
Ministerio de Ciencia y Tecnología
Sponsor(s)
The authors want to acknowledge the invaluable help provided by Prof. R. A. Maronna regarding robust statistical methods. The authors want also to thank Prof. S. Gelper for having provided useful material to make robust predictions. The research was partially supported by the Spanish Ministry of Science and Technology under project TIN2007-68084-C-00 , and Junta de Andalucía under project P07-TIC-02611 .
Sources of information:
Directorio de Producción Científica
Scopus