Title
A Comparison of Learnheuristics Using Different Reward Functions to Solve the Set Covering Problem
Date Issued
01 January 2021
Access level
metadata only access
Resource Type
conference paper
Author(s)
Crawford B.
Cisternas-Caneo F.
Tapia D.
de la Fuente-Mella H.
Palma W.
Lemus-Romani J.
Castillo M.
Becerra-Rozas M.
Pontificia Universidad Católica de Valparaíso
Publisher(s)
Springer Science and Business Media Deutschland GmbH
Abstract
The high computational capacity that we have thanks to the new technologies allows us to communicate two great worlds such as optimization methods and machine learning. The concept behind the hybridization of both worlds is called Learnheuristics which allows to improve optimization methods through machine learning techniques where the input data for learning is the data produced by the optimization methods during the search process. Among the most outstanding machine learning techniques is Q-Learning whose learning process is based on rewarding or punishing the agents according to the consequences of their actions and this reward or punishment is carried out by means of a reward function. This work seeks to compare different Learnheuristics instances composed by Sine Cosine Algorithm and Q-Learning whose different lies in the reward function applied. Preliminary results indicate that there is an influence on the quality of the solutions based on the reward function applied.
Start page
74
End page
85
Volume
1443
Language
English
OCDE Knowledge area
Robótica, Control automático
Educación general (incluye capacitación, pedadogía)
Sistemas de automatización, Sistemas de control
Subjects
Scopus EID
2-s2.0-85115138840
ISBN
9783030856717
ISSN of the container
18650929
ISBN of the container
978-303085671-7
DOI of the container
10.1007/978-3-030-85672-4_6
Conference
Communications in Computer and Information Science
Source funding
Agencia Nacional de Investigación y Desarrollo
Agenția Națională pentru Cercetare și Dezvoltare
Sponsor(s)
Felipe Cisternas-Caneo and Marcelo Becerra-Rozas are supported by Grant DI Investigación Interdisciplinaria del Pregrado/VRIEA/PUCV/ 039.324/2020. Broderick Crawford and Wenceslao Palma are supported by Grant CON-ICYT /FONDECYT/REGULAR/1210810. Ricardo Soto is supported by Grant CON-ICYT/FONDECYT /REGULAR/1190129. Broderick Crawford, Ricardo Soto and Hanns de la Fuente-Mella are supported by Grant Núcleo de Investigación en Data Ana-lytics/VRIEA /PUCV/039.432/2020. José Lemus-Romani is supported by National Agency for Research and Development (ANID)/Scholarship Program/DOCTORADO NACIONAL /2019-21191692.
Sources of information:
Directorio de Producción Científica
Scopus