Title
Variable neighborhood search for the large phylogeny problem using gene order data
Date Issued
05 July 2017
Access level
metadata only access
Resource Type
conference paper
Author(s)
Ayala-Rincon M.
Universidade de Brasília
Publisher(s)
Institute of Electrical and Electronics Engineers Inc.
Abstract
Computing evolutionary distances using gene order data is a complex combinatory problem; nevertheless, for specific metrics exact polynomial algorithms were proposed, having in many cases non trivial approaches. This scenario can become harder if we want to reconstruct phylogenies based on gene order data: first it is necessary to explore the search space of possible tree structures which is well-known to be exponential; second, it is necessary a method for evaluating the cost of these trees, i.e. to find a labeling of the internal nodes that leads to the most parsimonious cost of a tree under a given evolutionary distance. The latter problem was shown to be NP-hard even for 3 genomes (median problem) under many evolutionary distances. In this paper we propose a variable neighborhood search approach for solving the large phylogeny problem for data based on gene orders. Also, a greedy approach is proposed for the small phylogeny problem aiming to reduce the running time of the Kovac et al. dynamic programming approach. Our proposed algorithms were implemented as the software called HELPHY. Experiments showed that the running time is improved for finding trees with good scores (reversal distance) for the Campanulaceae dataset, and a new tree structure was found having the best known score (double cut and join distance) for the case of Hemiascomycetes dataset.
Start page
59
End page
66
Language
English
OCDE Knowledge area
Bioinformática
Scopus EID
2-s2.0-85027830487
ISBN of the container
978-150904601-0
Conference
2017 IEEE Congress on Evolutionary Computation, CEC 2017 - Proceedings
Sources of information:
Directorio de Producción Científica
Scopus