Title
Tensor-based reinforcement learning for network routing
Date Issued
01 April 2021
Access level
open access
Resource Type
journal article
Author(s)
Tsai K.C.
Zhuang Z.
Wang J.
Qi Q.
Wang L.C.
Han Z.
University of Houston
Publisher(s)
Institute of Electrical and Electronics Engineers Inc.
Abstract
In the recent years, we have witnessed an explosion of networking applications due to the reasons such as the rapid development of cloud infrastructure, edge computing, and the Internet of Things. Furthermore, those applications become complex, the problem related to the large size of the state space and limited metric collection has emerged. This leads to an urging demand for adaptive management method in network routing. However, the complexity of traditional routing algorithms can be prohibited for practical systems. To overcome this challenge, we propose a novel tensor-based reinforcement learning method to route and schedule the packet flows, which is adaptive and model-free. Moreover, we improve the learning quality and efficiency by combining the Tucker decomposition technique within the learning process so that the machine learning direction can be obtained with low complexity. Finally, simulation results show that our proposed algorithm can achieve better performance under the same training episode and more stable results with less convergence time than conventional routing method, K-shortest path, traditional reinforcement learning approaches (i.e. Q-learning and SARSA) and comparable results to DQL.
Start page
617
End page
629
Volume
15
Issue
3
Language
English
OCDE Knowledge area
Ingeniería de sistemas y comunicaciones Ciencias de la computación
Scopus EID
2-s2.0-85100775196
Source
IEEE Journal on Selected Topics in Signal Processing
ISSN of the container
19324553
Sponsor(s)
Manuscript received June 13, 2020; revised November 8, 2020 and January 7, 2021; accepted January 11, 2021. Date of publication February 1, 2021; date of current version March 29, 2021. This work was supported in part by NSF EARS-1 839 818, CNS-1 717 454, CNS-1 731 424, CNS-1 702 850, USA, Ministry of Science and Technology under the Grant MOST 109-2634-F-009-018 through Pervasive Artificial Intelligence Research (PAIR) Labs, Taiwan, and in part by the Higher Education Sprout Project of the National Chiao Tung University and Ministry of Education, Taiwan. The guest editor coordinating the review of this manuscript and approving it for publication was Dr. Hongyang Chen. (Corresponding author: Kai-Chu Tsai.) Kai-Chu Tsai is with the Department of Electrical and Computer Engineering, University of Houston, Houston, TX USA (e-mail: ktsai3@uh.edu).
Sources of information: Directorio de Producción Científica Scopus