Title
Optically connected and reconfigurable GPU architecture for optimized peer-to-peer access
Date Issued
01 October 2018
Access level
metadata only access
Resource Type
conference output
Author(s)
Anderson E.
Gazman A.
Azevedo R.
Bergman K.
University of Campinas
Publisher(s)
Association for Computing Machinery
Abstract
Increasing industry interest in the optimization of inter-GPU communication has motivated this work to explore new ways to enable peer-to-peer access. Specifically, this paper investigates how reconfigurable optical links between GPUs in multi-GPU servers can allow for minimized memory transfer latencies for given machine learning applications. Silicon photonics (SiP) is proposed as the enabling technology for such a reconfigurable architecture due to the potential for scalable and cost-efficient production. We evaluated our architecture using traffic obtained from an NVLink-connected 8 GPU server executing a set of machine learning models including AlexNet, DenseNet, NASNet, ResNet, MobileNet, and VGG16. Our results show up to 24.91% reduction of the total relative transmission latency (RTL) between peers.
Language
English
OCDE Knowledge area
Hardware, Arquitectura de computadoras Informática y Ciencias de la Información
Scopus EID
2-s2.0-85060987011
Resource of which it is part
ACM International Conference Proceeding Series
ISBN of the container
9781450364751
Conference
2018 International Symposium on Memory Systems, MEMSYS 2018 Alexandria 1 October 2018 through 4 October 2018
Sponsor(s)
This work was supported by the São Paulo Research Foundation (FAPESP-2014/016429), CAPES (PROCAD 2966/2014) and the NSF IGERT Program (DGE-1069240).
Sources of information: Directorio de Producción Científica Scopus