Title
On Graph Construction for Classification of Clinical Trials Protocols Using Graph Neural Networks
Date Issued
01 January 2022
Access level
open access
Resource Type
conference paper
Author(s)
University of Geneva
Publisher(s)
Springer Science and Business Media Deutschland GmbH
Abstract
A recent trend in health-related machine learning proposes the use of Graph Neural Networks (GNN’s) to model biomedical data. This is justified due to the complexity of healthcare data and the modelling power of graph abstractions. Thus, GNN’s emerge as the natural choice to learn from increasing amounts of healthcare data. While formulating the problem, however, there are usually multiple design choices and decisions that can affect the final performance. In this work, we focus on Clinical Trial (CT) protocols consisting of hierarchical documents, containing free text as well as medical codes and terms, and design a classifier to predict each CT protocol termination risk as “low” or “high”. We show that while using GNN’s to solve this classification task is very successful, the way the graph is constructed is also of importance and one can benefit from making a priori useful information more explicit. While a natural choice is to consider each CT protocol as an independent graph and pose the problem as a graph classification, consistent performance improvements can be achieved by considering them as super-nodes in one unified graph and connecting them according to some metadata, like similar medical condition or intervention, and finally approaching the problem as a node classification task rather than graph classification. We validate this hypothesis experimentally on a large-scale manually labeled CT database. This provides useful insights on the flexibility of graph-based modeling for machine learning in the healthcare domain.
Start page
249
End page
259
Volume
13263 LNAI
Language
English
OCDE Knowledge area
Ciencias del cuidado de la salud y servicios (administración de hospitales, financiamiento)
Sistemas de automatización, Sistemas de control
Subjects
Scopus EID
2-s2.0-85135030396
ISBN
9783031093418
ISSN of the container
03029743
ISBN of the container
978-303109341-8
Conference
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Sources of information:
Directorio de Producción Científica
Scopus