Skip to yearly menu bar Skip to main content


Poster

HEALNet: Multimodal Fusion for Heterogeneous Biomedical Data

Konstantin Hemker · Nikola Simidjievski · Mateja Jamnik

East Exhibit Hall A-C #3705
[ ] [ Project Page ]
Wed 11 Dec 11 a.m. PST — 2 p.m. PST

Abstract:

Technological advances in medical data collection, such as high-resolution histopathology and high-throughput genomic sequencing, have contributed to the rising requirement for multi-modal biomedical modelling, specifically for image, tabular, and graph data. Most multi-modal deep learning approaches use modality-specific architectures that are often trained separately and cannot capture the crucial cross-modal information that motivates the integration of different data sources. This paper presents the Hybrid Early-fusion Attention Learning Network (HEALNet) – a flexible multi-modal fusion architecture, which a) preserves modality-specific structural information, b) captures the cross-modal interactions and structural information in a shared latent space, c) can effectively handle missing modalities during training and inference, and d) enables intuitive model inspection by learning on the raw data input instead of opaque embeddings. We conduct multi-modal survival analysis on Whole Slide Images and Multi-omic data on four cancer datasets from The Cancer Genome Atlas (TCGA). HEALNet achieves state-of-the-art performance, substantially improving over uni-modal and multi-modal fusion baselines whilst being robust in scenarios with missing modalities.

Live content is unavailable. Log in and register to view live content