Bridging Imaging and Clinical Scores in Parkinson&amp;#039;s Progression Via Multimodal Self-Supervised Deep Learning

Francisco J. Martinez-Murcia; Juan Eloy Arco; Carmen Jimenez-Mesa; Fermin Segovia; Ignacio A. Illan; Javier Ramirez; Juan Manuel Gorriz

doi:10.1142/S0129065724500436

Bridging Imaging and Clinical Scores in Parkinson’s Progression Via Multimodal Self-Supervised Deep Learning

LATiDOS

neurodegenerative disease progression

Alzheimer's Disease

featured

Authors

Affiliations

Francisco J. Martinez-Murcia

Dept. of Signal Theory, Networking and Communications

Andalusian Institute for Data Science and Artificial Intelligence (DASCI)

Juan Eloy Arco

Dept. of Signal Theory, Networking and Communications

Andalusian Institute for Data Science and Artificial Intelligence (DASCI)

Carmen Jimenez-Mesa

Dept. of Signal Theory, Networking and Communications

Andalusian Institute for Data Science and Artificial Intelligence (DASCI)

Fermin Segovia

Dept. of Signal Theory, Networking and Communications

Andalusian Institute for Data Science and Artificial Intelligence (DASCI)

Ignacio A. Illan

Dept. of Signal Theory, Networking and Communications

Andalusian Institute for Data Science and Artificial Intelligence (DASCI)

Javier Ramirez

Dept. of Signal Theory, Networking and Communications

Andalusian Institute for Data Science and Artificial Intelligence (DASCI)

Juan Manuel Gorriz

Dept. of Signal Theory, Networking and Communications

Andalusian Institute for Data Science and Artificial Intelligence (DASCI)

Published

May 22, 2024

Abstract

Neurodegenerative diseases pose a formidable challenge to medical research, demanding a nuanced understanding of their progressive nature. In this regard, latent generative models can effectively be used in a data-driven modeling of different dimensions of neurodegeneration, framed within the context of the manifold hypothesis. This paper proposes a joint framework for a multi-modal, common latent generative model to address the need for a more comprehensive understanding of the neurodegenerative landscape in the context of Parkinson’s disease (PD). The proposed architecture uses coupled variational autoencoders (VAEs) to joint model a common latent space to both neuroimaging and clinical data from the Parkinson’s Progression Markers Initiative (PPMI). Alternative loss functions, different normalization procedures, and the interpretability and explainability of latent generative models are addressed, leading to a model that was able to predict clinical symptomatology in the test set, as measured by the unified Parkinson’s disease rating scale (UPDRS), with R2 up to 0.86 for same-modality and 0.441 cross-modality (using solely neuroimaging). The findings provide a foundation for further advancements in the field of clinical research and practice, with potential applications in decision-making processes for PD. The study also highlights the limitations and capabilities of the proposed model, emphasizing its direct interpretability and potential impact on understanding and interpreting neuroimaging patterns associated with PD symptomatology.

1. INTRODUCTION

Parkinson’s disease (PD) affects more than 6.2 million people worldwide. It is characterized by a loss of dopamine-producing neurons in the brain, causing tremors, rigidity, and cognitive decline among other symptoms. Ioflupane I-123 binds to the presynaptic dopamine transporters (DaTs), allowing to visualize and quantify DaT concentration at the striata, which is key to characterize PD in SPECT imaging. Early detection and monitoring of PD progression could be tackled by modelling a self-supervised, low-dimensional representation of a FPCIT dataset, which enables us to longitudinally compare images and identify patterns of change that are indicative of neurodegeneration.

2. METHODOLOGY

This work proposes a novel approach to detect and quantify subtle changes in DaT concentration and distribution in the brain using 3D convolutional variational AEs (CVAEs). The composite variables of the latent space are then modelled using Decission Trees and XGBoost regression, and the spaces are interpreted through visualization and SHAP.

2.1 DATASET

Data used for this work was obtained from the Parkinson’s Progression Markers Initiative. Specific cohort from the PPMI database that follows individuals diagnosed with PD and Healthy Control subjects (HC) during 5 years, recording symptomatology via the MDS-UPDRS scale. All the Ioflupane I-123 SPECT scans have been intensity normalized to non-specific areas and a sigmoid function was applied to compress the high-intensity values.

2.2 3D CONVOLUTIONAL VAE

Figure 1: Structure of the convolutional VAE

2.3. EVALUATION

Use the μ parameters of the latent distribution Nd(μd,σd) as features.
K-Means features (KMF) are generated to capture important data characteristics.
Regression using Decision Trees (DT) and XGBoost.
Performance metrics such as Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and coefficient of determination R2 are computed using 10-fold cross-validation.
SHapley Additive exPlanations (SHAP) is applied to the outputs of each system.

3. RESULTS

R2>0 for almost all target UPDRS categories and d-dimensional latent spaces.
Low latent dimensionality (3, 8) is generally for individual perceived symptomatology (UPDRS 1, 2 and 4).
UPDRS - total is better accounted for by the 20-D model (MAE = 12.21, R2=0.26), pointing at higher complexity of the composite and stronger relationship with FPCIT patterns.
UPDRS 3, more dependent on measured motor symptoms, benefits from the 20-D space.

Figure 3: Interpretability of the manifold

For top performing 20-D XGB model of UPDRS (total), SHAP reveals:
- top-3 features that contribute to the output of the algorithm are latent variables 2, 10 and 0.
- Approx. linear dependency between importance to the algorithm output and values of the latent variables.
- Composition of variables 2 and 10 accounts for relevant characteristics of Ioflupane SPECT imaging: the general intensity of the striata, the separation between them and the uptake ratio between its anterior and posterior parts.
- Anterior-posterior uptake ratio more related to progression in the first symptoms. Average intensity for the automatic diagnosis of PD.

5. CONCLUSIONS

The latent features of trained CVAEs are related to different aspects of the MDS-UPDRS scale with R2>0.20.
Best performance for UPDRS (total) with latent variables 2 and 10.
Anterior/Posterior ratio are more related to progression of the first symptoms. Average striatal intensity for the diagnosis of PD.

This work paves the way for exploratory analysis of links between neuroimaging patterns and neurological disorders in an hypothesis-free environment.

Citation

For attribution, please cite this work as:

Martinez-Murcia, Francisco J., Juan Eloy Arco, Carmen Jimenez-Mesa, Fermin Segovia, Ignacio A. Illan, Javier Ramirez, and Juan Manuel Gorriz. 2024. “Bridging Imaging and Clinical Scores in Parkinson’s Progression Via Multimodal Self-Supervised Deep Learning.” International Journal of Neural Systems 34 (8): 2450043. https://doi.org/10.1142/S0129065724500436.