Matrix-assisted laser desorption/ionization (MALDI) imaging mass spectrometry (IMS) is a technique that can reveal powerful insights into the correlation between molecular distributions and histological features. Due to their high-dimensional, hierarchical and spatial nature, MALDI IMS datasets present numerous statistical challenges. In collaboration with the bioimaging team at GlaxoSmithKline (GSK), we have developed special purpose statistical workflows in R that provide end-to-end support for the entire MALDI IMS analysis pipeline, from study design and assay quantification to functional pharmacology. These applications leverage numerous R packages, with a particular focus on the “tidyverse” and “tidymodels” ecosystems due to their modularity and interconnectedness (to protect GSK’s intellectual property, we are currently unable to share our code). Our workflows include robust smoothing and estimation of calibration curves; non-trivial animal and tissue sample size calculations via in silico experiments; and AI/ML implementations for prediction of drug effects from the high-dimensional molecular space. These solutions addressed unique biological and quantitative challenges, and yielded actionable insights for GSK’s bioimaging team.
Bioimaging, R workflow, high dimensional data
The MALDI technology enables the mapping of molecular profiles to histology.
All studies were conducted in accordance with the GSK Policy on the Care, Welfare and Treatment of Laboratory Animals and were reviewed by the Institutional Animal Care and Use Committee either at GSK or by the ethical review process at the institution where the work was performed.
In this note, we will focus on the study design and functional pharmacology workflows. We have also developed a workflow for assay quantification and calibration.
MALDI datasets are hierarchical, with tissue sections nested in animals. We simulated from historical data to develop guidelines for animal and tissue section sample sizes.
For the functional pharmacology analysis, the goal is to use ions to predict binary drug response