Related papers: A versatile foundation model for cine cardiac magnetic resonance image analysis tasks

URL: http://arxiv.org/abs/2506.00679v2
Date: Sun, 31 Aug 2025 12:02:44 GMT
Title: A versatile foundation model for cine cardiac magnetic resonance image analysis tasks
Authors: Yunguan Fu, Wenjia Bai, Weixi Yi, Charlotte Manisty, Anish N Bhuva, Thomas A Treibel, James C Moon, Matthew J Clarkson, Rhodri Huw Davies, Yipeng Hu
Abstract summary: We present a versatile foundation model that can perform a range of clinically-relevant image analysis tasks. A multi-view convolution-transformer masked autoencoder, named as CineMA, was trained on 15 million cine images from 74,916 subjects.
Score: 6.488550274514015
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Here we present a versatile foundation model that can perform a range of clinically-relevant image analysis tasks, including segmentation, landmark localisation, diagnosis, and prognostication. A multi-view convolution-transformer masked autoencoder, named as CineMA, was trained on 15 million cine images from 74,916 subjects. The model was validated on multiple image analysis tasks and compared to existing models on >4,500 images from eight independent datasets with diverse population characteristics, representing the largest benchmark study for cine CMR so far. CineMA consistently outperformed conventional convolutional neural networks (CNNs) in delineating ventricular boundaries and estimating ejection fraction, a key measure of cardiac function. The improved performance was preserved, even when the model only used half of fine-tuning data. CineMA also surpassed CNNs in disease detection and matched their performance in long-axis function measurement. Interestingly, we found that CineMA can also detect cardiac changes in systemic diseases, such as diabetes, hypertension and cancer, and can also predict mortality. Finally, we assessed model fairness and demonstrated consistent model performance across demographic subgroups. These findings highlight CineMA's accuracy, learning efficiency, adaptability, and fairness, underscoring its potential as a foundation model for automated cardiac image analysis to support clinical workflow and cardiovascular research. All training and inference code and models are made publicly available at https://github.com/mathpluscode/CineMA.

Related papers

Are Video Models Emerging as Zero-Shot Learners and Reasoners in Medical Imaging? [21.25724100313781]
We evaluate a large vision model (LVM) in a zero-shot setting across four representative tasks. The model can delineate anatomical structures in CT scans and achieve competitive performance on segmentation, denoising, and motion prediction. We evaluate the LVM on 4D CT data from 122 patients, totaling over 1,820 3D CT volumes.
arXiv Detail & Related papers (2025-10-11T15:19:03Z)
Extreme Cardiac MRI Analysis under Respiratory Motion: Results of the CMRxMotion Challenge [56.28872161153236]
Deep learning models have achieved state-of-the-art performance in automated Cardiac Magnetic Resonance (CMR) analysis. The efficacy of these models is highly dependent on the availability of high-quality, artifact-free images. To promote research in this domain, we organized the MICCAI CMRxMotion challenge.
arXiv Detail & Related papers (2025-07-25T11:12:21Z)
Phenotype-Guided Generative Model for High-Fidelity Cardiac MRI Synthesis: Advancing Pretraining and Clinical Applications [9.113410118160438]
We present Cardiac Phenotype-Guided CMR Generation (CPGG), a novel approach for generating diverse CMR data. The CPGG framework consists of two stages: in the first stage, a generative model is trained using cardiac phenotypes derived from CMR data. In the second stage, a masked autoregressive diffusion model, conditioned on these phenotypes, generates high-fidelity CMR cine sequences.
arXiv Detail & Related papers (2025-05-06T11:06:41Z)
Optimizing CNN Architectures for Advanced Thoracic Disease Classification [0.0]
We evaluate various CNN architectures to address challenges like dataset imbalance, variations in image quality, and hidden biases. Our results highlight the potential of CNNs in medical imaging but emphasize that issues like unbalanced datasets and variations in image acquisition methods must be addressed for optimal model performance.
arXiv Detail & Related papers (2025-02-15T00:27:37Z)
A Unified Model for Compressed Sensing MRI Across Undersampling Patterns [69.19631302047569]
We propose a unified MRI reconstruction model robust to various measurement undersampling patterns and image resolutions. Our model improves SSIM by 11% and PSNR by 4 dB over a state-of-the-art CNN (End-to-End VarNet) with 600$\times$ faster inference than diffusion methods.
arXiv Detail & Related papers (2024-10-05T20:03:57Z)
Towards a vision foundation model for comprehensive assessment of Cardiac MRI [11.838157772803282]
We introduce a vision foundation model trained for cardiac magnetic resonance imaging (CMR) assessment. We finetune the model in a supervised way for 9 clinical tasks typical to a CMR workflow. We demonstrate improved accuracy and robustness across all tasks, over a range of available labeled dataset sizes.
arXiv Detail & Related papers (2024-10-02T15:32:01Z)
Classification, Regression and Segmentation directly from k-Space in Cardiac MRI [11.690226907936903]
We propose KMAE, a Transformer-based model specifically designed to process k-space data directly. KMAE can handle critical cardiac disease classification, relevant phenotype regression, and cardiac segmentation tasks. We utilize this model to investigate the potential of k-space-based diagnosis in cardiac MRI.
arXiv Detail & Related papers (2024-07-29T15:35:35Z)
LaMoD: Latent Motion Diffusion Model For Myocardial Strain Generation [5.377722774297911]
We introduce a novel Latent Motion Diffusion model (LaMoD) to predict highly accurate DENSE motions from standard CMR videos. Experimental results demonstrate that our proposed method, LaMoD, significantly improves the accuracy of motion analysis in standard CMR images.
arXiv Detail & Related papers (2024-07-02T12:54:32Z)
CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI [40.11088079783521]
The CMRxRecon2024 dataset is the largest and most protocol-diverse publicly available cardiac k-space dataset. It is acquired from 330 healthy volunteers, covering commonly used modalities, anatomical views, and acquisition trajectories in clinical cardiac MRI.
arXiv Detail & Related papers (2024-06-27T09:50:20Z)
Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images [68.42215385041114]
This paper introduces a novel lightweight multi-level adaptation and comparison framework to repurpose the CLIP model for medical anomaly detection. Our approach integrates multiple residual adapters into the pre-trained visual encoder, enabling a stepwise enhancement of visual features across different levels. Our experiments on medical anomaly detection benchmarks demonstrate that our method significantly surpasses current state-of-the-art models.
arXiv Detail & Related papers (2024-03-19T09:28:19Z)
Successive Subspace Learning for Cardiac Disease Classification with Two-phase Deformation Fields from Cine MRI [36.044984400761535]
This work proposes a lightweight successive subspace learning framework for CVD classification. It is based on an interpretable feedforward design, in conjunction with a cardiac atlas. Compared with 3D CNN-based approaches, our framework achieves superior classification performance with 140$\times$ fewer parameters.
arXiv Detail & Related papers (2023-01-21T15:00:59Z)
Motion-related Artefact Classification Using Patch-based Ensemble and Transfer Learning in Cardiac MRI [5.186000805926489]
We propose an automatic cardiac MRI quality estimation framework using ensemble and transfer learning. Multiple pre-trained models were initialised and fine-tuned on 2-dimensional image patches sampled from the training data. It achieved a classification accuracy of 78.8% and 70.0% on the training set and validation set, respectively.
arXiv Detail & Related papers (2022-10-14T11:31:40Z)
Automated SSIM Regression for Detection and Quantification of Motion Artefacts in Brain MR Images [54.739076152240024]
Motion artefacts in magnetic resonance brain images are a crucial issue. The assessment of MR image quality is fundamental before proceeding with the clinical diagnosis. An automated image quality assessment based on the structural similarity index (SSIM) regression has been proposed here.
arXiv Detail & Related papers (2022-06-14T10:16:54Z)
Preservation of High Frequency Content for Deep Learning-Based Medical Image Classification [74.84221280249876]
An efficient analysis of large amounts of chest radiographs can aid physicians and radiologists. We propose a novel Discrete Wavelet Transform (DWT)-based method for the efficient identification and encoding of visual information.
arXiv Detail & Related papers (2022-05-08T15:29:54Z)
CNN-based Cardiac Motion Extraction to Generate Deformable Geometric Left Ventricle Myocardial Models from Cine MRI [0.0]
We propose a framework for the development of patient-specific geometric models of LV myocardium from cine cardiac MR images. We use the VoxelMorph-based convolutional neural network (CNN) to propagate the isosurface mesh and volume mesh of the end-diastole frame to the subsequent frames of the cardiac cycle.
arXiv Detail & Related papers (2021-03-30T21:34:29Z)
Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for Thoracic Disease Identification [83.6017225363714]
Deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance. For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming. In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z)
Improved Slice-wise Tumour Detection in Brain MRIs by Computing Dissimilarities between Latent Representations [68.8204255655161]
Anomaly detection for Magnetic Resonance Images (MRIs) can be solved with unsupervised methods. We have proposed a slice-wise semi-supervised method for tumour detection based on the computation of a dissimilarity function in the latent space of a Variational AutoEncoder. We show that by training the models on higher resolution images and by improving the quality of the reconstructions, we obtain results which are comparable with different baselines.
arXiv Detail & Related papers (2020-07-24T14:02:09Z)
Deep Generative Model-based Quality Control for Cardiac MRI Segmentation [30.09405692032434]
We propose a novel deep generative model-based framework for quality control of cardiac MRI segmentation. The proposed method achieves high prediction accuracy on two publicly available cardiac MRI datasets.
arXiv Detail & Related papers (2020-06-23T23:15:54Z)
Segmentation of the Myocardium on Late-Gadolinium Enhanced MRI based on 2.5 D Residual Squeeze and Excitation Deep Learning Model [55.09533240649176]
The aim of this work is to develop an accurate automatic segmentation method based on deep learning models for the myocardial borders on LGE-MRI. A total number of 320 exams (with a mean number of 6 slices per exam) were used for training and 28 exams used for testing. The performance analysis of the proposed ensemble model in the basal and middle slices was similar as compared to intra-observer study and slightly lower at apical slices.
arXiv Detail & Related papers (2020-05-27T20:44:38Z)
A Global Benchmark of Algorithms for Segmenting Late Gadolinium-Enhanced Cardiac Magnetic Resonance Imaging [90.29017019187282]
"2018 Left Atrium Challenge" using 154 3D LGE-MRIs, currently the world's largest cardiac LGE-MRI dataset. Analysis of the submitted algorithms using technical and biological metrics was performed. Results show the top method achieved a Dice score of 93.2% and a mean surface-to-surface distance of 0.7 mm.
arXiv Detail & Related papers (2020-04-26T08:49:17Z)
Improving Calibration and Out-of-Distribution Detection in Medical Image Segmentation with Convolutional Neural Networks [8.219843232619551]
Convolutional Neural Networks (CNNs) have been shown to be powerful medical image segmentation models. We advocate for multi-task learning, i.e., training a single model on several different datasets. We show not only that a single CNN learns to automatically recognize the context and accurately segment the organ of interest in each context, but also that such a joint model often has more accurate and better-calibrated predictions.
arXiv Detail & Related papers (2020-04-12T23:42:51Z)
An interpretable classifier for high-resolution breast cancer screening images utilizing weakly supervised localization [45.00998416720726]
We propose a framework to address the unique properties of medical images. This model first uses a low-capacity, yet memory-efficient, network on the whole image to identify the most informative regions. It then applies another higher-capacity network to collect details from chosen regions. Finally, it employs a fusion module that aggregates global and local information to make a final prediction.
arXiv Detail & Related papers (2020-02-13T15:28:42Z)