XAttn-BMD: Multimodal Deep Learning with Cross-Attention for Femoral Neck Bone Mineral Density Estimation
- URL: http://arxiv.org/abs/2511.14604v1
- Date: Tue, 18 Nov 2025 15:53:42 GMT
- Title: XAttn-BMD: Multimodal Deep Learning with Cross-Attention for Femoral Neck Bone Mineral Density Estimation
- Authors: Yilin Zhang, Leo D. Westbury, Elaine M. Dennison, Nicholas C. Harvey, Nicholas R. Fuggle, Rahman Attar
- Abstract summary: Poor bone health is a significant public health concern, and low bone mineral density leads to an increased fracture risk. We present XAttn-BMD, a multimodal deep learning framework that predicts femoral neck BMD from hip X-ray images and structured clinical metadata. It utilizes a novel bidirectional cross-attention mechanism to dynamically integrate image and metadata features for cross-modal mutual reinforcement.
- Score: 4.192785353394277
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Poor bone health is a significant public health concern, and low bone mineral density (BMD) leads to an increased fracture risk, a key feature of osteoporosis. We present XAttn-BMD (Cross-Attention BMD), a multimodal deep learning framework that predicts femoral neck BMD from hip X-ray images and structured clinical metadata. It utilizes a novel bidirectional cross-attention mechanism to dynamically integrate image and metadata features for cross-modal mutual reinforcement. A Weighted Smooth L1 loss is tailored to address BMD imbalance and prioritize clinically significant cases. Extensive experiments on the data from the Hertfordshire Cohort Study show that our model outperforms the baseline models in regression generalization and robustness. Ablation studies confirm the effectiveness of both cross-attention fusion and the customized loss function. Experimental results show that the integration of multimodal data via cross-attention outperforms naive feature concatenation without cross-attention, reducing MSE by 16.7%, MAE by 6.03%, and increasing the R2 score by 16.4%, highlighting the effectiveness of the approach for femoral neck BMD estimation. Furthermore, screening performance was evaluated using binary classification at clinically relevant femoral neck BMD thresholds, demonstrating the model's potential in real-world scenarios.
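The abstract mentions a Weighted Smooth L1 loss that prioritizes clinically significant cases but does not give its exact form. The following is a minimal sketch of one plausible reading, not the authors' implementation: the standard Smooth L1 penalty, with a larger per-sample weight when the true BMD falls below a cutoff. The names `sample_weight`, `low_threshold`, and `low_weight`, and their default values, are illustrative assumptions.

```python
def smooth_l1(pred, target, beta=1.0):
    """Standard Smooth L1: quadratic for small errors, linear for large ones."""
    diff = abs(pred - target)
    if diff < beta:
        return 0.5 * diff * diff / beta
    return diff - 0.5 * beta


def sample_weight(target_bmd, low_threshold=0.7, low_weight=2.0):
    """Hypothetical weighting: up-weight samples whose true BMD is low.

    The threshold and weight are placeholders, not values from the paper.
    """
    return low_weight if target_bmd < low_threshold else 1.0


def weighted_smooth_l1(preds, targets, beta=1.0, low_threshold=0.7, low_weight=2.0):
    """Weighted mean of per-sample Smooth L1 losses."""
    if len(preds) != len(targets) or len(preds) == 0:
        raise ValueError("preds and targets must be non-empty and equal in length")
    weights = [sample_weight(t, low_threshold, low_weight) for t in targets]
    total = sum(w * smooth_l1(p, t, beta) for w, p, t in zip(weights, preds, targets))
    return total / sum(weights)


if __name__ == "__main__":
    # One low-BMD sample with an error of 0.5, one normal sample predicted exactly.
    print(weighted_smooth_l1([1.0, 1.0], [0.5, 1.0]))
```

Normalizing by the sum of weights keeps the loss on the same scale as the unweighted mean, so changing `low_weight` shifts emphasis toward low-BMD samples without rescaling the overall loss.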
Related papers
- A multimodal vision foundation model for generalizable knee pathology [40.03838145472935]
Musculoskeletal disorders represent an urgent demand for precise interpretation of medical imaging. Current artificial intelligence approaches in orthopedics rely on task-specific, supervised learning paradigms. We introduce OrthoFoundation, a multimodal vision foundation model optimized for musculoskeletal pathology.
arXiv Detail & Related papers (2026-01-26T08:14:51Z)
- Deep Learning-Based BMD Estimation from Radiographs with Conformal Uncertainty Quantification [0.0]
This study proposes using widely available knee X-rays for opportunistic Bone Mineral Density estimation via deep learning. It provides statistically rigorous, patient-specific prediction intervals with guaranteed coverage.
arXiv Detail & Related papers (2025-05-28T16:33:49Z)
- AXIAL: Attention-based eXplainability for Interpretable Alzheimer's Localized Diagnosis using 2D CNNs on 3D MRI brain scans [43.06293430764841]
This study presents an innovative method for Alzheimer's disease diagnosis using 3D MRI designed to enhance the explainability of model decisions.
Our approach adopts a soft attention mechanism, enabling 2D CNNs to extract volumetric representations.
With voxel-level precision, our method identifies the specific areas that receive attention, highlighting the predominant brain regions.
arXiv Detail & Related papers (2024-07-02T16:44:00Z)
- Learning-based Bone Quality Classification Method for Spinal Metastasis [36.59899006688448]
Early detection of spinal metastasis is critical for accurate staging and optimal treatment.
In this paper, we explore a learning-based automatic bone quality classification method for spinal metastasis based on CT images.
arXiv Detail & Related papers (2024-02-14T02:53:51Z)
- Cross-modality Guidance-aided Multi-modal Learning with Dual Attention for MRI Brain Tumor Grading [47.50733518140625]
Brain tumor represents one of the most fatal cancers around the world, and is very common in children and the elderly.
We propose a novel cross-modality guidance-aided multi-modal learning with dual attention for addressing the task of MRI brain tumor grading.
arXiv Detail & Related papers (2024-01-17T07:54:49Z)
- Medical Report Generation based on Segment-Enhanced Contrastive Representation Learning [39.17345313432545]
We propose MSCL (Medical image with Contrastive Learning) to segment organs, abnormalities, bones, etc.
We introduce a supervised contrastive loss that assigns more weight to reports that are semantically similar to the target while training.
Experimental results demonstrate the effectiveness of our proposed model, where we achieve state-of-the-art performance on the IU X-Ray public dataset.
arXiv Detail & Related papers (2023-12-26T03:33:48Z)
- Guided Reconstruction with Conditioned Diffusion Models for Unsupervised Anomaly Detection in Brain MRIs [35.46541584018842]
Unsupervised Anomaly Detection (UAD) aims to identify any anomaly as an outlier from a healthy training distribution. Generative models are used to learn the reconstruction of healthy brain anatomy for a given input image. We propose conditioning the denoising process of diffusion models with additional information derived from a latent representation of the input image.
arXiv Detail & Related papers (2023-12-07T11:03:42Z)
- A Two-Stage Generative Model with CycleGAN and Joint Diffusion for MRI-based Brain Tumor Detection [41.454028276986946]
We propose a novel framework Two-Stage Generative Model (TSGM) to improve brain tumor detection and segmentation.
CycleGAN is trained on unpaired data to generate abnormal images from healthy images as data prior.
VE-JP is implemented to reconstruct healthy images using synthetic paired abnormal images as a guide.
arXiv Detail & Related papers (2023-11-06T12:58:26Z)
- Bone mineral density estimation from a plain X-ray image by learning decomposition into projections of bone-segmented computed tomography [4.872603360039571]
Osteoporosis is a prevalent bone disease that causes fractures in fragile bones, leading to a decline in daily living activities.
To frequently monitor bone health, low-cost, low-dose, and ubiquitously available diagnostic methods are highly anticipated.
In this study, we aim to perform bone mineral density estimation from a plain X-ray image for opportunistic screening.
arXiv Detail & Related papers (2023-07-21T11:49:30Z)
- Lumbar Bone Mineral Density Estimation from Chest X-ray Images: Anatomy-aware Attentive Multi-ROI Modeling [23.014342480592873]
Osteoporosis is a chronic metabolic bone disease that is often under-diagnosed and under-treated due to the limited access to bone mineral density examinations.
In this paper, we propose a method to predict BMD from Chest X-ray (CXR), one of the most commonly accessible and low-cost medical imaging examinations.
arXiv Detail & Related papers (2022-01-05T22:03:32Z)
- Semi-Supervised Learning for Bone Mineral Density Estimation in Hip X-ray Images [19.17169803995019]
Bone mineral density is a clinically critical indicator of osteoporosis.
Due to the limited accessibility of DEXA machines and examinations, osteoporosis is often under-diagnosed and under-treated.
arXiv Detail & Related papers (2021-03-24T20:59:54Z)
- Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for Thoracic Disease Identification [83.6017225363714]
Deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z)
- Introducing Anisotropic Minkowski Functionals for Local Structure Analysis and Prediction of Biomechanical Strength of Proximal Femur Specimens [0.0]
Bone fragility and fracture caused by osteoporosis or injury are prevalent in adults over the age of 50 and can reduce their quality of life.
This study proposes a new method to predict the bone strength of proximal femur specimens from quantitative multi-detector computer tomography (MDCT) images.
arXiv Detail & Related papers (2020-04-02T14:33:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of its content (including all information) and is not responsible for any consequences.