LatXGen: Towards Radiation-Free and Accurate Quantitative Analysis of Sagittal Spinal Alignment Via Cross-Modal Radiographic View Synthesis
- URL: http://arxiv.org/abs/2509.24165v1
- Date: Mon, 29 Sep 2025 01:29:53 GMT
- Title: LatXGen: Towards Radiation-Free and Accurate Quantitative Analysis of Sagittal Spinal Alignment Via Cross-Modal Radiographic View Synthesis
- Authors: Moxin Zhao, Nan Meng, Jason Pui Yin Cheung, Chris Yuk Kwan Tang, Chenxi Yu, Wenting Zhong, Pengyu Lu, Chang Shi, Yipeng Zhuang, Teng Zhang,
- Abstract summary: Adolescent Idiopathic Scoliosis (AIS) is a complex three-dimensional spinal deformity.<n>LatXGen is a novel generative framework that synthesizes realistic lateral spinal radiographs.<n>LatXGen produces accurate radiographs and outperforms existing GAN-based methods in both visual fidelity and quantitative metrics.
- Score: 5.000218104207936
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Adolescent Idiopathic Scoliosis (AIS) is a complex three-dimensional spinal deformity, and accurate morphological assessment requires evaluating both coronal and sagittal alignment. While previous research has made significant progress in developing radiation-free methods for coronal plane assessment, reliable and accurate evaluation of sagittal alignment without ionizing radiation remains largely underexplored. To address this gap, we propose LatXGen, a novel generative framework that synthesizes realistic lateral spinal radiographs from posterior Red-Green-Blue and Depth (RGBD) images of unclothed backs. This enables accurate, radiation-free estimation of sagittal spinal alignment. LatXGen tackles two core challenges: (1) inferring sagittal spinal morphology changes from a lateral perspective based on posteroanterior surface geometry, and (2) performing cross-modality translation from RGBD input to the radiographic domain. The framework adopts a dual-stage architecture that progressively estimates lateral spinal structure and synthesizes corresponding radiographs. To enhance anatomical consistency, we introduce an attention-based Fast Fourier Convolution (FFC) module for integrating anatomical features from RGBD images and 3D landmarks, and a Spatial Deformation Network (SDN) to model morphological variations in the lateral view. Additionally, we construct the first large-scale paired dataset for this task, comprising 3,264 RGBD and lateral radiograph pairs. Experimental results demonstrate that LatXGen produces anatomically accurate radiographs and outperforms existing GAN-based methods in both visual fidelity and quantitative metrics. This study offers a promising, radiation-free solution for sagittal spine assessment and advances comprehensive AIS evaluation.
Related papers
- Preoperative-to-intraoperative Liver Registration for Laparoscopic Surgery via Latent-Grounded Correspondence Constraints [51.7011449975586]
Land-Reg is a deformable registration framework that learns latent-grounded 2D-3D landmark correspondences.<n>For rigid registration, Land-Reg embraces a Cross-modal Latent Alignment module.<n>An Uncertainty-enhanced Overlap Landmark Detector with similarity matching is proposed to robustly estimate explicit 2D-3D landmark correspondences.
arXiv Detail & Related papers (2026-03-02T10:44:03Z) - US-X Complete: A Multi-Modal Approach to Anatomical 3D Shape Recovery [38.0728021642047]
We present a novel multi-modal learning method for completing occluded anatomical structures in 3D ultrasound.<n>We generate paired training data consisting of: (1) 2D lateral vertebral views that simulate X-ray scans, and (2) 3D partial vertebrae representations.<n>Our method integrates morphological information both imaging modalities and demonstrates significant improvements in vertebral reconstruction.
arXiv Detail & Related papers (2025-11-19T16:45:04Z) - Silhouette-to-Contour Registration: Aligning Intraoral Scan Models with Cephalometric Radiographs [10.70146635420186]
We propose DentalSCR, a pose-stable, contour-guided framework for accurate and interpretable silhouette-to-contour registration.<n>We evaluate DentalSCR on 34 expert-annotated clinical cases.
arXiv Detail & Related papers (2025-11-18T10:50:04Z) - Submillimeter-Accurate 3D Lumbar Spine Reconstruction from Biplanar X-Ray Images: Incorporating a Multi-Task Network and Landmark-Weighted Loss [4.252036643472159]
Current fully automated methods only achieve millimeter-level accuracy, making it difficult to meet clinical standards.<n>This study developed and validated a fully automated method for high-accuracy 3D reconstruction of the lumbar spine from biplanar X-ray images.<n>The proposed method achieved a 3D reconstruction accuracy of 0.80mm, representing a significant improvement over the mainstream approaches.
arXiv Detail & Related papers (2025-03-18T15:00:39Z) - 3D Spine Shape Estimation from Single 2D DXA [49.53978253009771]
We propose an automated framework to estimate the 3D spine shape from 2D DXA scans.<n>We achieve this by explicitly predicting the sagittal view of the spine from the DXA scan.
arXiv Detail & Related papers (2024-12-02T13:58:26Z) - B-Spine: Learning B-Spline Curve Representation for Robust and
Interpretable Spinal Curvature Estimation [50.208310028625284]
We propose B-Spine, a novel deep learning pipeline to learn B-spline curve representation of the spine.
We estimate the Cobb angles for spinal curvature estimation from low-quality X-ray images.
arXiv Detail & Related papers (2023-10-14T15:34:57Z) - Act Like a Radiologist: Radiology Report Generation across Anatomical Regions [50.13206214694885]
X-RGen is a radiologist-minded report generation framework across six anatomical regions.
In X-RGen, we seek to mimic the behaviour of human radiologists, breaking them down into four principal phases.
We enhance the recognition capacity of the image encoder by analysing images and reports across various regions.
arXiv Detail & Related papers (2023-05-26T07:12:35Z) - Medical Image Captioning via Generative Pretrained Transformers [57.308920993032274]
We combine two language models, the Show-Attend-Tell and the GPT-3, to generate comprehensive and descriptive radiology records.
The proposed model is tested on two medical datasets, the Open-I, MIMIC-CXR, and the general-purpose MS-COCO.
arXiv Detail & Related papers (2022-09-28T10:27:10Z) - Context-Aware Transformers For Spinal Cancer Detection and Radiological
Grading [70.04389979779195]
This paper proposes a novel transformer-based model architecture for medical imaging problems involving analysis of vertebrae.
It considers two applications of such models in MR images: (a) detection of spinal metastases and the related conditions of vertebral fractures and metastatic cord compression.
We show that by considering the context of vertebral bodies in the image, SCT improves the accuracy for several gradings compared to previously published model.
arXiv Detail & Related papers (2022-06-27T10:31:03Z) - MedNeRF: Medical Neural Radiance Fields for Reconstructing 3D-aware
CT-Projections from a Single X-ray [14.10611608681131]
Excessive ionising radiation can lead to deterministic and harmful effects on the body.
This paper proposes a Deep Learning model that learns to reconstruct CT projections from a few or even a single-view X-ray.
arXiv Detail & Related papers (2022-02-02T13:25:23Z) - XraySyn: Realistic View Synthesis From a Single Radiograph Through CT
Priors [118.27130593216096]
A radiograph visualizes the internal anatomy of a patient through the use of X-ray, which projects 3D information onto a 2D plane.
To the best of our knowledge, this is the first work on radiograph view synthesis.
We show that by gaining an understanding of radiography in 3D space, our method can be applied to radiograph bone extraction and suppression without groundtruth bone labels.
arXiv Detail & Related papers (2020-12-04T05:08:53Z) - Inferring the 3D Standing Spine Posture from 2D Radiographs [5.114998342130747]
An upright spinal pose (i.e. standing) under natural weight bearing is crucial for such bio-mechanical analysis.
We propose a novel neural network architecture working vertebra-wise, termed emphTransVert, which takes 2D radiographs and infers the spine's 3D posture.
arXiv Detail & Related papers (2020-07-13T18:37:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.