Related papers: Depth-Sequence Transformer (DST) for Segment-Specific ICA Calcification Mapping on Non-Contrast CT

URL: http://arxiv.org/abs/2507.08214v2
Date: Wed, 16 Jul 2025 19:10:13 GMT
Title: Depth-Sequence Transformer (DST) for Segment-Specific ICA Calcification Mapping on Non-Contrast CT
Authors: Xiangjian Hou, Ebru Yaman Akcicek, Xin Wang, Kazem Hashemizadeh, Scott Mcnally, Chun Yuan, Xiaodong Ma
Abstract summary: Conventional 3D models are forced to process downsampled volumes or isolated patches. We reformulate the 3D challenge as a Parallel Probabilistic Landmark Localization task along the 1D axial dimension. We propose the Depth-Sequence Transformer (DST), a framework that processes full-resolution CT volumes as sequences of 2D slices.
Score: 27.975558644423664
License: http://creativecommons.org/licenses/by/4.0/
Abstract: While total intracranial carotid artery calcification (ICAC) volume is an established stroke biomarker, growing evidence shows this aggregate metric ignores the critical influence of plaque location, since calcification in different segments carries distinct prognostic and procedural risks. However, a finer-grained, segment-specific quantification has remained technically infeasible. Conventional 3D models are forced to process downsampled volumes or isolated patches, sacrificing the global context required to resolve anatomical ambiguity and render reliable landmark localization. To overcome this, we reformulate the 3D challenge as a Parallel Probabilistic Landmark Localization task along the 1D axial dimension. We propose the Depth-Sequence Transformer (DST), a framework that processes full-resolution CT volumes as sequences of 2D slices, learning to predict $N=6$ independent probability distributions that pinpoint key anatomical landmarks. Our DST framework demonstrates exceptional accuracy and robustness. Evaluated on a 100-patient clinical cohort with rigorous 5-fold cross-validation, it achieves a Mean Absolute Error (MAE) of 0.1 slices, with 96% of predictions falling within a $\pm1$ slice tolerance. Furthermore, to validate its architectural power, the DST backbone establishes the best result on the public Clean-CC-CCII classification benchmark under an end-to-end evaluation protocol. Our work delivers the first practical tool for automated segment-specific ICAC analysis. The proposed framework provides a foundation for further studies on the role of location-specific biomarkers in diagnosis, prognosis, and procedural planning.
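Illustrative sketch (not from the paper): the abstract describes predicting one probability distribution per landmark along the axial slice axis and reports MAE in slices and the share of predictions within a ±1 slice tolerance. The snippet below shows one plausible way to decode such outputs and compute those two metrics. It assumes a softmax over the slice axis and argmax decoding, neither of which is confirmed by the abstract; the function names `softmax`, `localize_landmarks`, and `slice_metrics` and the toy numbers are hypothetical.

```python
import math


def softmax(logits):
    """Turn per-slice scores into a probability distribution over slices."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def localize_landmarks(logits_per_landmark):
    """Decode each landmark independently (assumed: softmax + argmax).

    logits_per_landmark: N lists, each holding one score per axial slice.
    Returns (distributions, predicted slice indices).
    """
    dists = [softmax(row) for row in logits_per_landmark]
    preds = [max(range(len(p)), key=p.__getitem__) for p in dists]
    return dists, preds


def slice_metrics(preds, targets, tolerance=1):
    """Mean absolute error in slices and fraction within +/- tolerance."""
    errors = [abs(p - t) for p, t in zip(preds, targets)]
    mae = sum(errors) / len(errors)
    within = sum(e <= tolerance for e in errors) / len(errors)
    return mae, within


# Toy example: 3 landmarks over a 5-slice volume (made-up scores).
toy_logits = [
    [0.1, 2.0, 0.3, 0.0, -1.0],
    [0.0, 0.2, 0.1, 3.0, 0.5],
    [1.5, 0.0, 0.2, 0.1, 0.0],
]
toy_targets = [1, 4, 2]

dists, preds = localize_landmarks(toy_logits)
mae, within = slice_metrics(preds, toy_targets)
print(preds, mae, within)
```

Because each landmark gets its own distribution, all landmarks can be decoded in parallel from a single pass over the slice sequence; the paper itself uses $N=6$ landmarks on full-resolution volumes.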

Related papers

GEPAR3D: Geometry Prior-Assisted Learning for 3D Tooth Segmentation [0.15487122608774898]
Tooth segmentation in Cone-Beam Computed Tomography (CBCT) remains challenging. We introduce GEPAR3D, a novel approach that unifies instance detection and multi-class segmentation into a single step to improve root segmentation. We leverage a deep watershed method, modeling each tooth as a continuous 3D energy basin encoding voxel distances to boundaries.
arXiv Detail & Related papers (2025-07-31T20:46:58Z)
MOSAIC: A Multi-View 2.5D Organ Slice Selector with Cross-Attentional Reasoning for Anatomically-Aware CT Localization in Medical Organ Segmentation [0.8747606955991707]
Existing 3D segmentation approaches are computationally and memory intensive, often processing entire volumes that contain many anatomically irrelevant slices. We propose a novel, anatomically-aware slice selector pipeline that reduces input volume prior to segmentation. Our proposed model acts as an "expert" in anatomical localization, reasoning over multi-view representations to selectively retain slices with high structural relevance.
arXiv Detail & Related papers (2025-05-15T19:32:28Z)
Myocardial Region-guided Feature Aggregation Net for Automatic Coronary artery Segmentation and Stenosis Assessment using Coronary Computed Tomography Angiography [13.885760158090692]
Myocardial Region-guided Feature Aggregation Net is a novel U-shaped dual-encoder architecture that integrates anatomical prior knowledge to enhance robustness in coronary artery segmentation. Our framework incorporates three key innovations: (1) a Myocardial Region-guided Module that directs attention to coronary regions via bridging expansion and multi-scale feature fusion, (2) a Residual Feature Extraction Module that combines parallel spatial channel attention with residual blocks to enhance local-global feature discrimination, and (3) a Multi-scale Feature Fusion Module for adaptive aggregation of hierarchical vascular features.
arXiv Detail & Related papers (2025-04-27T16:43:52Z)
Towards Unifying Anatomy Segmentation: Automated Generation of a Full-body CT Dataset via Knowledge Aggregation and Anatomical Guidelines [113.08940153125616]
We generate a dataset of whole-body CT scans with 142 voxel-level labels for 533 volumes providing comprehensive anatomical coverage. Our proposed procedure does not rely on manual annotation during the label aggregation stage. We release our trained unified anatomical segmentation model capable of predicting 142 anatomical structures on CT data.
arXiv Detail & Related papers (2023-07-25T09:48:13Z)
Accurate Fine-Grained Segmentation of Human Anatomy in Radiographs via Volumetric Pseudo-Labeling [66.75096111651062]
We created a large-scale dataset of 10,021 thoracic CTs with 157 labels. We applied an ensemble of 3D anatomy segmentation models to extract anatomical pseudo-labels. Our resulting segmentation models demonstrated remarkable performance on CXR.
arXiv Detail & Related papers (2023-06-06T18:01:08Z)
Med-Query: Steerable Parsing of 9-DoF Medical Anatomies with Query Embedding [14.901279446640393]
We propose a steerable, robust, and efficient computing framework for detection, identification, and segmentation of anatomies in CT scans. Considering the complicated shapes, sizes, and orientations of anatomies, we present a nine degrees of freedom (9-DoF) pose estimation solution in full 3D space. We have validated our method on three medical imaging parsing tasks: ribs, spine, and abdominal organs.
arXiv Detail & Related papers (2022-12-05T04:04:21Z)
Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images [55.83984261827332]
In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network. We develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module. Our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches.
arXiv Detail & Related papers (2022-12-01T07:32:56Z)
Segmentation of Bruch's Membrane in retinal OCT with AMD using anatomical priors and uncertainty quantification [4.5206601127476445]
We propose an end-to-end deep learning method for automated Bruch's membrane (BM) segmentation in AMD patients. An Attention U-Net is trained to output a probability density function of the BM position, while taking into account the natural curvature of the surface. Besides the surface position, the method also estimates an A-scan wise uncertainty measure of the segmentation output.
arXiv Detail & Related papers (2022-10-26T15:49:07Z)
A unified 3D framework for Organs at Risk Localization and Segmentation for Radiation Therapy Planning [56.52933974838905]
Current medical workflow requires manual delineation of organs-at-risk (OAR). In this work, we aim to introduce a unified 3D pipeline for OAR localization-segmentation. Our proposed framework fully enables the exploitation of 3D context information inherent in medical imaging.
arXiv Detail & Related papers (2022-03-01T17:08:41Z)
Cross-Site Severity Assessment of COVID-19 from CT Images via Domain Adaptation [64.59521853145368]
Early and accurate severity assessment of Coronavirus disease 2019 (COVID-19) based on computed tomography (CT) images offers a great help to the estimation of intensive care unit event. To augment the labeled data and improve the generalization ability of the classification model, it is necessary to aggregate data from multiple sites. This task faces several challenges including class imbalance between mild and severe infections, domain distribution discrepancy between sites, and presence of heterogeneous features.
arXiv Detail & Related papers (2021-09-08T07:56:51Z)
An Uncertainty-Driven GCN Refinement Strategy for Organ Segmentation [53.425900196763756]
We propose a segmentation refinement method based on uncertainty analysis and graph convolutional networks. We employ the uncertainty levels of the convolutional network in a particular input volume to formulate a semi-supervised graph learning problem. We show that our method outperforms the state-of-the-art CRF refinement method by improving the dice score by 1% for the pancreas and 2% for spleen.
arXiv Detail & Related papers (2020-12-06T18:55:07Z)
AttentionAnatomy: A unified framework for whole-body organs at risk segmentation using multiple partially annotated datasets [30.23917416966188]
Organs-at-risk (OAR) delineation in computed tomography (CT) is an important step in Radiation Therapy (RT) planning. Our proposed end-to-end convolutional neural network model, called AttentionAnatomy, can be jointly trained with three partially annotated datasets. Experimental results of our proposed framework presented significant improvements in both Sorensen-Dice coefficient (DSC) and 95% Hausdorff distance.
arXiv Detail & Related papers (2020-01-13T18:31:34Z)

This list is automatically generated from the titles and abstracts of the papers on this site.