Related papers: STACT-Time: Spatio-Temporal Cross Attention for Cine Thyroid Ultrasound Time Series Classification

STACT-Time: Spatio-Temporal Cross Attention for Cine Thyroid Ultrasound Time Series Classification

URL: http://arxiv.org/abs/2506.18172v1
Date: Sun, 22 Jun 2025 21:14:04 GMT
Title: STACT-Time: Spatio-Temporal Cross Attention for Cine Thyroid Ultrasound Time Series Classification
Authors: Irsyad Adam, Tengyue Zhang, Shrayes Raman, Zhuyu Qiu, Brandon Taraku, Hexiang Feng, Sile Wang, Ashwath Radhachandran, Shreeram Athreya, Vedrana Ivezic, Peipei Ping, Corey Arnold, William Speier,
Abstract summary: Thyroid cancer is among the most common cancers in the United States.<n>Recent deep learning approaches have sought to improve risk stratification, but they often fail to utilize the rich temporal and spatial context provided by US cine clips.<n>We propose the Spatio-Temporal Cross Attention for Cine Thyroid Ultrasound Time Series Classification (STACT-Time) model.
Score: 2.510842391292067
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Thyroid cancer is among the most common cancers in the United States. Thyroid nodules are frequently detected through ultrasound (US) imaging, and some require further evaluation via fine-needle aspiration (FNA) biopsy. Despite its effectiveness, FNA often leads to unnecessary biopsies of benign nodules, causing patient discomfort and anxiety. To address this, the American College of Radiology Thyroid Imaging Reporting and Data System (TI-RADS) has been developed to reduce benign biopsies. However, such systems are limited by interobserver variability. Recent deep learning approaches have sought to improve risk stratification, but they often fail to utilize the rich temporal and spatial context provided by US cine clips, which contain dynamic global information and surrounding structural changes across various views. In this work, we propose the Spatio-Temporal Cross Attention for Cine Thyroid Ultrasound Time Series Classification (STACT-Time) model, a novel representation learning framework that integrates imaging features from US cine clips with features from segmentation masks automatically generated by a pretrained model. By leveraging self-attention and cross-attention mechanisms, our model captures the rich temporal and spatial context of US cine clips while enhancing feature representation through segmentation-guided learning. Our model improves malignancy prediction compared to state-of-the-art models, achieving a cross-validation precision of 0.91 (plus or minus 0.02) and an F1 score of 0.89 (plus or minus 0.02). By reducing unnecessary biopsies of benign nodules while maintaining high sensitivity for malignancy detection, our model has the potential to enhance clinical decision-making and improve patient outcomes.

Related papers

One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training [45.49415063761575]
EndoRare is a one-shot, retraining-free generative framework that synthesizes diverse, high-fidelity lesion exemplars from a single reference image.<n>We validated the framework across four rare pathologies.<n>These results establish a practical, data-efficient pathway to bridge the rare-disease gap in both computer-aided diagnostics and clinical education.
arXiv Detail & Related papers (2025-12-30T15:07:09Z)
A Deep Learning Framework for Thyroid Nodule Segmentation and Malignancy Classification from Ultrasound Images [2.875000842489767]
We propose a fully automated, two-stage framework for interpretable malignancy prediction.<n>Our method achieves interpretability by forcing the model to focus only on clinically relevant regions.<n>This is the first fully automated end-to-end pipeline for both detecting thyroid nodules on ultrasound images and predicting their malignancy.
arXiv Detail & Related papers (2025-11-14T23:23:24Z)
Multi-Task Diffusion Approach For Prediction of Glioma Tumor Progression [0.6978367196609415]
Glioma is an aggressive brain malignancy that poses significant challenges for accurate evolution prediction.<n>In this paper, we present a multitask diffusion framework for time-agnostic, pixel-wise prediction of glioma progression.
arXiv Detail & Related papers (2025-09-13T14:42:46Z)
ONCOPILOT: A Promptable CT Foundation Model For Solid Tumor Evaluation [3.8763197858217935]
ONCOPILOT is an interactive radiological foundation model trained on approximately 7,500 CT scans covering the whole body. It performs 3D tumor segmentation using visual prompts like point-click and bounding boxes, outperforming state-of-the-art models. ONCOPILOT also accelerates measurement processes and reduces inter-reader variability.
arXiv Detail & Related papers (2024-10-10T13:36:49Z)
Spatiotemporal Graph Neural Network Modelling Perfusion MRI [12.712005118761516]
Per vascular MRI (pMRI) offers valuable insights into tumority and promises to predict tumor genotypes. Yet effective models tailored to 4D pMRI are still lacking. This study presents the first attempt to model 4D pMRI using a GNN-based model.
arXiv Detail & Related papers (2024-06-10T16:24:46Z)
Autonomous Path Planning for Intercostal Robotic Ultrasound Imaging Using Reinforcement Learning [45.5123007404575]
The US examination for thoracic application is still challenging due to the acoustic shadow cast by the subcutaneous rib cage. We present a reinforcement learning approach for planning scanning paths between ribs to monitor changes in lesions on internal organs. Experiments have been carried out on unseen CTs with randomly defined single or multiple scanning targets.
arXiv Detail & Related papers (2024-04-15T16:52:53Z)
CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers [66.15847237150909]
We introduce a self-supervised deep learning architecture to segment catheters in longitudinal ultrasound images. The network architecture builds upon AiAReSeg, a segmentation transformer built with the Attention in Attention mechanism. We validated our model on a test dataset, consisting of unseen synthetic data and images collected from silicon aorta phantoms.
arXiv Detail & Related papers (2024-03-21T15:13:36Z)
Domain Transfer Through Image-to-Image Translation for Uncertainty-Aware Prostate Cancer Classification [42.75911994044675]
We present a novel approach for unpaired image-to-image translation of prostate MRIs and an uncertainty-aware training approach for classifying clinically significant PCa. Our approach involves a novel pipeline for translating unpaired 3.0T multi-parametric prostate MRIs to 1.5T, thereby augmenting the available training data. Our experiments demonstrate that the proposed method significantly improves the Area Under ROC Curve (AUC) by over 20% compared to the previous work.
arXiv Detail & Related papers (2023-07-02T05:26:54Z)
Using Spatio-Temporal Dual-Stream Network with Self-Supervised Learning for Lung Tumor Classification on Radial Probe Endobronchial Ultrasound Video [0.0]
During the biopsy process of lung cancer, physicians use real-time ultrasound images to find suitable lesion locations for sampling. Previous studies have employed 2D convolutional neural networks to effectively differentiate between benign and malignant lung lesions. This study designs an automatic diagnosis system based on a 3D neural network.
arXiv Detail & Related papers (2023-05-04T10:39:37Z)
Artificial-intelligence-based molecular classification of diffuse gliomas using rapid, label-free optical imaging [59.79875531898648]
DeepGlioma is an artificial-intelligence-based diagnostic screening system. DeepGlioma can predict the molecular alterations used by the World Health Organization to define the adult-type diffuse glioma taxonomy.
arXiv Detail & Related papers (2023-03-23T18:50:18Z)
Generative AI for Rapid Diffusion MRI with Improved Image Quality, Reliability and Generalizability [3.6119644566822484]
We employ a Swin UNEt Transformers model, trained on augmented Human Connectome Project data, to perform generalized denoising of dMRI. We demonstrate super-resolution with artificially downsampled HCP data in normal adult volunteers. We exceed current state-of-the-art denoising methods in accuracy and test-retest reliability of rapid diffusion tensor imaging requiring only 90 seconds of scan time.
arXiv Detail & Related papers (2023-03-10T03:39:23Z)
Learned super resolution ultrasound for improved breast lesion characterization [52.77024349608834]
Super resolution ultrasound localization microscopy enables imaging of the microvasculature at the capillary level. In this work we use a deep neural network architecture that makes effective use of signal structure to address these challenges. By leveraging our trained network, the microvasculature structure is recovered in a short time, without prior PSF knowledge, and without requiring separability of the UCAs.
arXiv Detail & Related papers (2021-07-12T09:04:20Z)
Spectral-Spatial Recurrent-Convolutional Networks for In-Vivo Hyperspectral Tumor Type Classification [49.32653090178743]
We demonstrate the feasibility of in-vivo tumor type classification using hyperspectral imaging and deep learning. Our best model achieves an AUC of 76.3%, significantly outperforming previous conventional and deep learning methods.
arXiv Detail & Related papers (2020-07-02T12:00:53Z)
Microvascular Dynamics from 4D Microscopy Using Temporal Segmentation [81.30750944868142]
We are able to track changes in cerebral blood volume over time and identify spontaneous arterial dilations that propagate towards the pial surface. This new imaging capability is a promising step towards characterizing the hemodynamic response function upon which functional magnetic resonance imaging (fMRI) is based.
arXiv Detail & Related papers (2020-01-14T22:55:03Z)

This list is automatically generated from the titles and abstracts of the papers in this site.