Related papers: Towards Segmenting the Invisible: An End-to-End Registration and Segmentation Framework for Weakly Supervised Tumour Analysis

URL: http://arxiv.org/abs/2602.05453v1
Date: Thu, 05 Feb 2026 08:55:26 GMT
Title: Towards Segmenting the Invisible: An End-to-End Registration and Segmentation Framework for Weakly Supervised Tumour Analysis
Authors: Budhaditya Mukhopadhyay, Chirag Mandal, Pavan Tummala, Naghmeh Mahmoodian, Andreas Nürnberger, Soumick Chatterjee
Abstract summary: Liver tumour ablation presents a significant clinical challenge. Tumours are often invisible on intra-operative CT due to minimal contrast between pathological and healthy tissue. This work investigates the feasibility of cross-modality weak supervision for scenarios where pathology is visible in one modality but absent in another.
Score: 0.5716776378742904
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Liver tumour ablation presents a significant clinical challenge: whilst tumours are clearly visible on pre-operative MRI, they are often effectively invisible on intra-operative CT due to minimal contrast between pathological and healthy tissue. This work investigates the feasibility of cross-modality weak supervision for scenarios where pathology is visible in one modality (MRI) but absent in another (CT). We present a hybrid registration-segmentation framework that combines MSCGUNet for inter-modal image registration with a UNet-based segmentation module, enabling registration-assisted pseudo-label generation for CT images. Our evaluation on the CHAOS dataset demonstrates that the pipeline can successfully register and segment healthy liver anatomy, achieving a Dice score of 0.72. However, when applied to clinical data containing tumours, performance degrades substantially (Dice score of 0.16), revealing the fundamental limitations of current registration methods when the target pathology lacks corresponding visual features in the target modality. We analyse the "domain gap" and "feature absence" problems, demonstrating that whilst spatial propagation of labels via registration is feasible for visible structures, segmenting truly invisible pathology remains an open challenge. Our findings highlight that registration-based label transfer cannot compensate for the absence of discriminative features in the target modality, providing important insights for future research in cross-modality medical image analysis. Code and weights are available at: https://github.com/BudhaTronix/Weakly-Supervised-Tumour-Detection
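
For readers unfamiliar with the metric quoted in the abstract, the Dice score measures the overlap between a predicted mask and a reference mask as twice the intersection divided by the sum of the two mask sizes. The sketch below is illustrative only and is not taken from the authors' repository; the function name `dice_score`, the flat 0/1 list representation, and the convention that two empty masks score 1.0 are all assumptions made for this example.

```python
def dice_score(pred, target):
    """Dice coefficient between two binary masks given as flat sequences of 0/1."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of elements")
    # Count positions where both masks are foreground.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Sum of the foreground sizes of the two masks.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    # Assumed convention: two empty masks count as a perfect match.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


# Toy 1-D masks: three foreground elements each, two of them shared.
pred = [0, 1, 1, 1, 0, 0]
target = [0, 0, 1, 1, 1, 0]
print(dice_score(pred, target))
```

A score of 1.0 means the masks coincide and 0.0 means they do not overlap at all, so the reported drop from 0.72 to 0.16 corresponds to predicted tumour regions that share very little area with the reference annotations.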

Related papers

Multi-View Stenosis Classification Leveraging Transformer-Based Multiple-Instance Learning Using Real-World Clinical Data [76.89269238957593]
Coronary artery stenosis is a leading cause of cardiovascular disease, diagnosed by analyzing the coronary arteries from multiple angiography views. We propose SegmentMIL, a transformer-based multi-view multiple-instance learning framework for patient-level stenosis classification.
arXiv Detail & Related papers (2026-02-02T13:07:52Z)
Hide-and-Seek Attribution: Weakly Supervised Segmentation of Vertebral Metastases in CT [68.09387763135236]
We introduce a weakly supervised method trained solely on vertebra-level healthy/malignant labels, without any lesion masks. We achieve strong blastic/lytic performance despite no mask supervision.
arXiv Detail & Related papers (2025-12-07T14:03:28Z)
Leveraging Unlabeled Scans for NCCT Image Segmentation in Early Stroke Diagnosis: A Semi-Supervised GAN Approach [4.199320411821769]
Ischemic stroke is a time-critical medical emergency where rapid diagnosis is essential for improving patient outcomes. Non-contrast computed tomography (NCCT) serves as the frontline imaging tool, yet it often fails to reveal the subtle ischemic changes present in the early, hyperacute phase. We introduce a semi-supervised segmentation method using generative adversarial networks (GANs) to accurately delineate early ischemic stroke regions.
arXiv Detail & Related papers (2025-11-24T18:14:53Z)
Self-Supervised Anatomical Consistency Learning for Vision-Grounded Medical Report Generation [61.350584471060756]
Vision-grounded medical report generation aims to produce clinically accurate descriptions of medical images. We propose Self-Supervised Anatomical Consistency Learning (SS-ACL) to align generated reports with corresponding anatomical regions. SS-ACL constructs a hierarchical anatomical graph inspired by the invariant top-down inclusion structure of human anatomy.
arXiv Detail & Related papers (2025-09-30T08:59:06Z)
GlanceSeg: Real-time microaneurysm lesion segmentation with gaze-map-guided foundation model for early detection of diabetic retinopathy [13.055297330424397]
Early-stage diabetic retinopathy (DR) presents challenges in clinical diagnosis due to minute microangioma lesions. We propose a human-in-the-loop, label-free early DR diagnosis framework called GlanceSeg, based on the Segment Anything Model (SAM). GlanceSeg enables real-time segmentation of microangioma lesions as ophthalmologists review fundus images.
arXiv Detail & Related papers (2023-11-14T10:59:45Z)
Co-Learning Semantic-aware Unsupervised Segmentation for Pathological Image Registration [11.471174214165751]
We propose GIRNet, a novel unsupervised approach for pathological image registration. The registration of pathological images is achieved in a completely unsupervised learning framework. Our results show that our method can accurately achieve the registration of pathological images and identify lesions even in challenging imaging modalities.
arXiv Detail & Related papers (2023-10-17T07:13:28Z)
Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images [55.83984261827332]
In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network. We develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module. Our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches.
arXiv Detail & Related papers (2022-12-01T07:32:56Z)
SeATrans: Learning Segmentation-Assisted diagnosis model via Transforme [13.63128987400635]
We propose Segmentation-Assisted diagnosis Transformer (SeATrans) to transfer the segmentation knowledge to the disease diagnosis network. We first propose an asymmetric multi-scale interaction strategy to correlate each single low-level diagnosis feature with multi-scale segmentation features. To model the segmentation-diagnosis interaction, SeA-block first embeds the diagnosis feature based on the segmentation information via the encoder, and then transfers the embedding back to the diagnosis feature space by a decoder.
arXiv Detail & Related papers (2022-06-12T15:10:33Z)
An Algorithm for the Labeling and Interactive Visualization of the Cerebrovascular System of Ischemic Strokes [59.116811751334225]
VirtualDSA++ is an algorithm designed to segment and label the cerebrovascular tree on CTA scans. We extend the labeling mechanism for the cerebral arteries to identify occluded vessels. We present the generic concept of iterative systematic search for pathways on all nodes of said model, which enables new interactive features.
arXiv Detail & Related papers (2022-04-26T14:20:26Z)
FetReg: Placental Vessel Segmentation and Registration in Fetoscopy Challenge Dataset [57.30136148318641]
Fetoscopy laser photocoagulation is a widely used procedure for the treatment of Twin-to-Twin Transfusion Syndrome (TTTS). This may lead to increased procedural time and incomplete ablation, resulting in persistent TTTS. Computer-assisted intervention may help overcome these challenges by expanding the fetoscopic field of view through video mosaicking and providing better visualization of the vessel network. We present a large-scale multi-centre dataset for the development of generalized and robust semantic segmentation and video mosaicking algorithms for the fetal environment with a focus on creating drift-free mosaics from long duration fetoscopy videos.
arXiv Detail & Related papers (2021-06-10T17:14:27Z)
Anatomy-guided Multimodal Registration by Learning Segmentation without Ground Truth: Application to Intraprocedural CBCT/MR Liver Segmentation and Registration [12.861503169117208]
Multimodal image registration has many applications in diagnostic medical imaging and image-guided interventions. The ability to register peri-procedurally acquired diagnostic images into the intraprocedural environment can potentially improve the intra-procedural tumor targeting. We propose an anatomy-preserving domain adaptation to segmentation network (APA2Seg-Net) for learning segmentation without target modality ground truth.
arXiv Detail & Related papers (2021-04-14T18:07:03Z)

This list is automatically generated from the titles and abstracts of the papers on this site.