Scaling Artificial Intelligence for Multi-Tumor Early Detection with More Reports, Fewer Masks
- URL: http://arxiv.org/abs/2510.14803v1
- Date: Thu, 16 Oct 2025 15:35:44 GMT
- Title: Scaling Artificial Intelligence for Multi-Tumor Early Detection with More Reports, Fewer Masks
- Authors: Pedro R. A. S. Bassi, Xinze Zhou, Wenxuan Li, Szymon Płotka, Jieneng Chen, Qi Chen, Zheren Zhu, Jakub Prządo, Ibrahim E. Hamacı, Sezgin Er, Yuhan Wang, Ashwin Kumar, Bjoern Menze, Jarosław B. Ćwikła, Yuyin Zhou, Akshay S. Chaudhari, Curtis P. Langlotz, Sergio Decherchi, Andrea Cavalli, Kang Wang, Yang Yang, Alan L. Yuille, Zongwei Zhou,
- Abstract summary: We introduce R-Super, which trains AI to segment tumors that match descriptions in medical reports.<n>When trained on 101,654 reports, AI models achieved performance comparable to those trained on 723 masks.<n>R-Super enabled segmentation of tumors in the spleen, gallbladder, prostate, bladder, uterus, and esophagus.
- Score: 59.37427210144734
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Early tumor detection save lives. Each year, more than 300 million computed tomography (CT) scans are performed worldwide, offering a vast opportunity for effective cancer screening. However, detecting small or early-stage tumors on these CT scans remains challenging, even for experts. Artificial intelligence (AI) models can assist by highlighting suspicious regions, but training such models typically requires extensive tumor masks--detailed, voxel-wise outlines of tumors manually drawn by radiologists. Drawing these masks is costly, requiring years of effort and millions of dollars. In contrast, nearly every CT scan in clinical practice is already accompanied by medical reports describing the tumor's size, number, appearance, and sometimes, pathology results--information that is rich, abundant, and often underutilized for AI training. We introduce R-Super, which trains AI to segment tumors that match their descriptions in medical reports. This approach scales AI training with large collections of readily available medical reports, substantially reducing the need for manually drawn tumor masks. When trained on 101,654 reports, AI models achieved performance comparable to those trained on 723 masks. Combining reports and masks further improved sensitivity by +13% and specificity by +8%, surpassing radiologists in detecting five of the seven tumor types. Notably, R-Super enabled segmentation of tumors in the spleen, gallbladder, prostate, bladder, uterus, and esophagus, for which no public masks or AI models previously existed. This study challenges the long-held belief that large-scale, labor-intensive tumor mask creation is indispensable, establishing a scalable and accessible path toward early detection across diverse tumor types. We plan to release our trained models, code, and dataset at https://github.com/MrGiovanni/R-Super
Related papers
- Novel Deep Learning Architectures for Classification and Segmentation of Brain Tumors from MRI Images [0.8846824366848378]
Brain tumors pose a significant threat to human life, therefore it is very much necessary to detect them accurately in the early stages.<n>The incidence of brain tumors has risen amongst children and adolescents in recent years, resulting in a substantial volume of data.<n>We have proposed two novel Deep Learning Architectures - (a) SAETCN (Self-Attention Enhancement Tumor Classification Network) for the classification of different kinds of brain tumors.
arXiv Detail & Related papers (2025-12-06T18:49:57Z) - Scaling Tumor Segmentation: Best Lessons from Real and Synthetic Data [62.63749675817477]
AbdomenAtlas 2.0 is a dataset of 10,135 CT scans with a total of 15,130 tumor instances per-voxel manually annotated in six organs.<n>It achieves notable improvements over public datasets, with a +7% gain on DSC tests and +16% on out-of-distribution tests.
arXiv Detail & Related papers (2025-10-16T16:08:09Z) - Tumor Synthesis conditioned on Radiomics [1.3521165953335823]
We propose a tumor-generation model that utilizes radiomics features as generative conditions.<n>Our model employs a GAN-based model to generate tumor masks and a diffusion-based approach to generate tumor texture conditioned on radiomics features.<n>Our method allows the user to generate tumor images according to user-specified radiomics features such as size, shape, and texture at an arbitrary location.
arXiv Detail & Related papers (2025-09-29T02:04:12Z) - Learning Segmentation from Radiology Reports [46.073392569125375]
Tumor segmentation in CT scans is key for diagnosis, surgery, and prognosis.<n>Yet segmentation masks are scarce because their creation requires time and expertise.<n>We propose a report-supervision loss (R-Super) that converts radiology reports into voxel-wise supervision for tumor segmentation AI.
arXiv Detail & Related papers (2025-07-08T01:37:34Z) - From Pixel to Cancer: Cellular Automata in Computed Tomography [12.524228287083888]
Tumor synthesis seeks to create artificial tumors in medical images.
This paper establishes a set of generic rules to simulate tumor development.
We integrate the tumor state into the original computed tomography (CT) images to generate synthetic tumors across different organs.
arXiv Detail & Related papers (2024-03-11T06:46:31Z) - Towards Generalizable Tumor Synthesis [48.45704270448412]
Tumor synthesis enables the creation of artificial tumors in medical images, facilitating the training of AI models for tumor detection and segmentation.
This paper made a progressive stride toward generalizable tumor synthesis by leveraging a critical observation.
We have ascertained that generative AI models, e.g., Diffusion Models, can create realistic tumors generalized to a range of organs even when trained on a limited number of tumor examples from only one organ.
arXiv Detail & Related papers (2024-02-29T18:57:39Z) - Early Detection and Localization of Pancreatic Cancer by Label-Free
Tumor Synthesis [11.86190788916592]
Early detection and localization of pancreatic cancer can increase the 5-year survival rate for patients from 8.5% to 20%.
Training AI models require a vast number of annotated examples, but the availability of CT scans obtaining early-stage tumors is constrained.
We develop a tumor synthesis method that can synthesize enormous examples of small pancreatic tumors in the healthy pancreas without the need for manual annotation.
arXiv Detail & Related papers (2023-08-06T03:37:34Z) - CancerUniT: Towards a Single Unified Model for Effective Detection,
Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection
of CT Scans [45.83431075462771]
Human readers or radiologists routinely perform full-body multi-organ multi-disease detection and diagnosis in clinical practice.
Most medical AI systems are built to focus on single organs with a narrow list of a few diseases.
CancerUniT is a query-based Mask Transformer model with the output of multi-tumor prediction.
arXiv Detail & Related papers (2023-01-28T20:09:34Z) - Stan: Small tumor-aware network for breast ultrasound image segmentation [68.8204255655161]
We propose a novel deep learning architecture called Small Tumor-Aware Network (STAN) to improve the performance of segmenting tumors with different size.
The proposed approach outperformed the state-of-the-art approaches in segmenting small breast tumors.
arXiv Detail & Related papers (2020-02-03T22:25:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.