Reasoning Language Model for Personalized Lung Cancer Screening
- URL: http://arxiv.org/abs/2509.06169v1
- Date: Sun, 07 Sep 2025 18:38:39 GMT
- Title: Reasoning Language Model for Personalized Lung Cancer Screening
- Authors: Chuang Niu, Ge Wang,
- Abstract summary: Lung CT Screening Reporting and Data System (Lung-RADS) faces trade-offs between sensitivity and specificity.<n>We propose a reasoning language model (RLM) to integrate radiology findings with longitudinal medical records for individualized lung cancer risk assessment.
- Score: 10.241766336141685
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate risk assessment in lung cancer screening is critical for enabling early cancer detection and minimizing unnecessary invasive procedures. The Lung CT Screening Reporting and Data System (Lung-RADS) has been widely used as the standard framework for patient management and follow-up. Nevertheless, Lung-RADS faces trade-offs between sensitivity and specificity, as it stratifies risk solely based on lung nodule characteristics without incorporating various risk factors. Here we propose a reasoning language model (RLM) to integrate radiology findings with longitudinal medical records for individualized lung cancer risk assessment. Through a systematic study including dataset construction and distillation, supervised fine-tuning, reinforcement learning, and comprehensive evaluation, our model makes significant improvements in risk prediction performance on datasets in the national lung screening trial. Notably, RLM can decompose the risk evaluation task into sub-components, analyze the contributions of diverse risk factors, and synthesize them into a final risk score computed using our data-driven system equation. Our approach improves both predictive accuracy and monitorability through the chain of thought reasoning process, thereby facilitating clinical translation into lung cancer screening.
Related papers
- LungEvaty: A Scalable, Open-Source Transformer-based Deep Learning Model for Lung Cancer Risk Prediction in LDCT Screening [37.29507297342265]
LungEvaty is a transformer-based framework for predicting 1-6 year lung cancer risk from a single LDCT scan.<n>It learns directly from large-scale screening data to capture comprehensive anatomical and pathological cues relevant for malignancy risk.<n>LungEvaty was trained on more than 90,000 CT scans, including over 28,000 for fine-tuning and 6,000 for evaluation.
arXiv Detail & Related papers (2025-11-25T09:38:10Z) - Adaptable Cardiovascular Disease Risk Prediction from Heterogeneous Data using Large Language Models [70.64969663547703]
AdaCVD is an adaptable CVD risk prediction framework built on large language models extensively fine-tuned on over half a million participants from the UK Biobank.<n>It addresses key clinical challenges across three dimensions: it flexibly incorporates comprehensive yet variable patient information; it seamlessly integrates both structured data and unstructured text; and it rapidly adapts to new patient populations using minimal additional data.
arXiv Detail & Related papers (2025-05-30T14:42:02Z) - HACSurv: A Hierarchical Copula-Based Approach for Survival Analysis with Dependent Competing Risks [51.95824566163554]
We introduce HACSurv, a survival analysis method that learns Hierarchical Archimedean Copulas structures.<n>By capturing the dependencies between risks and censoring, HACSurv improves the accuracy of survival predictions.
arXiv Detail & Related papers (2024-10-19T18:52:18Z) - Improving Lung Cancer Diagnosis and Survival Prediction with Deep Learning and CT Imaging [12.276877277186284]
Lung cancer is a major cancer-related deaths, and early diagnosis and treatment are crucial for improving patients' survival outcomes.
We propose to employ neural convolutional networks of networks obtained between the risk of lung cancer and the lungs in CT experiments.
Results demonstrate the effectiveness of both the mini-batched loss and binary cross-entropy to predict both lung cancer and the risk of the occurrence.
arXiv Detail & Related papers (2024-08-18T05:45:08Z) - A new algorithm for Subgroup Set Discovery based on Information Gain [58.720142291102135]
Information Gained Subgroup Discovery (IGSD) is a new SD algorithm for pattern discovery.
We compare IGSD with two state-of-the-art SD algorithms: FSSD and SSD++.
IGSD provides better OR values than FSSD and SSD++, stating a higher dependence between patterns and targets.
arXiv Detail & Related papers (2023-07-26T21:42:34Z) - Penalized Deep Partially Linear Cox Models with Application to CT Scans
of Lung Cancer Patients [42.09584755334577]
Lung cancer is a leading cause of cancer mortality globally, highlighting the importance of understanding its mortality risks to design effective therapies.
The National Lung Screening Trial (NLST) employed computed tomography texture analysis to quantify the mortality risks of lung cancer patients.
We propose a novel Penalized Deep Partially Linear Cox Model (Penalized DPLC), which incorporates the SCAD penalty to select important texture features and employs a deep neural network to estimate the nonparametric component of the model.
arXiv Detail & Related papers (2023-03-09T15:38:16Z) - SurvLatent ODE : A Neural ODE based time-to-event model with competing
risks for longitudinal data improves cancer-associated Deep Vein Thrombosis
(DVT) prediction [68.8204255655161]
We propose a generative time-to-event model, SurvLatent ODE, which parameterizes a latent representation under irregularly sampled data.
Our model then utilizes the latent representation to flexibly estimate survival times for multiple competing events without specifying shapes of event-specific hazard function.
SurvLatent ODE outperforms the current clinical standard Khorana Risk scores for stratifying DVT risk groups.
arXiv Detail & Related papers (2022-04-20T17:28:08Z) - Enhancing Cancer Prediction in Challenging Screen-Detected Incident Lung
Nodules Using Time-Series Deep Learning [2.744770849264355]
Lung cancer screening (LCS) using annual low-dose computed tomography (CT) scanning has been proven to significantly reduce lung cancer mortality.
Improving risk stratification of malignancy risk in lung nodules can be enhanced using machine/deep learning algorithms.
Here we show the performance of our time-series deep learning model (DeepCAD-NLM-L) which integrates multi-model information across three longitudinal data domains.
arXiv Detail & Related papers (2022-03-30T18:40:36Z) - CoRSAI: A System for Robust Interpretation of CT Scans of COVID-19
Patients Using Deep Learning [133.87426554801252]
We adopted an approach based on using an ensemble of deep convolutionalneural networks for segmentation of lung CT scans.
Using our models we are able to segment the lesions, evaluatepatients dynamics, estimate relative volume of lungs affected by lesions and evaluate the lung damage stage.
arXiv Detail & Related papers (2021-05-25T12:06:55Z) - 3D Neural Network for Lung Cancer Risk Prediction on CT Volumes [0.6810862244331126]
Lung cancer is the most common cause of cancer death in the United States.
Lung cancer CT screening has been shown to reduce mortality by up to 40% and is now included in US screening guidelines.
Despite the use of standards for radiological diagnosis, persistent inter-grader variability and incomplete characterization of comprehensive imaging findings remain as limitations of current methods.
In this report, we reproduce a state-of-the-art deep learning algorithm for lung cancer risk prediction.
arXiv Detail & Related papers (2020-07-25T10:01:22Z) - Deep Learning for Automatic Pneumonia Detection [72.55423549641714]
Pneumonia is the leading cause of death among young children and one of the top mortality causes worldwide.
Computer-aided diagnosis systems showed the potential for improving diagnostic accuracy.
We develop the computational approach for pneumonia regions detection based on single-shot detectors, squeeze-and-excitation deep convolution neural networks, augmentations and multi-task learning.
arXiv Detail & Related papers (2020-05-28T10:54:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.