Related papers: Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent

Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent

URL: http://arxiv.org/abs/2512.20586v1
Date: Tue, 23 Dec 2025 18:32:17 GMT
Title: Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent
Authors: Humza Nusrat, Luke Francisco, Bing Luo, Hassan Bagher-Ebadian, Joshua Kim, Karen Chin-Snyder, Salim Siddiqui, Mira Shah, Eric Mellon, Mohammad Ghassemi, Anthony Doemer, Benjamin Movsas, Kundan Thind,
Abstract summary: We tested whether chain-of-thought reasoning improves agentic planning in a retrospective cohort of 41 patients with brain metastases treated with 18 Gy single-fraction radiosurgery.<n>The reasoning variant showed comparable plan dosimetry relative to human planners on primary endpoints.
Score: 3.1808466401480984
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Stereotactic radiosurgery (SRS) demands precise dose shaping around critical structures, yet black-box AI systems have limited clinical adoption due to opacity concerns. We tested whether chain-of-thought reasoning improves agentic planning in a retrospective cohort of 41 patients with brain metastases treated with 18 Gy single-fraction SRS. We developed SAGE (Secure Agent for Generative Dose Expertise), an LLM-based planning agent for automated SRS treatment planning. Two variants generated plans for each case: one using a non-reasoning model, one using a reasoning model. The reasoning variant showed comparable plan dosimetry relative to human planners on primary endpoints (PTV coverage, maximum dose, conformity index, gradient index; all p > 0.21) while reducing cochlear dose below human baselines (p = 0.022). When prompted to improve conformity, the reasoning model demonstrated systematic planning behaviors including prospective constraint verification (457 instances) and trade-off deliberation (609 instances), while the standard model exhibited none of these deliberative processes (0 and 7 instances, respectively). Content analysis revealed that constraint verification and causal explanation concentrated in the reasoning agent. The optimization traces serve as auditable logs, offering a path toward transparent automated planning.

Related papers

Personalized Medication Planning via Direct Domain Modeling and LLM-Generated Heuristics [69.68947055238557]
General domain description language (pddlp) used to generate personalized treatments.<n>General search used to scale up medication planning to levels allowing closer work with clinicians.<n>Results indicate dramatic improvements in coverage and planning time, scaling up the number of medications to at least 28, and bringing medication planning one step closer to practical applications.
arXiv Detail & Related papers (2026-01-07T08:19:29Z)
Zero-Shot Large Language Model Agents for Fully Automated Radiotherapy Treatment Planning [14.814676057920067]
A large language model (LLM)-based agent navigates inverse treatment planning for intensity-modulated radiation therapy (IMRT)<n>The agent's decision-making process is informed by current observations and previous optimization attempts and evaluations.<n>This study demonstrates the feasibility of a zero-shot, LLM-driven workflow for automated IMRT treatment planning in a commercial TPS.
arXiv Detail & Related papers (2025-10-12T19:21:21Z)
Organ-Agents: Virtual Human Physiology Simulator via LLMs [66.40796430669158]
Organ-Agents is a multi-agent framework that simulates human physiology via LLM-driven agents.<n>We curated data from 7,134 sepsis patients and 7,895 controls, generating high-resolution trajectories across 9 systems and 125 variables.<n>Organ-Agents achieved high simulation accuracy on 4,509 held-out patients, with per-system MSEs 0.16 and robustness across SOFA-based severity strata.
arXiv Detail & Related papers (2025-08-20T01:58:45Z)
New Insights into Automatic Treatment Planning for Cancer Radiotherapy Using Explainable Artificial Intelligence [1.8515971640245998]
This study aims to uncover the opaque decision-making process of an artificial intelligence (AI) agent for automatic treatment planning.<n>We examined a previously developed AI agent based on the Actor-Critic with Experience Replay (ACER) network, which automatically tunes treatment planning parameters.
arXiv Detail & Related papers (2025-08-19T19:38:16Z)
A learning-driven automatic planning framework for proton PBS treatments of H&N cancers [2.0765076553348316]
Inverse parameter is a learning-to-optimize (L2O) method that predicts update steps by learning from task-specific data distributions.<n>In experiments, total 97 patients with bilateral or ipsilateral H&N cancers are collected for training and testing.
arXiv Detail & Related papers (2025-08-14T21:50:31Z)
Parameterized Diffusion Optimization enabled Autoregressive Ordinal Regression for Diabetic Retinopathy Grading [53.11883409422728]
This work proposes a novel autoregressive ordinal regression method called AOR-DR.<n>We decompose the diabetic retinopathy grading task into a series of ordered steps by fusing the prediction of the previous steps with extracted image features.<n>We exploit the diffusion process to facilitate conditional probability modeling, enabling the direct use of continuous global image features for autoregression.
arXiv Detail & Related papers (2025-07-07T13:22:35Z)
Let LRMs Break Free from Overthinking via Self-Braking Tuning [68.93713497579853]
Large reasoning models (LRMs) have significantly enhanced their reasoning capabilities by generating longer chains of thought.<n>This performance gain comes at the cost of a substantial increase in redundant reasoning during the generation process.<n>We propose a novel framework, Self-Braking Tuning (SBT), which tackles overthinking from the perspective of allowing the model to regulate its own reasoning process.
arXiv Detail & Related papers (2025-05-20T16:53:40Z)
Automated Identification of Failure Cases in Organ at Risk Segmentation Using Distance Metrics: A Study on CT Data [0.19661503834671132]
Automated organ at risk (OAR) segmentation is crucial for radiation therapy planning in CT scans. The paper proposes a method to automatically identify failure cases by setting a threshold for the combination of Dice and Hausdorff distances.
arXiv Detail & Related papers (2023-08-21T11:14:49Z)
OpenKBP-Opt: An international and reproducible evaluation of 76 knowledge-based planning pipelines [48.547200649819615]
We establish an open framework for developing plan optimization models for knowledge-based planning (KBP) in radiotherapy. Our framework includes reference plans for 100 patients with head-and-neck cancer and high-quality dose predictions from 19 KBP models.
arXiv Detail & Related papers (2022-02-16T19:18:42Z)
Resource Planning for Hospitals Under Special Consideration of the COVID-19 Pandemic: Optimization and Sensitivity Analysis [87.31348761201716]
Crises like the COVID-19 pandemic pose a serious challenge to health-care institutions. BaBSim.Hospital is a tool for capacity planning based on discrete event simulation. We aim to investigate and optimize these parameters to improve BaBSim.Hospital.
arXiv Detail & Related papers (2021-05-16T12:38:35Z)
A feasibility study of a hyperparameter tuning approach to automated inverse planning in radiotherapy [68.8204255655161]
The purpose of this study is to automate the inverse planning process to reduce active planning time while maintaining plan quality. We investigated the impact of the choice of dose parameters, random and Bayesian search methods, and utility function form on planning time and plan quality. Using 100 samples was found to produce satisfactory plan quality, and the average planning time was 2.3 hours.
arXiv Detail & Related papers (2021-05-14T18:37:00Z)

This list is automatically generated from the titles and abstracts of the papers in this site.