SAFER: A Calibrated Risk-Aware Multimodal Recommendation Model for Dynamic Treatment Regimes
- URL: http://arxiv.org/abs/2506.06649v1
- Date: Sat, 07 Jun 2025 04:05:43 GMT
- Title: SAFER: A Calibrated Risk-Aware Multimodal Recommendation Model for Dynamic Treatment Regimes
- Authors: Yishan Shen, Yuyang Ye, Hui Xiong, Yong Chen,
- Abstract summary: We introduce SAFER, a risk-aware calibrated recommendation framework for dynamic treatment regimes (DTRs)<n> SAFER integrates structured EHR and clinical notes, enabling them to learn from each other and address label uncertainty.<n>Experiments demonstrate that SAFER outperforms state-of-the-art baselines across multiple recommendation metrics and counterfactual mortality rate.
- Score: 18.881967326672456
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Dynamic treatment regimes (DTRs) are critical to precision medicine, optimizing long-term outcomes through personalized, real-time decision-making in evolving clinical contexts, but require careful supervision for unsafe treatment risks. Existing efforts rely primarily on clinician-prescribed gold standards despite the absence of a known optimal strategy, and predominantly using structured EHR data without extracting valuable insights from clinical notes, limiting their reliability for treatment recommendations. In this work, we introduce SAFER, a calibrated risk-aware tabular-language recommendation framework for DTR that integrates both structured EHR and clinical notes, enabling them to learn from each other, and addresses inherent label uncertainty by assuming ambiguous optimal treatment solution for deceased patients. Moreover, SAFER employs conformal prediction to provide statistical guarantees, ensuring safe treatment recommendations while filtering out uncertain predictions. Experiments on two publicly available sepsis datasets demonstrate that SAFER outperforms state-of-the-art baselines across multiple recommendation metrics and counterfactual mortality rate, while offering robust formal assurances. These findings underscore SAFER potential as a trustworthy and theoretically grounded solution for high-stakes DTR applications.
Related papers
- Lessons Learned from Evaluation of LLM based Multi-agents in Safer Therapy Recommendation [9.84660526673816]
This study investigated the feasibility and value of using a Large Language Model (LLM)-based multi-agent system for safer therapy recommendations.<n>We designed a single agent and a MAS framework simulating multidisciplinary team (MDT) decision-making.<n>We compared MAS performance with single-agent approaches and real-world benchmarks.
arXiv Detail & Related papers (2025-07-15T02:01:38Z) - Uncertainty-Driven Expert Control: Enhancing the Reliability of Medical Vision-Language Models [52.2001050216955]
Existing methods aim to enhance the performance of Medical Vision Language Model (MedVLM) by adjusting model structure, fine-tuning with high-quality data, or through preference fine-tuning.<n>We propose an expert-in-the-loop framework named Expert-Controlled-Free Guidance (Expert-CFG) to align MedVLM with clinical expertise without additional training.
arXiv Detail & Related papers (2025-07-12T09:03:30Z) - Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies [9.123989363028945]
Out-of-distribution (OOD) issues pose significant risks when applying offline reinforcement learning (RL) in healthcare scenarios.<n>We propose textitOffline Guarded Safe Reinforcement Learning ($mathsfOGSRL$), a theoretically grounded model-based offline RL framework.
arXiv Detail & Related papers (2025-05-22T05:22:03Z) - SConU: Selective Conformal Uncertainty in Large Language Models [59.25881667640868]
We propose a novel approach termed Selective Conformal Uncertainty (SConU)<n>We develop two conformal p-values that are instrumental in determining whether a given sample deviates from the uncertainty distribution of the calibration set at a specific manageable risk level.<n>Our approach not only facilitates rigorous management of miscoverage rates across both single-domain and interdisciplinary contexts, but also enhances the efficiency of predictions.
arXiv Detail & Related papers (2025-04-19T03:01:45Z) - Training-Aware Risk Control for Intensity Modulated Radiation Therapies Quality Assurance with Conformal Prediction [7.227232362460348]
Measurement quality assurance practices play a key role in the safe use of Intensity Modulated Radiation Therapies (IMRT) for cancer treatment.<n>These practices have reduced measurement-based IMRT QA failure below 1%.<n>We propose a new training-aware conformal risk control method by combining the benefit of conformal risk control and conformal training.
arXiv Detail & Related papers (2025-01-15T17:19:51Z) - Uncertainty-Penalized Direct Preference Optimization [52.387088396044206]
We develop a pessimistic framework for DPO by introducing preference uncertainty penalization schemes.
The penalization serves as a correction to the loss which attenuates the loss gradient for uncertain samples.
We show improved overall performance compared to vanilla DPO, as well as better completions on prompts from high-uncertainty chosen/rejected responses.
arXiv Detail & Related papers (2024-10-26T14:24:37Z) - Towards Integrating Epistemic Uncertainty Estimation into the Radiotherapy Workflow [40.07325268305058]
The precision of contouring target structures and organs-at-risk (OAR) in radiotherapy planning is crucial for ensuring treatment efficacy and patient safety.
Recent advancements in deep learning (DL) have significantly improved OAR contouring performance.
However, the reliability of these models, especially in the presence of out-of-distribution (OOD) scenarios, remains a concern in clinical settings.
arXiv Detail & Related papers (2024-09-27T10:55:58Z) - Empowering Clinicians with Medical Decision Transformers: A Framework for Sepsis Treatment [5.0005174003014865]
We propose the medical decision transformer (MeDT) to solve tasks in safety-critical settings.
MeDT uses the decision transformer architecture to learn a policy for drug dosage recommendation.
MeDT captures complex dependencies among a patient's medical history, treatment decisions, outcomes, and short-term effects on stability.
arXiv Detail & Related papers (2024-07-28T03:40:00Z) - Safe Deployment for Counterfactual Learning to Rank with Exposure-Based
Risk Minimization [63.93275508300137]
We introduce a novel risk-aware Counterfactual Learning To Rank method with theoretical guarantees for safe deployment.
Our experimental results demonstrate the efficacy of our proposed method, which is effective at avoiding initial periods of bad performance when little data is available.
arXiv Detail & Related papers (2023-04-26T15:54:23Z) - Towards Reliable Medical Image Segmentation by utilizing Evidential Calibrated Uncertainty [52.03490691733464]
We introduce DEviS, an easily implementable foundational model that seamlessly integrates into various medical image segmentation networks.
By leveraging subjective logic theory, we explicitly model probability and uncertainty for the problem of medical image segmentation.
DeviS incorporates an uncertainty-aware filtering module, which utilizes the metric of uncertainty-calibrated error to filter reliable data.
arXiv Detail & Related papers (2023-01-01T05:02:46Z) - SDF-Bayes: Cautious Optimism in Safe Dose-Finding Clinical Trials with
Drug Combinations and Heterogeneous Patient Groups [84.63561578944183]
This paper proposes a novel Bayesian design, SDF-Bayes, for finding the maximum tolerated dose (MTD) of a drug in a clinical trial.
Rather than the conventional principle of escalating or de-escalating the current dose of one drug, SDF-Bayes proceeds by cautious optimism.
Extensive experiments based on both synthetic and real-world datasets demonstrate the advantages of SDF-Bayes over state of the art DC trial designs.
arXiv Detail & Related papers (2021-01-26T18:59:26Z) - Multicategory Angle-based Learning for Estimating Optimal Dynamic
Treatment Regimes with Censored Data [12.499787110182632]
An optimal treatment regime (DTR) consists of a sequence of decision rules in maximizing long-term benefits.
In this paper, we develop a novel angle-based approach to target the optimal DTR under a multicategory treatment framework.
Our numerical studies show that the proposed method outperforms competing methods in terms of maximizing the conditional survival function.
arXiv Detail & Related papers (2020-01-14T05:19:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.