CafeMed: Causal Attention Fusion Enhanced Medication Recommendation
- URL: http://arxiv.org/abs/2511.14064v1
- Date: Tue, 18 Nov 2025 02:44:27 GMT
- Title: CafeMed: Causal Attention Fusion Enhanced Medication Recommendation
- Authors: Kelin Ren, Chan-Yang Ju, Dong-Ho Lee,
- Abstract summary: CafeMed is a framework that integrates dynamic causal reasoning with cross-modal attention for safe and accurate medication recommendation.<n>Our results indicate that incorporating dynamic causal relationships and cross-modal synergies leads to more clinically-aligned and personalized medication recommendations.
- Score: 9.91438130100011
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Medication recommendation systems play a crucial role in assisting clinicians with personalized treatment decisions. While existing approaches have made significant progress in learning medication representations, they suffer from two fundamental limitations: (i) treating medical entities as independent features without modeling their synergistic effects on medication selection; (ii) employing static causal relationships that fail to adapt to patient-specific contexts and health states. To address these challenges, we propose CafeMed, a framework that integrates dynamic causal reasoning with cross-modal attention for safe and accurate medication recommendation. CafeMed introduces two key components: the Causal Weight Generator (CWG) that transforms static causal effects into dynamic modulation weights based on individual patient states, and the Channel Harmonized Attention Refinement Module (CHARM) that captures complex interdependencies between diagnoses and procedures. This design enables CafeMed to model how different medical conditions jointly influence treatment decisions while maintaining medication safety constraints. Extensive experiments on MIMIC-III and MIMIC-IV datasets demonstrate that CafeMed significantly outperforms state-of-the-art baselines, achieving superior accuracy in medication prediction while maintaining the lower drug--drug interaction rates. Our results indicate that incorporating dynamic causal relationships and cross-modal synergies leads to more clinically-aligned and personalized medication recommendations. Our code is released publicly at https://github.com/rkl71/CafeMed.
Related papers
- Towards Reliable Medical LLMs: Benchmarking and Enhancing Confidence Estimation of Large Language Models in Medical Consultation [97.36081721024728]
We propose the first benchmark for assessing confidence in multi-turn interaction during realistic medical consultations.<n>Our benchmark unifies three types of medical data for open-ended diagnostic generation.<n>We present MedConf, an evidence-grounded linguistic self-assessment framework.
arXiv Detail & Related papers (2026-01-22T04:51:39Z) - MedAlign: A Synergistic Framework of Multimodal Preference Optimization and Federated Meta-Cognitive Reasoning [52.064286116035134]
We develop MedAlign, a framework to ensure visually accurate LVLM responses for Medical Visual Question Answering (Med-VQA)<n>We first propose a multimodal Direct Preference Optimization (mDPO) objective to align preference learning with visual context.<n>We then design a Retrieval-Aware Mixture-of-Experts (RA-MoE) architecture that utilizes image and text similarity to route queries to a specialized and context-augmented LVLM.
arXiv Detail & Related papers (2025-10-24T02:11:05Z) - ARMR: Adaptively Responsive Network for Medication Recommendation [1.354560757162539]
We propose an Adaptively Responsive network for Medication Recommendation (ARMR)<n>ARMR incorporates 1) a piecewise temporal learning component that distinguishes between recent and distant patient history, enabling more nuanced temporal understanding, and 2) an adaptively responsive mechanism that adjusts attention to new and existing drugs based on the patient's current health state and medication history.<n>Experiments on the MIMIC-III and MIMIC-IV datasets indicate that ARMR has better performance compared with the state-of-the-art baselines in different evaluation metrics.
arXiv Detail & Related papers (2025-07-06T15:24:00Z) - Combating the Bucket Effect:Multi-Knowledge Alignment for Medication Recommendation [7.962835256943926]
We introduce a cross-modal medication encoder capable of seamlessly aligning data from different modalities.<n>We then combine the multi-knowledge medication representations with patient records for recommendations.<n>Experiments show that MKMed mitigates the "bucket effect" in data, and significantly outperforms state-of-the-art baselines in recommendation accuracy and safety.
arXiv Detail & Related papers (2025-04-25T05:47:15Z) - Natural Language-Assisted Multi-modal Medication Recommendation [97.07805345563348]
We introduce the Natural Language-Assisted Multi-modal Medication Recommendation(NLA-MMR)<n>The NLA-MMR is a multi-modal alignment framework designed to learn knowledge from the patient view and medication view jointly.<n>In this vein, we employ pretrained language models(PLMs) to extract in-domain knowledge regarding patients and medications.
arXiv Detail & Related papers (2025-01-13T09:51:50Z) - CausalMed: Causality-Based Personalized Medication Recommendation Centered on Patient health state [11.137353555292277]
CausalMed is a patient health state-centric model capable of enhancing the personalization of patient representations.
Our method learns more personalized patient representation and outperforms state-of-the-art models in accuracy and safety.
arXiv Detail & Related papers (2024-04-18T14:44:08Z) - Leave No Patient Behind: Enhancing Medication Recommendation for Rare Disease Patients [47.68396964741116]
We propose a novel model called Robust and Accurate REcommendations for Medication (RAREMed) to enhance accuracy for rare diseases.
It employs a transformer encoder with a unified input sequence approach to capture complex relationships among disease and procedure codes.
It provides accurate drug sets for both rare and common disease patients, thereby mitigating unfairness in medication recommendation systems.
arXiv Detail & Related papers (2024-03-26T14:36:22Z) - StratMed: Relevance Stratification between Biomedical Entities for
Sparsity on Medication Recommendation [9.296433860766165]
StratMed is a stratification strategy that overcomes the long-tailed problem and achieves fuller learning of sparse data.
It also utilizes a dual-property network to address the issue of mutual constraints on the safety and accuracy of medication combinations.
Our model reduces safety risk by 15.08%, improves accuracy by 0.36%, and reduces training time consumption by 81.66%.
arXiv Detail & Related papers (2023-08-31T14:59:32Z) - Improving Multiple Sclerosis Lesion Segmentation Across Clinical Sites:
A Federated Learning Approach with Noise-Resilient Training [75.40980802817349]
Deep learning models have shown promise for automatically segmenting MS lesions, but the scarcity of accurately annotated data hinders progress in this area.
We introduce a Decoupled Hard Label Correction (DHLC) strategy that considers the imbalanced distribution and fuzzy boundaries of MS lesions.
We also introduce a Centrally Enhanced Label Correction (CELC) strategy, which leverages the aggregated central model as a correction teacher for all sites.
arXiv Detail & Related papers (2023-08-31T00:36:10Z) - Decision Support System for Chronic Diseases Based on Drug-Drug
Interactions [67.9840225237587]
This paper presents a Decision Support System, called DSSDDI, based on drug-drug interactions to support doctors prescribing decisions.
The DDI module learns safer and more effective drug representations from the drug-drug interactions.
The MD module considers the representations of patients and drugs as context, DDI and patients' similarity as treatment, and medication use as outcome.
arXiv Detail & Related papers (2023-03-04T12:44:38Z) - MeSIN: Multilevel Selective and Interactive Network for Medication
Recommendation [9.173903754083927]
We propose a multilevel selective and interactive network (MeSIN) for medication recommendation.
First, an attentional selective module (ASM) is applied to assign flexible attention scores to different medical codes embeddings.
Second, we incorporate a novel interactive long-short term memory network (InLSTM) to reinforce the interactions of multilevel medical sequences in EHR data.
arXiv Detail & Related papers (2021-04-22T12:59:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.