CDrugRed: A Chinese Drug Recommendation Dataset for Discharge Medications in Metabolic Diseases
- URL: http://arxiv.org/abs/2510.21084v1
- Date: Fri, 24 Oct 2025 01:47:23 GMT
- Title: CDrugRed: A Chinese Drug Recommendation Dataset for Discharge Medications in Metabolic Diseases
- Authors: Juntao Li, Haobin Yuan, Ling Luo, Yan Jiang, Fan Wang, Ping Zhang, Huiyi Lv, Jian Wang, Yuanyuan Sun, Hongfei Lin
- Abstract summary: We present CDrugRed, a first publicly available Chinese drug recommendation dataset focused on discharge medications for metabolic diseases. The dataset includes 5,894 de-identified records from 3,190 patients, containing comprehensive information such as patient demographics, medical history, clinical course, and discharge diagnoses. We assess the utility of CDrugRed by benchmarking several state-of-the-art large language models (LLMs) on the discharge medication recommendation task.
- Score: 49.09102662968899
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Intelligent drug recommendation based on Electronic Health Records (EHRs) is critical for improving the quality and efficiency of clinical decision-making. By leveraging large-scale patient data, drug recommendation systems can assist physicians in selecting the most appropriate medications according to a patient's medical history, diagnoses, laboratory results, and comorbidities. However, the advancement of such systems is significantly hampered by the scarcity of publicly available, real-world EHR datasets, particularly in languages other than English. In this work, we present CDrugRed, a first publicly available Chinese drug recommendation dataset focused on discharge medications for metabolic diseases. The dataset includes 5,894 de-identified records from 3,190 patients, containing comprehensive information such as patient demographics, medical history, clinical course, and discharge diagnoses. We assess the utility of CDrugRed by benchmarking several state-of-the-art large language models (LLMs) on the discharge medication recommendation task. Experimental results show that while supervised fine-tuning improves model performance, there remains substantial room for improvement, with the best model achieving an F1 score of 0.5648 and a Jaccard score of 0.4477. This result highlights the complexity of the clinical drug recommendation task and establishes CDrugRed as a challenging and valuable resource for developing more robust and accurate drug recommendation systems. The dataset is publicly available to the research community under the data usage agreements at https://github.com/DUTIR-BioNLP/CDrugRed.
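The F1 and Jaccard scores quoted in the abstract compare a recommended medication set against the set actually prescribed at discharge. The sketch below shows one common way to compute them per record and then average over records. It is an illustration, not the authors' evaluation script: the averaging scheme, the handling of empty sets, the function names, and the drug names are all assumptions.

```python
from typing import Sequence, Set, Tuple


def jaccard(pred: Set[str], gold: Set[str]) -> float:
    """Size of the intersection divided by the size of the union."""
    union = pred | gold
    # Assumption: two empty sets count as a perfect match.
    return len(pred & gold) / len(union) if union else 1.0


def f1_score(pred: Set[str], gold: Set[str]) -> float:
    """Harmonic mean of set-level precision and recall."""
    if not pred and not gold:
        return 1.0  # Assumption: same convention as in jaccard().
    overlap = len(pred & gold)
    if overlap == 0:
        return 0.0  # Also covers the case where only one set is empty.
    precision = overlap / len(pred)
    recall = overlap / len(gold)
    return 2 * precision * recall / (precision + recall)


def mean_scores(
    preds: Sequence[Set[str]], golds: Sequence[Set[str]]
) -> Tuple[float, float]:
    """Average per-record Jaccard and F1 over all records."""
    if not preds or len(preds) != len(golds):
        raise ValueError("preds and golds must be non-empty and equally long")
    n = len(preds)
    mean_j = sum(jaccard(p, g) for p, g in zip(preds, golds)) / n
    mean_f = sum(f1_score(p, g) for p, g in zip(preds, golds)) / n
    return mean_j, mean_f


# Hypothetical records; the drug names are illustrative only.
gold_sets = [{"metformin", "atorvastatin", "aspirin"}, {"insulin glargine"}]
pred_sets = [{"metformin", "aspirin", "insulin glargine"}, {"insulin glargine"}]

mean_jaccard, mean_f1 = mean_scores(pred_sets, gold_sets)
print(f"Jaccard={mean_jaccard:.4f}  F1={mean_f1:.4f}")
```

For a single record, Jaccard is never higher than F1 because it divides the overlap by the full union, which is consistent with the reported 0.4477 sitting below 0.5648.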
Related papers
- Overview of CHIP 2025 Shared Task 2: Discharge Medication Recommendation for Metabolic Diseases Based on Chinese Electronic Health Records [47.67215289515775]
Discharge medication recommendation plays a critical role in ensuring treatment continuity, preventing readmission, and improving long-term management. This paper presents an overview of the CHIP 2025 Shared Task 2 competition, which aimed to develop state-of-the-art approaches for automatically recommending appropriate discharge medications. A total of 526 teams registered, with 167 and 95 teams submitting valid results to the Phase A and Phase B leaderboards, respectively. The top-performing team achieved the highest overall performance on the final test set, with a Jaccard score of 0.5102, F1 score of 0.6267, demonstrating
arXiv Detail & Related papers (2025-11-09T05:11:27Z)
- Retrieval Augmented Large Language Model System for Comprehensive Drug Contraindications [0.0]
The versatility of large language models (LLMs) has been explored across various sectors, but their application in healthcare poses challenges. This study enhances the capability of LLMs to address contraindications effectively by implementing a Retrieval Augmented Generation (RAG) pipeline.
arXiv Detail & Related papers (2025-08-08T09:09:03Z)
- Leave No Patient Behind: Enhancing Medication Recommendation for Rare Disease Patients [47.68396964741116]
We propose a novel model called Robust and Accurate REcommendations for Medication (RAREMed) to enhance accuracy for rare diseases.
It employs a transformer encoder with a unified input sequence approach to capture complex relationships among disease and procedure codes.
It provides accurate drug sets for both rare and common disease patients, thereby mitigating unfairness in medication recommendation systems.
arXiv Detail & Related papers (2024-03-26T14:36:22Z)
- CIDGMed: Causal Inference-Driven Medication Recommendation with Enhanced Dual-Granularity Learning [10.60553153370577]
Medication recommendation aims to integrate patients' long-term health records to provide accurate and safe medication combinations.
Existing methods often fail to deeply explore the true causal relationships between diseases/procedures and medications.
We propose the Causal Inference-driven Dual-Granularity Medication Recommendation method (CIDGMed).
arXiv Detail & Related papers (2024-03-01T08:50:27Z)
- Large Language Models for Healthcare Data Augmentation: An Example on Patient-Trial Matching [49.78442796596806]
We propose an innovative privacy-aware data augmentation approach for patient-trial matching (LLM-PTM).
Our experiments demonstrate a 7.32% average improvement in performance using the proposed LLM-PTM method, and the generalizability to new data is improved by 12.12%.
arXiv Detail & Related papers (2023-03-24T03:14:00Z)
- Prediction of drug effectiveness in rheumatoid arthritis patients based on machine learning algorithms [2.5759046095742453]
Rheumatoid arthritis (RA) is an autoimmune condition caused when patients' immune system mistakenly targets their own tissue.
Machine learning (ML) has the potential to identify patterns in patient electronic health records to forecast the best clinical treatment to improve patient outcomes.
This study introduced a Drug Response Prediction (TNF) framework with two main goals: 1) design a data processing pipeline to extract information from clinical data and preprocess it for functional use, and 2) predict RA patients' responses to drugs and evaluate the classification models' performance.
arXiv Detail & Related papers (2022-10-14T15:15:37Z)
- Knowledge-Driven New Drug Recommendation [88.35607943144261]
We develop a drug-dependent multi-phenotype few-shot learner to bridge the gap between existing and new drugs.
EDGE eliminates the false-negative supervision signal using an external drug-disease knowledge base.
Results show that EDGE achieves 7.3% improvement on the ROC-AUC score over the best baseline.
arXiv Detail & Related papers (2022-10-11T16:07:52Z)
- Conditional Generation Net for Medication Recommendation [73.09366442098339]
Medication recommendation aims to provide a proper set of medicines according to patients' diagnoses, which is a critical task in clinics.
We propose Conditional Generation Net (COGNet), which introduces a novel copy-or-predict mechanism to generate the set of medicines.
We validate the proposed model on the public MIMIC data set, and the experimental results show that the proposed model can outperform state-of-the-art approaches.
arXiv Detail & Related papers (2022-02-14T10:16:41Z)
- MeSIN: Multilevel Selective and Interactive Network for Medication Recommendation [9.173903754083927]
We propose a multilevel selective and interactive network (MeSIN) for medication recommendation.
First, an attentional selective module (ASM) is applied to assign flexible attention scores to the embeddings of different medical codes.
Second, we incorporate a novel interactive long-short term memory network (InLSTM) to reinforce the interactions of multilevel medical sequences in EHR data.
arXiv Detail & Related papers (2021-04-22T12:59:50Z)
- PREMIER: Personalized REcommendation for Medical prescrIptions from Electronic Records [8.365167718547296]
We design a two-stage attention-based personalized medication recommender system called PREMIER.
Our system takes into account the interactions among drugs in order to minimize the adverse effects for the patient.
Experiment results on MIMIC-III and a proprietary outpatient dataset show that PREMIER outperforms state-of-the-art medication recommendation systems.
arXiv Detail & Related papers (2020-08-28T04:48:32Z)