GASCADE: Grouped Summarization of Adverse Drug Event for Enhanced Cancer Pharmacovigilance
- URL: http://arxiv.org/abs/2505.04284v1
- Date: Wed, 07 May 2025 09:40:18 GMT
- Title: GASCADE: Grouped Summarization of Adverse Drug Event for Enhanced Cancer Pharmacovigilance
- Authors: Sofia Jamil, Aryan Dabad, Bollampalli Areen Reddy, Sriparna Saha, Rajiv Misra, Adil A. Shakur,
- Abstract summary: This work introduces the task of grouped summarization of adverse drug events reported by multiple patients using the same drug for cancer treatment.<n>To address the challenge of limited resources in cancer pharmacovigilance, we present the MultiLabeled Cancer Adverse Drug Reaction and Summarization dataset.<n>We propose the Grouping and Abstractive Summarization of Cancer Adverse Drug events (GASCADE) framework, a novel pipeline that combines the information extraction capabilities of Large Language Models with the summarization power of the encoder-decoder T5 model.
- Score: 11.33767380854598
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In the realm of cancer treatment, summarizing adverse drug events (ADEs) reported by patients using prescribed drugs is crucial for enhancing pharmacovigilance practices and improving drug-related decision-making. While the volume and complexity of pharmacovigilance data have increased, existing research in this field has predominantly focused on general diseases rather than specifically addressing cancer. This work introduces the task of grouped summarization of adverse drug events reported by multiple patients using the same drug for cancer treatment. To address the challenge of limited resources in cancer pharmacovigilance, we present the MultiLabeled Cancer Adverse Drug Reaction and Summarization (MCADRS) dataset. This dataset includes pharmacovigilance posts detailing patient concerns regarding drug efficacy and adverse effects, along with extracted labels for drug names, adverse drug events, severity, and adversity of reactions, as well as summaries of ADEs for each drug. Additionally, we propose the Grouping and Abstractive Summarization of Cancer Adverse Drug events (GASCADE) framework, a novel pipeline that combines the information extraction capabilities of Large Language Models (LLMs) with the summarization power of the encoder-decoder T5 model. Our work is the first to apply alignment techniques, including advanced algorithms like Direct Preference Optimization, to encoder-decoder models using synthetic datasets for summarization tasks. Through extensive experiments, we demonstrate the superior performance of GASCADE across various metrics, validated through both automated assessments and human evaluations. This multitasking approach enhances drug-related decision-making and fosters a deeper understanding of patient concerns, paving the way for advancements in personalized and responsive cancer care. The code and dataset used in this work are publicly available.
Related papers
- Automated Detection of Clinical Entities in Lung and Breast Cancer Reports Using NLP Techniques [0.0]
We focus on lung and breast cancer due to their high incidence and the significant impact they have on public health.<n>To enhance the accuracy and efficiency of data extraction, we utilize GMV's NLP tool uQuery.
arXiv Detail & Related papers (2025-05-14T20:44:29Z) - Emerging Drug Interaction Prediction Enabled by Flow-based Graph Neural
Network with Biomedical Network [69.16939798838159]
We propose EmerGNN, a graph neural network (GNN) that can effectively predict interactions for emerging drugs.
EmerGNN learns pairwise representations of drugs by extracting the paths between drug pairs, propagating information from one drug to the other, and incorporating the relevant biomedical concepts on the paths.
Overall, EmerGNN has higher accuracy than existing approaches in predicting interactions for emerging drugs and can identify the most relevant information on the biomedical network.
arXiv Detail & Related papers (2023-11-15T06:34:00Z) - Enhancing drug and cell line representations via contrastive learning
for improved anti-cancer drug prioritization [0.7070726553564699]
We propose the use of contrastive learning to improve learned drug and cell line representations.
We find that our learned representations exhibit a more balances reliance on drug- and cell line-derived features when making predictions.
arXiv Detail & Related papers (2023-10-20T04:18:47Z) - ADRNet: A Generalized Collaborative Filtering Framework Combining
Clinical and Non-Clinical Data for Adverse Drug Reaction Prediction [49.56476929112382]
Adverse drug reaction (ADR) prediction plays a crucial role in both health care and drug discovery.
We propose ADRNet, a generalized collaborative filtering framework combining clinical and non-clinical data for drug-ADR prediction.
arXiv Detail & Related papers (2023-08-03T11:28:12Z) - A clustering and graph deep learning-based framework for COVID-19 drug
repurposing [0.3359875577705538]
This study presents a novel unsupervised machine learning framework that utilizes a graph-based autoencoder for multi-feature type clustering on heterogeneous drug data.
The dataset consists of 438 drugs, of which 224 are under clinical trials for COVID-19.
Our framework relies on reported drug data, including its pharmacological properties, chemical/physical properties, interaction with the host, and efficacy in different publicly available COVID-19 assays.
arXiv Detail & Related papers (2023-06-24T15:00:47Z) - SynerGPT: In-Context Learning for Personalized Drug Synergy Prediction
and Drug Design [64.69434941796904]
We propose a novel setting and models for in-context drug synergy learning.
We are given a small "personalized dataset" of 10-20 drug synergy relationships in the context of specific cancer cell targets.
Our goal is to predict additional drug synergy relationships in that context.
arXiv Detail & Related papers (2023-06-19T17:03:46Z) - Drug Synergistic Combinations Predictions via Large-Scale Pre-Training
and Graph Structure Learning [82.93806087715507]
Drug combination therapy is a well-established strategy for disease treatment with better effectiveness and less safety degradation.
Deep learning models have emerged as an efficient way to discover synergistic combinations.
Our framework achieves state-of-the-art results in comparison with other deep learning-based methods.
arXiv Detail & Related papers (2023-01-14T15:07:43Z) - PHEE: A Dataset for Pharmacovigilance Event Extraction from Text [42.365919892504415]
PHEE is a novel dataset for pharmacovigilance comprising over 5000 annotated events from medical case reports and biomedical literature.
We describe the hierarchical event schema designed to provide coarse and fine-grained information about patients' demographics, treatments and (side) effects.
arXiv Detail & Related papers (2022-10-22T21:57:42Z) - Knowledge-Driven New Drug Recommendation [88.35607943144261]
We develop a drug-dependent multi-phenotype few-shot learner to bridge the gap between existing and new drugs.
EDGE eliminates the false-negative supervision signal using an external drug-disease knowledge base.
Results show that EDGE achieves 7.3% improvement on the ROC-AUC score over the best baseline.
arXiv Detail & Related papers (2022-10-11T16:07:52Z) - SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity
Prediction [127.43571146741984]
Drug-Target Affinity (DTA) is of vital importance in early-stage drug discovery.
wet experiments remain the most reliable method, but they are time-consuming and resource-intensive.
Existing methods have primarily focused on developing techniques based on the available DTA data, without adequately addressing the data scarcity issue.
We present the SSM-DTA framework, which incorporates three simple yet highly effective strategies.
arXiv Detail & Related papers (2022-06-20T14:53:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.