PHEE: A Dataset for Pharmacovigilance Event Extraction from Text
- URL: http://arxiv.org/abs/2210.12560v1
- Date: Sat, 22 Oct 2022 21:57:42 GMT
- Title: PHEE: A Dataset for Pharmacovigilance Event Extraction from Text
- Authors: Zhaoyue Sun, Jiazheng Li, Gabriele Pergola, Byron C. Wallace, Bino
John, Nigel Greene, Joseph Kim, Yulan He
- Abstract summary: PHEE is a novel dataset for pharmacovigilance comprising over 5000 annotated events from medical case reports and biomedical literature.
We describe the hierarchical event schema designed to provide coarse and fine-grained information about patients' demographics, treatments and (side) effects.
- Score: 42.365919892504415
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The primary goal of drug safety researchers and regulators is to promptly
identify adverse drug reactions. Doing so may in turn prevent or reduce the
harm to patients and ultimately improve public health. Evaluating and
monitoring drug safety (i.e., pharmacovigilance) involves analyzing an ever
growing collection of spontaneous reports from health professionals,
physicians, and pharmacists, and information voluntarily submitted by patients.
In this scenario, facilitating analysis of such reports via automation has the
potential to rapidly identify safety signals. Unfortunately, public resources
for developing natural language models for this task are scant. We present
PHEE, a novel dataset for pharmacovigilance comprising over 5000 annotated
events from medical case reports and biomedical literature, making it the
largest such public dataset to date. We describe the hierarchical event schema
designed to provide coarse and fine-grained information about patients'
demographics, treatments and (side) effects. Along with the discussion of the
dataset, we present a thorough experimental evaluation of current
state-of-the-art approaches for biomedical event extraction, point out their
limitations, and highlight open challenges to foster future research in this
area.
Related papers
- Explainable Biomedical Hypothesis Generation via Retrieval Augmented Generation enabled Large Language Models [46.05020842978823]
Large Language Models (LLMs) have emerged as powerful tools to navigate this complex data landscape.
RAGGED is a comprehensive workflow designed to support investigators with knowledge integration and hypothesis generation.
arXiv Detail & Related papers (2024-07-17T07:44:18Z) - Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development [12.258245804049114]
The mining of adverse drug events (ADEs) is pivotal in pharmacovigilance, enhancing patient safety.
Traditional ADE detection methods are reliable but slow, not easily adaptable to large-scale operations.
Previous ADE mining studies have focused on text-based methodologies, overlooking visual cues.
We present a MultiModal Adverse Drug Event (MMADE) detection dataset, merging ADE-related textual information with visual aids.
arXiv Detail & Related papers (2024-05-24T17:58:42Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for
Real-World Pharmacovigilance [26.88690084623426]
We introduce BioDEX, a large-scale resource for Biomedical adverse Drug Event Extraction.
In this work, we consider the task of predicting the core information of the report given its originating paper.
We estimate human performance to be 72.0% F1, whereas our best model achieves 62.3% F1.
arXiv Detail & Related papers (2023-05-22T18:15:57Z) - SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity
Prediction [127.43571146741984]
Drug-Target Affinity (DTA) is of vital importance in early-stage drug discovery.
wet experiments remain the most reliable method, but they are time-consuming and resource-intensive.
Existing methods have primarily focused on developing techniques based on the available DTA data, without adequately addressing the data scarcity issue.
We present the SSM-DTA framework, which incorporates three simple yet highly effective strategies.
arXiv Detail & Related papers (2022-06-20T14:53:25Z) - Label-dependent and event-guided interpretable disease risk prediction
using EHRs [8.854691034104071]
This paper proposes a label-dependent and event-guided risk prediction model (LERP) to predict the presence of multiple disease risks.
We adopt a label-dependent mechanism that gives greater attention to words from medical notes that are semantically similar to the names of risk labels.
As the clinical events can also indicate the health status of patients, our model utilizes the information from events to generate an event-guided representation of medical notes.
arXiv Detail & Related papers (2022-01-18T07:24:28Z) - Predicting Patient Readmission Risk from Medical Text via Knowledge
Graph Enhanced Multiview Graph Convolution [67.72545656557858]
We propose a new method that uses medical text of Electronic Health Records for prediction.
We represent discharge summaries of patients with multiview graphs enhanced by an external knowledge graph.
Experimental results prove the effectiveness of our method, yielding state-of-the-art performance.
arXiv Detail & Related papers (2021-12-19T01:45:57Z) - Selection of pseudo-annotated data for adverse drug reaction
classification across drug groups [12.259552039796027]
We assess the robustness of state-of-the-art neural architectures across different drug groups.
We investigate several strategies to use pseudo-labeled data in addition to a manually annotated train set.
arXiv Detail & Related papers (2021-11-24T13:11:05Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.