Want to Identify, Extract and Normalize Adverse Drug Reactions in
Tweets? Use RoBERTa
- URL: http://arxiv.org/abs/2006.16146v1
- Date: Mon, 29 Jun 2020 16:10:27 GMT
- Title: Want to Identify, Extract and Normalize Adverse Drug Reactions in
Tweets? Use RoBERTa
- Authors: Katikapalli Subramanyam Kalyan, S.Sangeetha
- Abstract summary: This paper presents our approach for task 2 and task 3 of Social Media Mining for Health (SMM4H) 2020 shared tasks.
In task 2, we have to differentiate adverse drug reaction (ADR) tweets from non ADR tweets and is treated as binary classification.
In task 3, we extract ADR mentions and then mapping them to MedDRA codes.
Our models achieve promising results in both the tasks with significant improvements over average scores.
- Score: 5.33024001730262
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper presents our approach for task 2 and task 3 of Social Media Mining
for Health (SMM4H) 2020 shared tasks. In task 2, we have to differentiate
adverse drug reaction (ADR) tweets from nonADR tweets and is treated as binary
classification. Task3 involves extracting ADR mentions and then mapping them to
MedDRA codes. Extracting ADR mentions is treated as sequence labeling and
normalizing ADR mentions is treated as multi-class classification. Our system
is based on pre-trained language model RoBERTa and it achieves a) F1-score of
58% in task2 which is 12% more than the average score b) relaxed F1-score of
70.1% in ADR extraction of task 3 which is 13.7% more than the average score
and relaxed F1-score of 35% in ADR extraction + normalization of task3 which is
5.8% more than the average score. Overall, our models achieve promising results
in both the tasks with significant improvements over average scores.
Related papers
- PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation [68.17081518640934]
We propose a PrIrmitive-driVen waypOinT-aware world model for Robotic manipulation (PIVOT-R)
PIVOT-R consists of a Waypoint-aware World Model (WAWM) and a lightweight action prediction module.
Our PIVOT-R outperforms state-of-the-art open-source models on the SeaWave benchmark, achieving an average relative improvement of 19.45% across four levels of instruction tasks.
arXiv Detail & Related papers (2024-10-14T11:30:18Z) - ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents [49.00494558898933]
This paper describes our participation in Task 3 and Task 5 of the #SMM4H (Social Media Mining for Health) 2024 Workshop.
Task 3 is a multi-class classification task centered on tweets discussing the impact of outdoor environments on symptoms of social anxiety.
Task 5 involves a binary classification task focusing on tweets reporting medical disorders in children.
We applied transfer learning from pre-trained encoder-decoder models such as BART-base and T5-small to identify the labels of a set of given tweets.
arXiv Detail & Related papers (2024-04-30T17:06:20Z) - Cross-lingual Contextualized Phrase Retrieval [63.80154430930898]
We propose a new task formulation of dense retrieval, cross-lingual contextualized phrase retrieval.
We train our Cross-lingual Contextualized Phrase Retriever (CCPR) using contrastive learning.
On the phrase retrieval task, CCPR surpasses baselines by a significant margin, achieving a top-1 accuracy that is at least 13 points higher.
arXiv Detail & Related papers (2024-03-25T14:46:51Z) - Text Augmentations with R-drop for Classification of Tweets Self
Reporting Covid-19 [28.91836510067532]
This paper presents models created for the Social Media Mining for Health 2023 shared task.
Our approach involves a classification model that incorporates diverse textual augmentations.
Our system achieves an impressive F1 score of 0.877 on the test set.
arXiv Detail & Related papers (2023-11-06T14:18:16Z) - Tool-Augmented Reward Modeling [58.381678612409]
We propose a tool-augmented preference modeling approach, named Themis, to address limitations by empowering RMs with access to external environments.
Our study delves into the integration of external tools into RMs, enabling them to interact with diverse external sources.
In human evaluations, RLHF trained with Themis attains an average win rate of 32% when compared to baselines.
arXiv Detail & Related papers (2023-10-02T09:47:40Z) - Incorporating Emotions into Health Mention Classification Task on Social
Media [70.23889100356091]
We present a framework for health mention classification that incorporates affective features.
We evaluate our approach on 5 HMC-related datasets from different social media platforms.
Our results indicate that HMC models infused with emotional knowledge are an effective alternative.
arXiv Detail & Related papers (2022-12-09T18:38:41Z) - A PubMedBERT-based Classifier with Data Augmentation Strategy for
Detecting Medication Mentions in Tweets [2.539568419434224]
Twitter publishes a large number of user-generated text (tweets) on a daily basis.
entity recognition (NER) presents some special challenges for tweet data.
In this paper, we explore a PubMedBERT-based classifier trained with a combination of multiple data augmentation approaches.
Our method achieved an F1 score of 0.762, which is substantially higher than the mean of all submissions.
arXiv Detail & Related papers (2021-11-03T14:29:24Z) - Three Sentences Are All You Need: Local Path Enhanced Document Relation
Extraction [54.95848026576076]
We present an embarrassingly simple but effective method to select evidence sentences for document-level RE.
We have released our code at https://github.com/AndrewZhe/Three-Sentences-Are-All-You-Need.
arXiv Detail & Related papers (2021-06-03T12:29:40Z) - BERT based Transformers lead the way in Extraction of Health Information
from Social Media [0.0]
We participated in two tasks: (1) Classification, extraction and normalization of adverse drug effect (ADE) mentions in English tweets (Task-1) and (2) Classification of COVID-19 tweets containing symptoms (Task-6)
Our system ranked first among all the submissions for subtask-1(a) with an F1-score of 61%.
For subtask-1(b), our system obtained an F1-score of 50% with improvements up to +8% F1 over the score averaged across all submissions.
The BERTweet model achieved an F1 score of 94% on SMM4H 2021 Task-6.
arXiv Detail & Related papers (2021-04-15T10:50:21Z) - Fake News Detection in Social Media using Graph Neural Networks and NLP
Techniques: A COVID-19 Use-case [2.4937400423177767]
The paper presents our solutions for the MediaEval 2020 task namely FakeNews: Corona Virus and 5G Conspiracy Multimedia Twitter-Data-Based Analysis.
arXiv Detail & Related papers (2020-11-30T16:41:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.