Characterizing drug mentions in COVID-19 Twitter Chatter
- URL: http://arxiv.org/abs/2007.10276v2
- Date: Fri, 9 Oct 2020 15:35:23 GMT
- Title: Characterizing drug mentions in COVID-19 Twitter Chatter
- Authors: Ramya Tekumalla, Juan M. Banda
- Abstract summary: In this work, we mined a large twitter dataset of 424 million tweets of COVID-19 chatter to identify discourse around drug mentions.
While seemingly a straightforward task, due to the informal nature of language use in Twitter, we demonstrate the need of machine learning alongside traditional automated methods to aid in this task.
We are able to recover almost 15% additional data, making misspelling handling a needed task as a pre-processing step when dealing with social media data.
- Score: 1.2400116527089997
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Since the classification of COVID-19 as a global pandemic, there have been
many attempts to treat and contain the virus. Although there is no specific
antiviral treatment recommended for COVID-19, there are several drugs that can
potentially help with symptoms. In this work, we mined a large twitter dataset
of 424 million tweets of COVID-19 chatter to identify discourse around drug
mentions. While seemingly a straightforward task, due to the informal nature of
language use in Twitter, we demonstrate the need of machine learning alongside
traditional automated methods to aid in this task. By applying these
complementary methods, we are able to recover almost 15% additional data,
making misspelling handling a needed task as a pre-processing step when dealing
with social media data.
Related papers
- Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal
Misinformation [83.2079454464572]
This paper describes our approach to the Image-Text Inconsistency Detection challenge of the DARPA Semantic Forensics (SemaFor) Program.
We collect Twitter-COMMs, a large-scale multimodal dataset with 884k tweets relevant to the topics of Climate Change, COVID-19, and Military Vehicles.
We train our approach, based on the state-of-the-art CLIP model, leveraging automatically generated random and hard negatives.
arXiv Detail & Related papers (2021-12-16T03:37:20Z) - Variance of Twitter Embeddings and Temporal Trends of COVID-19 cases [0.9449650062296824]
This paper proposes a method for harnessing social media, specifically Twitter, to predict the upcoming scenarios related to COVID-19 cases.
Using word embeddings to capture the semantic meaning of tweets, we identify Significant Dimensions (SDs)
Our methodology predicts the rise in cases with a lead time of 15 days and 30 days with R2 scores of 0.80 and 0.62 respectively.
arXiv Detail & Related papers (2021-09-30T18:03:10Z) - Unsupervised Text Mining of COVID-19 Records [0.0]
Twitter as a powerful tool can help researchers measure public health in response to COVID-19.
This paper preprocessed the existing medical dataset regarding COVID-19 named CORD-19 and annotated the dataset for supervised classification tasks.
arXiv Detail & Related papers (2021-09-08T05:57:22Z) - Project Achoo: A Practical Model and Application for COVID-19 Detection
from Recordings of Breath, Voice, and Cough [55.45063681652457]
We propose a machine learning method to quickly triage COVID-19 using recordings made on consumer devices.
The approach combines signal processing methods with fine-tuned deep learning networks and provides methods for signal denoising, cough detection and classification.
We have also developed and deployed a mobile application that uses symptoms checker together with voice, breath and cough signals to detect COVID-19 infection.
arXiv Detail & Related papers (2021-07-12T08:07:56Z) - Putting words into the system's mouth: A targeted attack on neural
machine translation using monolingual data poisoning [50.67997309717586]
We propose a poisoning attack in which a malicious adversary inserts a small poisoned sample of monolingual text into the training set of a system trained using back-translation.
This sample is designed to induce a specific, targeted translation behaviour, such as peddling misinformation.
We present two methods for crafting poisoned examples, and show that only a tiny handful of instances, amounting to only 0.02% of the training set, is sufficient to enact a successful attack.
arXiv Detail & Related papers (2021-07-12T08:07:09Z) - Leveraging Event Specific and Chunk Span features to Extract COVID
Events from tweets [0.0]
We describe our system entry for WNUT 2020 Shared Task-3.
The task was aimed at automating the extraction of a variety of COVID-19 related events from Twitter.
The system ranks 1st at the leader-board with F1 of 0.6598, without using any ensembles or additional datasets.
arXiv Detail & Related papers (2020-12-18T04:49:32Z) - NIT COVID-19 at WNUT-2020 Task 2: Deep Learning Model RoBERTa for
Identify Informative COVID-19 English Tweets [0.0]
This paper presents the model submitted by the NIT_COVID-19 team for identified informative COVID-19 English tweets at WNUT-2020 Task2.
The performance achieved by the proposed model for shared task WNUT 2020 Task2 is 89.14% in the F1-score metric.
arXiv Detail & Related papers (2020-11-11T05:20:39Z) - A Stance Data Set on Polarized Conversations on Twitter about the
Efficacy of Hydroxychloroquine as a Treatment for COVID-19 [0.0]
COVID-CQ is the first data set of Twitter users' stances in the context of the COVID-19 pandemic.
We have made this data set available to the research community via GitHub.
arXiv Detail & Related papers (2020-08-19T21:58:58Z) - TICO-19: the Translation Initiative for Covid-19 [112.5601530395345]
The Translation Initiative for COvid-19 (TICO-19) has made test and development data available to AI and MT researchers in 35 different languages.
The same data is translated into all of the languages represented, meaning that testing or development can be done for any pairing of languages in the set.
arXiv Detail & Related papers (2020-07-03T16:26:17Z) - Cross-lingual Transfer Learning for COVID-19 Outbreak Alignment [90.12602012910465]
We train on Italy's early COVID-19 outbreak through Twitter and transfer to several other countries.
Our experiments show strong results with up to 0.85 Spearman correlation in cross-country predictions.
arXiv Detail & Related papers (2020-06-05T02:04:25Z) - On the Generation of Medical Dialogues for COVID-19 [60.63485429268256]
People experiencing COVID19-related symptoms or exposed to risk factors have a pressing need to consult doctors.
Because of the shortage of medical professionals, many people cannot receive online consultations timely.
We aim to develop a medical dialogue system that can provide COVID19-related consultations.
arXiv Detail & Related papers (2020-05-11T21:23:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.