A Weakly Supervised Approach for Classifying Stance in Twitter Replies
- URL: http://arxiv.org/abs/2103.07098v1
- Date: Fri, 12 Mar 2021 06:02:45 GMT
- Title: A Weakly Supervised Approach for Classifying Stance in Twitter Replies
- Authors: Sumeet Kumar, Ramon Villa Cox, Matthew Babcock, Kathleen M. Carley
- Abstract summary: adversarial reactions are prevalent in online conversations.
Inferring those adverse views (or stance) from the text in replies is difficult.
We propose a weakly-supervised approach to predict the stance in Twitter replies.
- Score: 11.139350549173953
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Conversations on social media (SM) are increasingly being used to investigate
social issues on the web, such as online harassment and rumor spread. For such
issues, a common thread of research uses adversarial reactions, e.g., replies
pointing out factual inaccuracies in rumors. Though adversarial reactions are
prevalent in online conversations, inferring those adverse views (or stance)
from the text in replies is difficult and requires complex natural language
processing (NLP) models. Moreover, conventional NLP models for stance mining
need labeled data for supervised learning. Getting labeled conversations can
itself be challenging as conversations can be on any topic, and topics change
over time. These challenges make learning the stance a difficult NLP problem.
In this research, we first create a new stance dataset comprised of three
different topics by labeling both users' opinions on the topics (as in pro/con)
and users' stance while replying to others' posts (as in favor/oppose). As we
find limitations with supervised approaches, we propose a weakly-supervised
approach to predict the stance in Twitter replies. Our novel method allows
using a smaller number of hashtags to generate weak labels for Twitter replies.
Compared to supervised learning, our method improves the mean F1-macro by 8\%
on the hand-labeled dataset without using any hand-labeled examples in the
training set. We further show the applicability of our proposed method on COVID
19 related conversations on Twitter.
Related papers
- NLP for Counterspeech against Hate: A Survey and How-To Guide [11.696212004806263]
We provide a guide for doing research on counterspeech, by describing - with detailed examples - the steps to undertake.
We discuss open challenges and future directions of counterspeech research in NLP.
arXiv Detail & Related papers (2024-03-29T10:32:44Z) - AutoConv: Automatically Generating Information-seeking Conversations
with Large Language Models [74.10293412011455]
We propose AutoConv for synthetic conversation generation.
Specifically, we formulate the conversation generation problem as a language modeling task.
We finetune an LLM with a few human conversations to capture the characteristics of the information-seeking process.
arXiv Detail & Related papers (2023-08-12T08:52:40Z) - NewsDialogues: Towards Proactive News Grounded Conversation [72.10055780635625]
We propose a novel task, Proactive News Grounded Conversation, in which a dialogue system can proactively lead the conversation based on some key topics of the news.
To further develop this novel task, we collect a human-to-human Chinese dialogue dataset tsNewsDialogues, which includes 1K conversations with a total of 14.6K utterances.
arXiv Detail & Related papers (2023-08-12T08:33:42Z) - Learning Stance Embeddings from Signed Social Graphs [7.202476284052426]
A key challenge in social network analysis is understanding the position, or stance, of people in the graph on a large set of topics.
We propose the Stance Embeddings Model, which jointly learns embeddings for each user and topic in signed social graphs.
SEM is able to perform cold-start topic stance detection, predicting the stance of a user on topics for which we have not observed their engagement.
arXiv Detail & Related papers (2022-01-27T17:22:49Z) - Findings on Conversation Disentanglement [28.874162427052905]
We build a learning model that learns utterance-to-utterance and utterance-to-thread classification.
Experiments on the Ubuntu IRC dataset show that this approach has the potential to outperform the conventional greedy approach.
arXiv Detail & Related papers (2021-12-10T05:54:48Z) - Identification of Twitter Bots based on an Explainable ML Framework: the
US 2020 Elections Case Study [72.61531092316092]
This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data.
Supervised machine learning (ML) framework is adopted using an Extreme Gradient Boosting (XGBoost) algorithm.
Our study also deploys Shapley Additive Explanations (SHAP) for explaining the ML model predictions.
arXiv Detail & Related papers (2021-12-08T14:12:24Z) - Disentangling Online Chats with DAG-Structured LSTMs [55.33014148383343]
DAG-LSTMs are a generalization of Tree-LSTMs that can handle directed acyclic dependencies.
We show that the novel model we propose achieves state of the art status on the task of recovering reply-to relations.
arXiv Detail & Related papers (2021-06-16T18:00:00Z) - Response Selection for Multi-Party Conversations with Dynamic Topic
Tracking [63.15158355071206]
We frame response selection as a dynamic topic tracking task to match the topic between the response and relevant conversation context.
We propose a novel multi-task learning framework that supports efficient encoding through large pretrained models.
Experimental results on the DSTC-8 Ubuntu IRC dataset show state-of-the-art results in response selection and topic disentanglement tasks.
arXiv Detail & Related papers (2020-10-15T14:21:38Z) - ConvAI3: Generating Clarifying Questions for Open-Domain Dialogue
Systems (ClariQ) [64.60303062063663]
This document presents a detailed description of the challenge on clarifying questions for dialogue systems (ClariQ)
The challenge is organized as part of the Conversational AI challenge series (ConvAI3) at Search Oriented Conversational AI (SCAI) EMNLP workshop in 2020.
arXiv Detail & Related papers (2020-09-23T19:48:02Z) - Through the Twitter Glass: Detecting Questions in Micro-Text [0.0]
In a separate study, we were interested in understanding people's Q&A habits on Twitter.
Finding questions within Twitter turned out to be a difficult challenge, so we considered applying some traditional NLP approaches to the problem.
This work is still preliminary, but in this paper we discuss the techniques we used and the lessons we learned.
arXiv Detail & Related papers (2020-06-13T22:34:01Z) - Stance in Replies and Quotes (SRQ): A New Dataset For Learning Stance in
Twitter Conversations [8.097870074875729]
We present the largest human-labeled stance dataset for Twitter conversations with over 5200 stance labels.
We include many baseline models for learning the stance in conversations and compare the performance of various models.
arXiv Detail & Related papers (2020-06-01T03:30:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.