Through the Twitter Glass: Detecting Questions in Micro-Text
- URL: http://arxiv.org/abs/2006.07732v1
- Date: Sat, 13 Jun 2020 22:34:01 GMT
- Title: Through the Twitter Glass: Detecting Questions in Micro-Text
- Authors: Kyle Dent and Sharoda Paul
- Abstract summary: In a separate study, we were interested in understanding people's Q&A habits on Twitter.
Finding questions within Twitter turned out to be a difficult challenge, so we considered applying some traditional NLP approaches to the problem.
This work is still preliminary, but in this paper we discuss the techniques we used and the lessons we learned.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In a separate study, we were interested in understanding people's Q&A habits
on Twitter. Finding questions within Twitter turned out to be a difficult
challenge, so we considered applying some traditional NLP approaches to the
problem. On the one hand, Twitter is full of idiosyncrasies, which make
processing it difficult. On the other, it is very restricted in length and
tends to employ simple syntactic constructions, which could help the
performance of NLP processing. In order to find out the viability of NLP and
Twitter, we built a pipeline of tools to work specifically with Twitter input
for the task of finding questions in tweets. This work is still preliminary,
but in this paper we discuss the techniques we used and the lessons we learned.
Related papers
- How to Engage Your Readers? Generating Guiding Questions to Promote Active Reading [60.19226384241482]
We introduce GuidingQ, a dataset of 10K in-text questions from textbooks and scientific articles.
We explore various approaches to generate such questions using language models.
We conduct a human study to understand the implication of such questions on reading comprehension.
arXiv Detail & Related papers (2024-07-19T13:42:56Z) - Which questions should I answer? Salience Prediction of Inquisitive Questions [118.097974193544]
We show that highly salient questions are empirically more likely to be answered in the same article.
We further validate our findings by showing that answering salient questions is an indicator of summarization quality in news.
arXiv Detail & Related papers (2024-04-16T21:33:05Z) - Use of social media and Natural Language Processing (NLP) in natural
hazard research [0.0]
In the works of Sasaki et al. (2010) and Earle et al. (2011) the authors explored the real-time interaction on Twitter for detecting natural hazards.
An inherent challenge for such an application is the natural language processing (NLP), which basically consists in converting the words in number.
In this report we implement a NLP machine learning process with advanced classification and classification applications.
arXiv Detail & Related papers (2023-04-17T15:03:05Z) - Twitter-Demographer: A Flow-based Tool to Enrich Twitter Data [31.19059013571499]
Twitter-Demographer is a flow-based tool to enrich Twitter data with additional information about tweets and users.
We discuss our design choices, inspired by the flow-based programming paradigm, to use black-box components that can easily be chained together and extended.
arXiv Detail & Related papers (2022-01-26T14:59:17Z) - Identification of Twitter Bots based on an Explainable ML Framework: the
US 2020 Elections Case Study [72.61531092316092]
This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data.
Supervised machine learning (ML) framework is adopted using an Extreme Gradient Boosting (XGBoost) algorithm.
Our study also deploys Shapley Additive Explanations (SHAP) for explaining the ML model predictions.
arXiv Detail & Related papers (2021-12-08T14:12:24Z) - What goes on inside rumour and non-rumour tweets and their reactions: A
Psycholinguistic Analyses [58.75684238003408]
psycho-linguistics analyses of social media text are vital for drawing meaningful conclusions to mitigate misinformation.
This research contributes by performing an in-depth psycholinguistic analysis of rumours related to various kinds of events.
arXiv Detail & Related papers (2021-11-09T07:45:11Z) - TweeNLP: A Twitter Exploration Portal for Natural Language Processing [5.315717519923849]
TweeNLP is a one-stop portal that organizes Twitter's natural language processing (NLP) data.
It curates 19,395 tweets (as of April 2021) from various NLP conferences and general NLP discussions.
arXiv Detail & Related papers (2021-06-19T15:11:22Z) - A Weakly Supervised Approach for Classifying Stance in Twitter Replies [11.139350549173953]
adversarial reactions are prevalent in online conversations.
Inferring those adverse views (or stance) from the text in replies is difficult.
We propose a weakly-supervised approach to predict the stance in Twitter replies.
arXiv Detail & Related papers (2021-03-12T06:02:45Z) - You Are What You Tweet: Profiling Users by Past Tweets to Improve Hate
Speech Detection [5.203329540700176]
We investigate profiling users by their past utterances as an informative prior to better predicting whether new utterances constitute hate speech.
To evaluate this, we augment three Twitter hate speech datasets with additional timeline data, then embed this additional context into a strong baseline model.
Promising results suggest merit for further investigation, though analysis is complicated by differences in annotation schemes and processes, as well as Twitter API limitations and data sharing policies.
arXiv Detail & Related papers (2020-12-16T17:17:47Z) - Universal Natural Language Processing with Limited Annotations: Try
Few-shot Textual Entailment as a Start [125.23550801424328]
Universal Few-shot textual Entailment (UFO-Entail)
We introduce Universal Few-shot textual Entailment (UFO-Entail)
We demonstrate that this framework enables a pretrained entailment model to work well on new entailment domains in a few-shot setting.
arXiv Detail & Related papers (2020-10-06T09:50:25Z) - Writer Identification Using Microblogging Texts for Social Media
Forensics [53.180678723280145]
We evaluate popular stylometric features, widely used in literary analysis, and specific Twitter features like URLs, hashtags, replies or quotes.
We test varying sized author sets and varying amounts of training/test texts per author.
arXiv Detail & Related papers (2020-07-31T00:23:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.