Temporal Analysis of Drifting Hashtags in Textual Data Streams: A
Graph-Based Application
- URL: http://arxiv.org/abs/2402.10230v1
- Date: Thu, 8 Feb 2024 21:58:53 GMT
- Title: Temporal Analysis of Drifting Hashtags in Textual Data Streams: A
Graph-Based Application
- Authors: Cristiano M. Garcia and Alceu de Souza Britto Jr and Jean Paul Barddal
- Abstract summary: We analyze hashtag drifts over time using concepts from graph analysis and textual data streams.
We observe that the hashtag suffered drifts during the studied period across topics such as drug legalization, vaccination, political protests, war, and civil rights.
The year 2021 was the most significant drifting year, in which the communities detected suggest that #mybodymychoice significantly drifted to vaccination and Covid-19-related topics.
- Score: 3.3148826359547523
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Social media has played an important role since its emergence. People use the
internet to express opinions about anything, making social media platforms a
social sensor. Initially supported by Twitter, the hashtags are now in use on
several social media platforms. Hashtags are helpful to tag, track, and group
posts on similar topics. In this paper, we analyze hashtag drifts over time
using concepts from graph analysis and textual data streams using the
Girvan-Newman method to uncover hashtag communities in annual snapshots. More
specifically, we analyzed the #mybodymychoice hashtag between 2018 and 2022. In
addition, we offer insights about some hashtags found in the study.
Furthermore, our approach can be useful for monitoring changes over time in
opinions and sentiment patterns about an entity on social media. Even though
the hashtag #mybodymychoice was initially coupled with women's rights,
abortion, and bodily autonomy, we observe that it suffered drifts during the
studied period across topics such as drug legalization, vaccination, political
protests, war, and civil rights. The year 2021 was the most significant
drifting year, in which the communities detected suggest that #mybodymychoice
significantly drifted to vaccination and Covid-19-related topics.
Related papers
- RIGHT: Retrieval-augmented Generation for Mainstream Hashtag
Recommendation [76.24205422163169]
We propose RetrIeval-augmented Generative Mainstream HashTag Recommender (RIGHT)
RIGHT consists of three components: 1) a retriever seeks relevant hashtags from the entire tweet-hashtags set; 2) a selector enhances mainstream identification by introducing global signals; and 3) a generator incorporates input tweets and selected hashtags to directly generate the desired hashtags.
Our method achieves significant improvements over state-of-the-art baselines. Moreover, RIGHT can be easily integrated into large language models, improving the performance of ChatGPT by more than 10%.
arXiv Detail & Related papers (2023-12-16T14:47:03Z) - Effects of Algorithmic Trend Promotion: Evidence from Coordinated
Campaigns in Twitter's Trending Topics [5.524750830120598]
We study the effects of a hashtag appearing on the trending topics page on the number of tweets produced with that hashtag.
We find there is a statistically significant, but modest, return to a hashtag being featured on trending topics.
arXiv Detail & Related papers (2023-04-08T15:22:36Z) - Hashtag-Guided Low-Resource Tweet Classification [31.810562621519804]
We propose a novel Hashtag-guided Tweet Classification model (HashTation)
HashTation automatically generates meaningful hashtags for the input tweet to provide useful auxiliary signals for tweet classification.
Experiments show that HashTation achieves significant improvements on seven low-resource tweet classification tasks.
arXiv Detail & Related papers (2023-02-20T18:21:02Z) - Discourse Analysis of Covid-19 in Persian Twitter Social Networks Using
Graph Mining and Natural Language Processing [0.0]
The examined big data is five million tweets from 160,000 users of the Persian Twitter network.
The analyzed Iranian society does not consider itself responsible for the Covid-19 wicked problem.
The most active and most influential users' similarity is that political, national, and critical discourse construction is the predominant one.
arXiv Detail & Related papers (2021-09-01T10:39:20Z) - News consumption and social media regulations policy [70.31753171707005]
We analyze two social media that enforced opposite moderation methods, Twitter and Gab, to assess the interplay between news consumption and content regulation.
Our results show that the presence of moderation pursued by Twitter produces a significant reduction of questionable content.
The lack of clear regulation on Gab results in the tendency of the user to engage with both types of content, showing a slight preference for the questionable ones which may account for a dissing/endorsement behavior.
arXiv Detail & Related papers (2021-06-07T19:26:32Z) - Towards A Sentiment Analyzer for Low-Resource Languages [0.0]
This research aims to analyse a sentiment of the users towards a particular trending topic that has been actively and massively discussed at that time.
We use the hashtag textit#kpujangancurang that was the trending topic during the Indonesia presidential election in 2019.
This research utilizes rapid miner tool to generate the twitter data and comparing Naive Bayes, K-Nearest Neighbor, Decision Tree, and Multi-Layer Perceptron classification methods to classify the sentiment of the twitter data.
arXiv Detail & Related papers (2020-11-12T13:50:00Z) - Echo Chambers on Social Media: A comparative analysis [64.2256216637683]
We introduce an operational definition of echo chambers and perform a massive comparative analysis on 1B pieces of contents produced by 1M users on four social media platforms.
We infer the leaning of users about controversial topics and reconstruct their interaction networks by analyzing different features.
We find support for the hypothesis that platforms implementing news feed algorithms like Facebook may elicit the emergence of echo-chambers.
arXiv Detail & Related papers (2020-04-20T20:00:27Z) - Whose Tweets are Surveilled for the Police: An Audit of Social-Media
Monitoring Tool via Log Files [69.02688684221265]
We obtained log files from the Corvallis (Oregon) Police Department's use of social media monitoring software called DigitalStakeout.
These log files include the results of proprietary searches by DigitalStakeout that were running over a period of 13 months and include 7240 social media posts.
We observe differences in the demographics of the users whose Tweets are flagged by DigitalStakeout compared to the demographics of the Twitter users in the region.
arXiv Detail & Related papers (2020-01-23T19:35:12Z) - #MeToo on Campus: Studying College Sexual Assault at Scale Using Data
Reported on Social Media [71.74529365205053]
We analyze the influence of the # trend on a pool of college followers.
The results show that the majority of topics embedded in those # tweets detail sexual harassment stories.
There exists a significant correlation between the prevalence of this trend and official reports on several major geographical regions.
arXiv Detail & Related papers (2020-01-16T18:05:46Z) - On Identifying Hashtags in Disaster Twitter Data [55.17975121160699]
We construct a unique dataset of disaster-related tweets annotated with hashtags useful for filtering actionable information.
Using this dataset, we investigate Long Short Term Memory-based models within a Multi-Task Learning framework.
The best performing model achieves an F1-score as high as 92.22%.
arXiv Detail & Related papers (2020-01-05T22:37:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.