Related papers: Temporal Analysis of Drifting Hashtags in Textual Data Streams: A Graph-Based Application

Temporal Analysis of Drifting Hashtags in Textual Data Streams: A Graph-Based Application

URL: http://arxiv.org/abs/2402.10230v1
Date: Thu, 8 Feb 2024 21:58:53 GMT
Title: Temporal Analysis of Drifting Hashtags in Textual Data Streams: A Graph-Based Application
Authors: Cristiano M. Garcia and Alceu de Souza Britto Jr and Jean Paul Barddal
Abstract summary: We analyze hashtag drifts over time using concepts from graph analysis and textual data streams. We observe that the hashtag suffered drifts during the studied period across topics such as drug legalization, vaccination, political protests, war, and civil rights. The year 2021 was the most significant drifting year, in which the communities detected suggest that #mybodymychoice significantly drifted to vaccination and Covid-19-related topics.
Score: 3.3148826359547523
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Social media has played an important role since its emergence. People use the internet to express opinions about anything, making social media platforms a social sensor. Initially supported by Twitter, the hashtags are now in use on several social media platforms. Hashtags are helpful to tag, track, and group posts on similar topics. In this paper, we analyze hashtag drifts over time using concepts from graph analysis and textual data streams using the Girvan-Newman method to uncover hashtag communities in annual snapshots. More specifically, we analyzed the #mybodymychoice hashtag between 2018 and 2022. In addition, we offer insights about some hashtags found in the study. Furthermore, our approach can be useful for monitoring changes over time in opinions and sentiment patterns about an entity on social media. Even though the hashtag #mybodymychoice was initially coupled with women's rights, abortion, and bodily autonomy, we observe that it suffered drifts during the studied period across topics such as drug legalization, vaccination, political protests, war, and civil rights. The year 2021 was the most significant drifting year, in which the communities detected suggest that #mybodymychoice significantly drifted to vaccination and Covid-19-related topics.

Related papers

Hashtag Re-Appropriation for Audience Control on Recommendation-Driven Social Media Xiaohongshu (rednote) [17.873872681980437]
Women on Xiaohongshu (rednote) proactively re-appropriate hashtags by using them in posts unrelated to their literal meaning. We analyzed the practice of hashtag re-appropriation based on 5,800 collected posts and interviewed 24 active users from diverse backgrounds. This practice highlights how users can reclaim agency over content distribution on recommendation-driven platforms.
arXiv Detail & Related papers (2025-01-30T08:55:32Z)
RIGHT: Retrieval-augmented Generation for Mainstream Hashtag Recommendation [76.24205422163169]
We propose RetrIeval-augmented Generative Mainstream HashTag Recommender (RIGHT) RIGHT consists of three components: 1) a retriever seeks relevant hashtags from the entire tweet-hashtags set; 2) a selector enhances mainstream identification by introducing global signals; and 3) a generator incorporates input tweets and selected hashtags to directly generate the desired hashtags. Our method achieves significant improvements over state-of-the-art baselines. Moreover, RIGHT can be easily integrated into large language models, improving the performance of ChatGPT by more than 10%.
arXiv Detail & Related papers (2023-12-16T14:47:03Z)
Effects of Algorithmic Trend Promotion: Evidence from Coordinated Campaigns in Twitter's Trending Topics [5.524750830120598]
We study the effects of a hashtag appearing on the trending topics page on the number of tweets produced with that hashtag. We find there is a statistically significant, but modest, return to a hashtag being featured on trending topics.
arXiv Detail & Related papers (2023-04-08T15:22:36Z)
Hashtag-Guided Low-Resource Tweet Classification [31.810562621519804]
We propose a novel Hashtag-guided Tweet Classification model (HashTation) HashTation automatically generates meaningful hashtags for the input tweet to provide useful auxiliary signals for tweet classification. Experiments show that HashTation achieves significant improvements on seven low-resource tweet classification tasks.
arXiv Detail & Related papers (2023-02-20T18:21:02Z)
Attend and Select: A Segment Attention based Selection Mechanism for Microblog Hashtag Generation [69.73215951112452]
A hashtag is formed by tokens or phrases that may originate from various fragmentary segments of the original text. We propose an end-to-end Transformer-based generation model which consists of three phases: encoding, segments-selection, and decoding. We introduce two large-scale hashtag generation datasets, which are newly collected from Chinese Weibo and English Twitter.
arXiv Detail & Related papers (2021-06-06T15:13:58Z)
Towards A Sentiment Analyzer for Low-Resource Languages [0.0]
This research aims to analyse a sentiment of the users towards a particular trending topic that has been actively and massively discussed at that time. We use the hashtag textit#kpujangancurang that was the trending topic during the Indonesia presidential election in 2019. This research utilizes rapid miner tool to generate the twitter data and comparing Naive Bayes, K-Nearest Neighbor, Decision Tree, and Multi-Layer Perceptron classification methods to classify the sentiment of the twitter data.
arXiv Detail & Related papers (2020-11-12T13:50:00Z)
Hit ratio: An Evaluation Metric for Hashtag Recommendation [6.746400031322727]
We propose a new metric which we call hit ratio for hashtag recommendation. Most of the research in the area of hashtag recommendation have used classical metrics such as hit rate, precision, recall, and F1-score. A comparison of hit ratio with the classical evaluation metrics reveals their limitations.
arXiv Detail & Related papers (2020-10-03T02:07:41Z)
Echo Chambers on Social Media: A comparative analysis [64.2256216637683]
We introduce an operational definition of echo chambers and perform a massive comparative analysis on 1B pieces of contents produced by 1M users on four social media platforms. We infer the leaning of users about controversial topics and reconstruct their interaction networks by analyzing different features. We find support for the hypothesis that platforms implementing news feed algorithms like Facebook may elicit the emergence of echo-chambers.
arXiv Detail & Related papers (2020-04-20T20:00:27Z)
Whose Tweets are Surveilled for the Police: An Audit of Social-Media Monitoring Tool via Log Files [69.02688684221265]
We obtained log files from the Corvallis (Oregon) Police Department's use of social media monitoring software called DigitalStakeout. These log files include the results of proprietary searches by DigitalStakeout that were running over a period of 13 months and include 7240 social media posts. We observe differences in the demographics of the users whose Tweets are flagged by DigitalStakeout compared to the demographics of the Twitter users in the region.
arXiv Detail & Related papers (2020-01-23T19:35:12Z)
#MeToo on Campus: Studying College Sexual Assault at Scale Using Data Reported on Social Media [71.74529365205053]
We analyze the influence of the # trend on a pool of college followers. The results show that the majority of topics embedded in those # tweets detail sexual harassment stories. There exists a significant correlation between the prevalence of this trend and official reports on several major geographical regions.
arXiv Detail & Related papers (2020-01-16T18:05:46Z)
On Identifying Hashtags in Disaster Twitter Data [55.17975121160699]
We construct a unique dataset of disaster-related tweets annotated with hashtags useful for filtering actionable information. Using this dataset, we investigate Long Short Term Memory-based models within a Multi-Task Learning framework. The best performing model achieves an F1-score as high as 92.22%.
arXiv Detail & Related papers (2020-01-05T22:37:17Z)

This list is automatically generated from the titles and abstracts of the papers in this site.