Learning User Embeddings from Temporal Social Media Data: A Survey
- URL: http://arxiv.org/abs/2105.07996v1
- Date: Mon, 17 May 2021 16:22:43 GMT
- Title: Learning User Embeddings from Temporal Social Media Data: A Survey
- Authors: Fatema Hasan, Kevin S. Xu, James R. Foulds, Shimei Pan
- Abstract summary: We survey representative work on learning a concise latent user representation (a.k.a. user embedding) that can capture the main characteristics of a social media user.
The learned user embeddings can later be used to support different downstream user analysis tasks such as personality modeling, suicidal risk assessment and purchase decision prediction.
- Score: 15.324014759254915
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: User-generated data on social media contain rich information about who we
are, what we like and how we make decisions. In this paper, we survey
representative work on learning a concise latent user representation (a.k.a.
user embedding) that can capture the main characteristics of a social media
user. The learned user embeddings can later be used to support different
downstream user analysis tasks such as personality modeling, suicidal risk
assessment and purchase decision prediction. The temporal nature of
user-generated data on social media has largely been overlooked in much of the
existing user embedding literature. In this survey, we focus on research that
bridges the gap by incorporating temporal/sequential information in user
representation learning. We categorize relevant papers along several key
dimensions, identify limitations in the current work and suggest future
research directions.
Related papers
- Do We Trust What They Say or What They Do? A Multimodal User Embedding Provides Personalized Explanations [35.77028281332307]
We propose Contribution-Aware Multimodal User Embedding (CAMUE) for social networks.
We show that our approach can provide personalized explainable predictions, automatically mitigating the impact of unreliable information.
Our work paves the way for more explainable, reliable, and effective social media user embedding.
arXiv Detail & Related papers (2024-09-04T02:17:32Z) - SoMeR: Multi-View User Representation Learning for Social Media [1.7949335303516192]
We propose SoMeR, a Social Media user representation learning framework that incorporates temporal activities, text content, profile information, and network interactions to learn comprehensive user portraits.
SoMeR encodes user post streams as sequences of timestamped textual features, uses transformers to embed this along with profile data, and jointly trains with link prediction and contrastive learning objectives.
We demonstrate SoMeR's versatility through two applications: 1) Identifying inauthentic accounts involved in coordinated influence operations by detecting users posting similar content simultaneously, and 2) Measuring increased polarization in online discussions after major events by quantifying how users with different beliefs moved farther apart
arXiv Detail & Related papers (2024-05-02T22:26:55Z) - Decoding the Silent Majority: Inducing Belief Augmented Social Graph
with Large Language Model for Response Forecasting [74.68371461260946]
SocialSense is a framework that induces a belief-centered graph on top of an existent social network, along with graph-based propagation to capture social dynamics.
Our method surpasses existing state-of-the-art in experimental evaluations for both zero-shot and supervised settings.
arXiv Detail & Related papers (2023-10-20T06:17:02Z) - Silence Speaks Volumes: Re-weighting Techniques for Under-Represented
Users in Fake News Detection [25.5495085102178]
A mere 1% of users generate the majority of the content on social networking sites.
The remaining users, though engaged to varying degrees, tend to be less active in content creation and largely silent.
We propose to leverage re-weighting techniques to make the silent majority heard, and in turn, investigate whether the cues from these users can improve the performance of the current models for the downstream task of fake news detection.
arXiv Detail & Related papers (2023-08-03T20:04:20Z) - Likes and Fragments: Examining Perceptions of Time Spent on TikTok [24.079009867311637]
This work builds on prior studies to explore a novel social media platform in the context of use time: TikTok.
We conduct platform-independent measurements of people's self-reported and server-logged TikTok usage to understand how users' demographics and platform engagement influence their perceptions of the time they spend on the platform and their estimation accuracy.
arXiv Detail & Related papers (2023-03-03T16:06:22Z) - Predicting mental health using social media: A roadmap for future
development [0.0]
Mental disorders such as depression and suicidal ideation affect more than 300 million people over the world.
On social media, mental disorder symptoms can be observed, and automated approaches are increasingly capable of detecting them.
This research offers a roadmap for analysis, where mental state detection can be based on machine learning techniques.
arXiv Detail & Related papers (2023-01-25T08:08:29Z) - Personalized multi-faceted trust modeling to determine trust links in
social media and its potential for misinformation management [61.88858330222619]
We present an approach for predicting trust links between peers in social media.
We propose a data-driven multi-faceted trust modeling which incorporates many distinct features for a comprehensive analysis.
Illustrated in a trust-aware item recommendation task, we evaluate the proposed framework in the context of a large Yelp dataset.
arXiv Detail & Related papers (2021-11-11T19:40:51Z) - Two-Faced Humans on Twitter and Facebook: Harvesting Social Multimedia
for Human Personality Profiling [74.83957286553924]
We infer the Myers-Briggs Personality Type indicators by applying a novel multi-view fusion framework, called "PERS"
Our experimental results demonstrate the PERS's ability to learn from multi-view data for personality profiling by efficiently leveraging on the significantly different data arriving from diverse social multimedia sources.
arXiv Detail & Related papers (2021-06-20T10:48:49Z) - Can You be More Social? Injecting Politeness and Positivity into
Task-Oriented Conversational Agents [60.27066549589362]
Social language used by human agents is associated with greater users' responsiveness and task completion.
The model uses a sequence-to-sequence deep learning architecture, extended with a social language understanding element.
Evaluation in terms of content preservation and social language level using both human judgment and automatic linguistic measures shows that the model can generate responses that enable agents to address users' issues in a more socially appropriate way.
arXiv Detail & Related papers (2020-12-29T08:22:48Z) - Improving time use measurement with personal big data collection -- the
experience of the European Big Data Hackathon 2019 [62.997667081978825]
This article assesses the experience with i-Log at the European Big Data Hackathon 2019, a satellite event of the New Techniques and Technologies for Statistics (NTTS) conference, organised by Eurostat.
i-Log is a system that allows to capture personal big data from smartphones' internal sensors to be used for time use measurement.
arXiv Detail & Related papers (2020-04-24T18:40:08Z) - I Know Where You Are Coming From: On the Impact of Social Media Sources
on AI Model Performance [79.05613148641018]
We will study the performance of different machine learning models when being learned on multi-modal data from different social networks.
Our initial experimental results reveal that social network choice impacts the performance.
arXiv Detail & Related papers (2020-02-05T11:10:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.