User Factor Adaptation for User Embedding via Multitask Learning
- URL: http://arxiv.org/abs/2102.11103v1
- Date: Mon, 22 Feb 2021 15:21:01 GMT
- Title: User Factor Adaptation for User Embedding via Multitask Learning
- Authors: Xiaolei Huang, Michael J. Paul, Robin Burke, Franck Dernoncourt, Mark
Dredze
- Abstract summary: We treat the user interest as domains and empirically examine how the user language can vary across the user factor.
We propose a user embedding model to account for the language variability of user interests via a multitask learning framework.
The model learns user language and its variations without human supervision.
- Score: 45.56193775870044
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Language varies across users and their interested fields in social media
data: words authored by a user across his/her interests may have different
meanings (e.g., cool) or sentiments (e.g., fast). However, most of the existing
methods to train user embeddings ignore the variations across user interests,
such as product and movie categories (e.g., drama vs. action). In this study,
we treat the user interest as domains and empirically examine how the user
language can vary across the user factor in three English social media
datasets. We then propose a user embedding model to account for the language
variability of user interests via a multitask learning framework. The model
learns user language and its variations without human supervision. While
existing work mainly evaluated the user embedding by extrinsic tasks, we
propose an intrinsic evaluation via clustering and evaluate user embeddings by
an extrinsic task, text classification. The experiments on the three
English-language social media datasets show that our proposed approach can
generally outperform baselines via adapting the user factor.
Related papers
- Prompt-based Personality Profiling: Reinforcement Learning for Relevance Filtering [8.20929362102942]
Author profiling is the task of inferring characteristics about individuals by analyzing content they share.
We propose a new method for author profiling which aims at distinguishing relevant from irrelevant content first, followed by the actual user profiling only with relevant data.
We evaluate our method for Big Five personality trait prediction on two Twitter corpora.
arXiv Detail & Related papers (2024-09-06T08:43:10Z) - SoMeR: Multi-View User Representation Learning for Social Media [1.7949335303516192]
We propose SoMeR, a Social Media user representation learning framework that incorporates temporal activities, text content, profile information, and network interactions to learn comprehensive user portraits.
SoMeR encodes user post streams as sequences of timestamped textual features, uses transformers to embed this along with profile data, and jointly trains with link prediction and contrastive learning objectives.
We demonstrate SoMeR's versatility through two applications: 1) Identifying inauthentic accounts involved in coordinated influence operations by detecting users posting similar content simultaneously, and 2) Measuring increased polarization in online discussions after major events by quantifying how users with different beliefs moved farther apart
arXiv Detail & Related papers (2024-05-02T22:26:55Z) - Learning from Emotions, Demographic Information and Implicit User
Feedback in Task-Oriented Document-Grounded Dialogues [59.516187851808375]
We introduce FEDI, the first English dialogue dataset for task-oriented document-grounded dialogues annotated with demographic information, user emotions and implicit feedback.
Our experiments with FLAN-T5, GPT-2 and LLaMA-2 show that these data have the potential to improve task completion and the factual consistency of the generated responses and user acceptance.
arXiv Detail & Related papers (2024-01-17T14:52:26Z) - Unsupervised Neural Stylistic Text Generation using Transfer learning
and Adapters [66.17039929803933]
We propose a novel transfer learning framework which updates only $0.3%$ of model parameters to learn style specific attributes for response generation.
We learn style specific attributes from the PERSONALITY-CAPTIONS dataset.
arXiv Detail & Related papers (2022-10-07T00:09:22Z) - Understanding How People Rate Their Conversations [73.17730062864314]
We conduct a study to better understand how people rate their interactions with conversational agents.
We focus on agreeableness and extraversion as variables that may explain variation in ratings.
arXiv Detail & Related papers (2022-06-01T00:45:32Z) - A New Generation of Perspective API: Efficient Multilingual
Character-level Transformers [66.9176610388952]
We present the fundamentals behind the next version of the Perspective API from Google Jigsaw.
At the heart of the approach is a single multilingual token-free Charformer model.
We demonstrate that by forgoing static vocabularies, we gain flexibility across a variety of settings.
arXiv Detail & Related papers (2022-02-22T20:55:31Z) - Like Article, Like Audience: Enforcing Multimodal Correlations for
Disinformation Detection [20.394457328537975]
correlations between user-generated and user-shared content can be leveraged for detecting disinformation in online news articles.
We develop a multimodal learning algorithm for disinformation detection.
arXiv Detail & Related papers (2021-08-31T14:50:16Z) - One Chatbot Per Person: Creating Personalized Chatbots based on Implicit
User Profiles [31.432585994256375]
Existing personalized approaches tried to incorporate several text descriptions as explicit user profiles.
We train a personalized language model to construct a general user profile from the user's historical responses.
We design a personalized decoder to fuse two decoding strategies, including generating a word from the generic vocabulary and copying one word from the user's personalized vocabulary.
arXiv Detail & Related papers (2021-08-20T20:33:12Z) - "What Do You Mean by That?" A Parser-Independent Interactive Approach
for Enhancing Text-to-SQL [49.85635994436742]
We include human in the loop and present a novel-independent interactive approach (PIIA) that interacts with users using multi-choice questions.
PIIA is capable of enhancing the text-to-domain performance with limited interaction turns by using both simulation and human evaluation.
arXiv Detail & Related papers (2020-11-09T02:14:33Z) - Topic Modeling on User Stories using Word Mover's Distance [4.378337862197529]
This paper focuses on topic modeling as a means to identify topics within a large set of crowd-generated user stories.
We evaluate the approaches on a publicly available set of 2,966 user stories written and categorized by crowd workers.
arXiv Detail & Related papers (2020-07-10T11:05:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.