Matching Theory and Data with Personal-ITY: What a Corpus of Italian
YouTube Comments Reveals About Personality
- URL: http://arxiv.org/abs/2011.07009v1
- Date: Wed, 11 Nov 2020 12:45:33 GMT
- Title: Matching Theory and Data with Personal-ITY: What a Corpus of Italian
YouTube Comments Reveals About Personality
- Authors: Elisa Bassignana, Malvina Nissim and Viviana Patti
- Abstract summary: We create a novel corpus of YouTube comments in Italian, where authors are labelled with personality traits.
The traits are derived from one of the mainstream personality theories in psychology research, named MBTI.
We study the task of personality prediction in itself on our corpus as well as on TwiSty.
- Score: 11.38723572165938
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As a contribution to personality detection in languages other than English,
we rely on distant supervision to create Personal-ITY, a novel corpus of
YouTube comments in Italian, where authors are labelled with personality
traits. The traits are derived from one of the mainstream personality theories
in psychology research, named MBTI. Using personality prediction experiments,
we (i) study the task of personality prediction in itself on our corpus as well
as on TwiSty, a Twitter dataset also annotated with MBTI labels; (ii) carry out
an extensive, in-depth analysis of the features used by the classifier, and
view them specifically under the light of the original theory that we used to
create the corpus in the first place. We observe that no single model is best
at personality detection, and that while some traits are easier than others to
detect, and also to match back to theory, for other, less frequent traits the
picture is much more blurred.
Related papers
- Revealing Personality Traits: A New Benchmark Dataset for Explainable Personality Recognition on Dialogues [63.936654900356004]
Personality recognition aims to identify the personality traits implied in user data such as dialogues and social media posts.
We propose a novel task named Explainable Personality Recognition, aiming to reveal the reasoning process as supporting evidence of the personality trait.
arXiv Detail & Related papers (2024-09-29T14:41:43Z) - LLMvsSmall Model? Large Language Model Based Text Augmentation Enhanced
Personality Detection Model [58.887561071010985]
Personality detection aims to detect one's personality traits underlying in social media posts.
Most existing methods learn post features directly by fine-tuning the pre-trained language models.
We propose a large language model (LLM) based text augmentation enhanced personality detection model.
arXiv Detail & Related papers (2024-03-12T12:10:18Z) - PsyCoT: Psychological Questionnaire as Powerful Chain-of-Thought for
Personality Detection [50.66968526809069]
We propose a novel personality detection method, called PsyCoT, which mimics the way individuals complete psychological questionnaires in a multi-turn dialogue manner.
Our experiments demonstrate that PsyCoT significantly improves the performance and robustness of GPT-3.5 in personality detection.
arXiv Detail & Related papers (2023-10-31T08:23:33Z) - InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews [57.04431594769461]
This paper introduces a novel perspective to evaluate the personality fidelity of RPAs with psychological scales.
Experiments include various types of RPAs and LLMs, covering 32 distinct characters on 14 widely used psychological scales.
With InCharacter, we show that state-of-the-art RPAs exhibit personalities highly aligned with the human-perceived personalities of the characters, achieving an accuracy up to 80.7%.
arXiv Detail & Related papers (2023-10-27T08:42:18Z) - Editing Personality for Large Language Models [73.59001811199823]
This paper introduces an innovative task focused on editing the personality traits of Large Language Models (LLMs)
We construct PersonalityEdit, a new benchmark dataset to address this task.
arXiv Detail & Related papers (2023-10-03T16:02:36Z) - Identifying and Manipulating the Personality Traits of Language Models [9.213700601337383]
We investigate whether perceived personality in language models is exhibited consistently in their language generation.
We show that language models such as BERT and GPT2 can consistently identify and reflect personality markers in different contexts.
This behavior illustrates an ability to be manipulated in a highly predictable way, and frames them as tools for identifying personality traits and controlling personas in applications such as dialog systems.
arXiv Detail & Related papers (2022-12-20T14:24:11Z) - MBTI Personality Prediction for Fictional Characters Using Movie Scripts [34.24896499537589]
We construct a benchmark, Story2Personality, to predict a movie character's MBTI or Big 5 personality types.
Experiments show that our task is challenging for the existing text classification models.
We propose a multi-view model for personality prediction using both verbal and non-verbal descriptions.
arXiv Detail & Related papers (2022-10-20T03:41:07Z) - Exploring Personality and Online Social Engagement: An Investigation of
MBTI Users on Twitter [0.0]
We investigate 3848 profiles from Twitter with self-labeled Myers-Briggs personality traits (MBTI)
We leverage BERT, a state-of-the-art NLP architecture based on deep learning, to analyze various sources of text that hold most predictive power for our task.
We find that biographies, statuses, and liked tweets contain significant predictive power for all dimensions of the MBTI system.
arXiv Detail & Related papers (2021-09-14T02:26:30Z) - Personal-ITY: A Novel YouTube-based Corpus for Personality Prediction in
Italian [11.38723572165938]
We present a novel corpus for personality prediction in Italian, containing a larger number of authors and a different genre compared to previously available resources.
The corpus is built exploiting Distant Supervision, assigning Myers-Briggs Type Indicator (MBTI) labels to YouTube comments, and can lend itself to a variety of experiments.
We report on preliminary experiments on Personal-ITY, which can serve as a baseline for future work, showing that some types are easier to predict than others, and discussing the perks of cross-dataset prediction.
arXiv Detail & Related papers (2020-11-11T10:51:07Z) - Vyaktitv: A Multimodal Peer-to-Peer Hindi Conversations based Dataset
for Personality Assessment [50.15466026089435]
We present a novel peer-to-peer Hindi conversation dataset- Vyaktitv.
It consists of high-quality audio and video recordings of the participants, with Hinglish textual transcriptions for each conversation.
The dataset also contains a rich set of socio-demographic features, like income, cultural orientation, amongst several others, for all the participants.
arXiv Detail & Related papers (2020-08-31T17:44:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.