Wikibio: a Semantic Resource for the Intersectional Analysis of
Biographical Events
- URL: http://arxiv.org/abs/2306.09505v1
- Date: Thu, 15 Jun 2023 20:59:37 GMT
- Title: Wikibio: a Semantic Resource for the Intersectional Analysis of
Biographical Events
- Authors: Marco Antonio Stranisci, Rossana Damiano, Enrico Mensa, Viviana Patti,
Daniele Radicioni, Tommaso Caselli
- Abstract summary: We present a new corpus annotated for biographical event detection.
The model was able to detect all mentions of the target-entity in a biography with an F-score of 0.808.
It was also used for performing an analysis of biases about women and non-Western people in Wikipedia biographies.
- Score: 3.8455936323976694
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Biographical event detection is a relevant task for the exploration and
comparison of the ways in which people's lives are told and represented. In
this sense, it may support several applications in digital humanities and in
works aimed at exploring bias about minoritized groups. Despite that, there are
no corpora and models specifically designed for this task. In this paper we
fill this gap by presenting a new corpus annotated for biographical event
detection. The corpus, which includes 20 Wikipedia biographies, was compared
with five existing corpora to train a model for the biographical event
detection task. The model was able to detect all mentions of the target-entity
in a biography with an F-score of 0.808 and the entity-related events with an
F-score of 0.859. Finally, the model was used for performing an analysis of
biases about women and non-Western people in Wikipedia biographies.
Related papers
- Causal Micro-Narratives [62.47217054314046]
We present a novel approach to classify causal micro-narratives from text.
These narratives are sentence-level explanations of the cause(s) and/or effect(s) of a target subject.
arXiv Detail & Related papers (2024-10-07T17:55:10Z) - GenderBias-\emph{VL}: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing [72.0343083866144]
This paper introduces the GenderBias-emphVL benchmark to evaluate occupation-related gender bias in Large Vision-Language Models.
Using our benchmark, we extensively evaluate 15 commonly used open-source LVLMs and state-of-the-art commercial APIs.
Our findings reveal widespread gender biases in existing LVLMs.
arXiv Detail & Related papers (2024-06-30T05:55:15Z) - BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once [58.41069132627823]
holistic image analysis comprises subtasks such as segmentation, detection, and recognition of relevant objects.
Here, we propose BiomedParse, a biomedical foundation model for imaging parsing that can jointly conduct segmentation, detection, and recognition for 82 object types across 9 imaging modalities.
Through joint learning, we can improve accuracy for individual tasks and enable novel applications such as segmenting all relevant objects in a noisy image through a text prompt.
arXiv Detail & Related papers (2024-05-21T17:54:06Z) - Gender Biases in Automatic Evaluation Metrics for Image Captioning [87.15170977240643]
We conduct a systematic study of gender biases in model-based evaluation metrics for image captioning tasks.
We demonstrate the negative consequences of using these biased metrics, including the inability to differentiate between biased and unbiased generations.
We present a simple and effective way to mitigate the metric bias without hurting the correlations with human judgments.
arXiv Detail & Related papers (2023-05-24T04:27:40Z) - PART: Pre-trained Authorship Representation Transformer [64.78260098263489]
Authors writing documents imprint identifying information within their texts: vocabulary, registry, punctuation, misspellings, or even emoji usage.
Previous works use hand-crafted features or classification tasks to train their authorship models, leading to poor performance on out-of-domain authors.
We propose a contrastively trained model fit to learn textbfauthorship embeddings instead of semantics.
arXiv Detail & Related papers (2022-09-30T11:08:39Z) - Guidelines and a Corpus for Extracting Biographical Events [1.181206257787103]
Our work challenges the limitation by providing a set of guidelines for the semantic annotation of life events.
The guidelines are designed to be interoperable with existing ISO-standards for semantic annotation: ISO-TimeML (ISO-24617-1), and SemAF (ISO-24617-4)
arXiv Detail & Related papers (2022-06-07T19:36:18Z) - Men Are Elected, Women Are Married: Events Gender Bias on Wikipedia [16.892630505629224]
We present the first event-centric study of gender biases in a Wikipedia corpus.
We detect events with a state-of-the-art event detection model, calibrate the results using strategically generated templates, and extract events that have asymmetric associations with genders.
arXiv Detail & Related papers (2021-06-03T05:22:16Z) - How True is GPT-2? An Empirical Analysis of Intersectional Occupational
Biases [50.591267188664666]
Downstream applications are at risk of inheriting biases contained in natural language models.
We analyze the occupational biases of a popular generative language model, GPT-2.
For a given job, GPT-2 reflects the societal skew of gender and ethnicity in the US, and in some cases, pulls the distribution towards gender parity.
arXiv Detail & Related papers (2021-02-08T11:10:27Z) - Controlled Analyses of Social Biases in Wikipedia Bios [27.591896251854724]
We present a methodology for reducing the effects of confounding variables in analyses of Wikipedia biography pages.
We evaluate our methodology by developing metrics to measure how well the comparison corpus aligns with the target corpus.
Our results show that failing to control for confounding variables can result in different conclusions and mask biases.
arXiv Detail & Related papers (2020-12-31T21:27:12Z) - Automatic Section Recognition in Obituaries [10.536415845097661]
We propose a statistical model which recognizes sections of obituaries.
We collect a corpus of 20058 English obituaries from TheDaily Item, Remembering.CA and The London Free Press.
Formulated as an automatic segmentation task, a convolutional neural network outperforms bag-of-words.
arXiv Detail & Related papers (2020-02-28T13:20:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.