SINAI at eRisk@CLEF 2022: Approaching Early Detection of Gambling and Eating Disorders with Natural Language Processing
- URL: http://arxiv.org/abs/2509.14806v1
- Date: Thu, 18 Sep 2025 10:03:31 GMT
- Title: SINAI at eRisk@CLEF 2022: Approaching Early Detection of Gambling and Eating Disorders with Natural Language Processing
- Authors: Alba Maria Marmol-Romero, Salud Maria Jimenez-Zafra, Flor Miriam Plaza-del-Arco, M. Dolores Molina-Gonzalez, Maria-Teresa Martin-Valdivia, Arturo Montejo-Raez,
- Abstract summary: This paper describes the participation of the SINAI team in the eRisk@CLEF lab.<n>The approach presented in Task 1 is based on the use of sentence embeddings from Transformers.<n>The approach for Task 3 is based on text similarity estimation using contextualized word embeddings from Transformers.
- Score: 3.303345937552717
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper describes the participation of the SINAI team in the eRisk@CLEF lab. Specifically, two of the proposed tasks have been addressed: i) Task 1 on the early detection of signs of pathological gambling, and ii) Task 3 on measuring the severity of the signs of eating disorders. The approach presented in Task 1 is based on the use of sentence embeddings from Transformers with features related to volumetry, lexical diversity, complexity metrics, and emotion-related scores, while the approach for Task 3 is based on text similarity estimation using contextualized word embeddings from Transformers. In Task 1, our team has been ranked in second position, with an F1 score of 0.808, out of 41 participant submissions. In Task 3, our team also placed second out of a total of 3 participating teams.
Related papers
- Generating Natural-Language Surgical Feedback: From Structured Representation to Domain-Grounded Evaluation [66.7752700084159]
High-quality feedback from a surgical trainer is pivotal for improving trainee performance and long-term skill acquisition.<n>We present a structure-aware pipeline that learns a surgical action ontology from real trainer-to-trainee transcripts.
arXiv Detail & Related papers (2025-11-19T06:19:34Z) - SINAI at eRisk@CLEF 2023: Approaching Early Detection of Gambling with Natural Language Processing [3.987649624343527]
This paper describes the participation of the SINAI team in the eRisk@CLEF lab.<n>One of the proposed tasks has been addressed: Task 2 on the early detection of signs of pathological gambling.<n>The approach presented in Task 2 is based on pre-trained models from Transformers architecture with comprehensive preprocessing data and data balancing techniques.
arXiv Detail & Related papers (2025-09-18T09:50:14Z) - GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human [71.42669028683741]
We present a shared task on binary machine generated text detection conducted as a part of the GenAI workshop at COLING 2025.<n>The task consists of two subtasks: Monolingual (English) and Multilingual.<n>We provide a comprehensive overview of the data, a summary of the results, detailed descriptions of the participating systems, and an in-depth analysis of submissions.
arXiv Detail & Related papers (2025-01-19T11:11:55Z) - GenAI Content Detection Task 2: AI vs. Human -- Academic Essay Authenticity Challenge [12.076440946525434]
The Academic Essay Authenticity Challenge was organized as part of the GenAI Content Detection shared tasks collocated with COLING 2025.<n>This challenge focuses on detecting machine-generated vs. human-authored essays for academic purposes.<n>The challenge involves two languages: English and Arabic.<n>This paper outlines the task formulation, details the dataset construction process, and explains the evaluation framework.
arXiv Detail & Related papers (2024-12-24T08:33:44Z) - Strategies to Harness the Transformers' Potential: UNSL at eRisk 2023 [0.9208007322096532]
The CLEF eRisk Laboratory explores solutions to different tasks related to risk detection on the Internet.
In the 2023 edition, Task 1 consisted of searching for symptoms of depression, the objective of which was to extract user writings according to their relevance to the BDI Questionnaire symptoms.
Task 2 was related to the problem of early detection of pathological gambling risks, where the participants had to detect users at risk as quickly as possible.
Task 3 consisted of estimating the severity levels of signs of eating disorders.
arXiv Detail & Related papers (2023-10-30T19:34:33Z) - Pushing the Limits of ChatGPT on NLP Tasks [79.17291002710517]
Despite the success of ChatGPT, its performances on most NLP tasks are still well below the supervised baselines.
In this work, we looked into the causes, and discovered that its subpar performance was caused by the following factors.
We propose a collection of general modules to address these issues, in an attempt to push the limits of ChatGPT on NLP tasks.
arXiv Detail & Related papers (2023-06-16T09:40:05Z) - mCPT at SemEval-2023 Task 3: Multilingual Label-Aware Contrastive
Pre-Training of Transformers for Few- and Zero-shot Framing Detection [63.540146992962526]
This paper presents the winning system for the zero-shot Spanish framing detection task.
Our solution employs a pre-training procedure based on multilingual Transformers.
In addition to describing the system, we perform an embedding space analysis and ablation study to demonstrate how our pre-training procedure supports framing detection.
arXiv Detail & Related papers (2023-03-17T11:33:06Z) - Task formulation for Extracting Social Determinants of Health from
Clinical Narratives [0.0]
The 2022 n2c2 NLP Challenge posed identification of social determinants of health in clinical narratives.
We present three systems that we developed for the Challenge and discuss the distinctive task formulation used in each of the three systems.
arXiv Detail & Related papers (2023-01-26T20:00:54Z) - Event Causality Identification with Causal News Corpus -- Shared Task 3,
CASE 2022 [3.0775142635531685]
Event Causality Identification Shared Task of CASE 2022 involved two subtasks.
Subtask 1 required participants to predict if a sentence contains a causal relation or not.
Subtask 2 required participants to identify the Cause, Effect and Signal spans per causal sentence.
arXiv Detail & Related papers (2022-11-22T10:34:09Z) - RuArg-2022: Argument Mining Evaluation [69.87149207721035]
This paper is a report of the organizers on the first competition of argumentation analysis systems dealing with Russian language texts.
A corpus containing 9,550 sentences (comments on social media posts) on three topics related to the COVID-19 pandemic was prepared.
The system that won the first place in both tasks used the NLI (Natural Language Inference) variant of the BERT architecture.
arXiv Detail & Related papers (2022-06-18T17:13:37Z) - An Iterative Multi-Knowledge Transfer Network for Aspect-Based Sentiment
Analysis [73.7488524683061]
We propose a novel Iterative Multi-Knowledge Transfer Network (IMKTN) for end-to-end ABSA.
Our IMKTN transfers the task-specific knowledge from any two of the three subtasks to another one at the token level by utilizing a well-designed routing algorithm.
Experimental results on three benchmark datasets demonstrate the effectiveness and superiority of our approach.
arXiv Detail & Related papers (2020-04-04T13:49:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.