ERTIM@MC2: Diversified Argumentative Tweets Retrieval
- URL: http://arxiv.org/abs/2304.08047v1
- Date: Mon, 17 Apr 2023 08:06:17 GMT
- Title: ERTIM@MC2: Diversified Argumentative Tweets Retrieval
- Authors: K\'evin Deturck (ERTIM), Parantapa Goswami, Damien Nouvel (ERTIM),
Fr\'ed\'erique Segond (ERTIM)
- Abstract summary: It consists in detecting the most argumentative and diverse Tweets about some festivals in English and French from a massive multilingual collection.
An initial step filters the original dataset to fit the language and topic requirements of the task.
The final step extracts the most diverse arguments by clustering Tweets according to their textual content and selecting the most argumentative ones from each cluster.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we present our participation to CLEF MC2 2018 edition for the
task 2 Mining opinion argumentation. It consists in detecting the most
argumentative and diverse Tweets about some festivals in English and French
from a massive multilingual collection. We measure argumentativity of a Tweet
computing the amount of argumentation compounds it contains. We consider
argumentation compounds as a combination between opinion expression and its
support with facts and a particular structuration. Regarding diversity, we
consider the amount of festival aspects covered by Tweets. An initial step
filters the original dataset to fit the language and topic requirements of the
task. Then, we compute and integrate linguistic descriptors to detect claims
and their respective justifications in Tweets. The final step extracts the most
diverse arguments by clustering Tweets according to their textual content and
selecting the most argumentative ones from each cluster. We conclude the paper
describing the different ways we combined the descriptors among the different
runs we submitted and discussing their results.
Related papers
- PERSPECTRA: A Scalable and Configurable Pluralist Benchmark of Perspectives from Arguments [16.8677147128948]
PERSPECTRA is a pluralist benchmark for evaluating how well models represent, distinguish, and reason over multiple perspectives.<n>We construct 3,810 enriched arguments spanning 762 pro/con stances on 100 controversial topics.<n>Each opinion is expanded to multiple naturalistic variants, enabling robust evaluation of pluralism.
arXiv Detail & Related papers (2026-02-09T14:25:07Z) - ARQUSUMM: Argument-aware Quantitative Summarization of Online Conversations [11.33923212079359]
We propose a novel task of argument-aware quantitative summarization to reveal the claim-reason structure of arguments in conversations.<n>For quantitative summarization, ARQUSUMM employs argument structure-aware clustering algorithms to aggregate arguments and quantify their support.
arXiv Detail & Related papers (2025-11-21T06:37:32Z) - CERD: A Comprehensive Chinese Rhetoric Dataset for Rhetorical Understanding and Generation in Essays [30.728539221991188]
Existing rhetorical datasets or corpora primarily focus on single coarse-grained categories or fine-grained categories.
We propose the Chinese Essay Rhetoric dataset (CERD), consisting of 4 commonly used coarse-grained categories.
CERD is a manually annotated and comprehensive Chinese rhetoric dataset with five interrelated sub-tasks.
arXiv Detail & Related papers (2024-09-29T12:47:25Z) - Uncovering Differences in Persuasive Language in Russian versus English Wikipedia [40.61046400448044]
We study how differences in persuasive language across Wikipedia articles, written in either English and Russian, can uncover each culture's distinct perspective on different subjects.
We develop a large language model (LLM) powered system to identify instances of persuasive language in multilingual texts.
arXiv Detail & Related papers (2024-09-27T21:23:19Z) - PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis [74.41260927676747]
This paper bridges the gaps by introducing a multimodal conversational Sentiment Analysis (ABSA)
To benchmark the tasks, we construct PanoSent, a dataset annotated both manually and automatically, featuring high quality, large scale, multimodality, multilingualism, multi-scenarios, and covering both implicit and explicit sentiment elements.
To effectively address the tasks, we devise a novel Chain-of-Sentiment reasoning framework, together with a novel multimodal large language model (namely Sentica) and a paraphrase-based verification mechanism.
arXiv Detail & Related papers (2024-08-18T13:51:01Z) - Argue with Me Tersely: Towards Sentence-Level Counter-Argument
Generation [62.069374456021016]
We present the ArgTersely benchmark for sentence-level counter-argument generation.
We also propose Arg-LlaMA for generating high-quality counter-argument.
arXiv Detail & Related papers (2023-12-21T06:51:34Z) - Explainable Topic-Enhanced Argument Mining from Heterogeneous Sources [33.62800469391487]
Given a controversial target such as nuclear energy'', argument mining aims to identify the argumentative text from heterogeneous sources.
Current approaches focus on exploring better ways of integrating the target-associated semantic information with the argumentative text.
We propose a novel explainable topic-enhanced argument mining approach.
arXiv Detail & Related papers (2023-07-22T17:26:55Z) - Retrofitting Multilingual Sentence Embeddings with Abstract Meaning
Representation [70.58243648754507]
We introduce a new method to improve existing multilingual sentence embeddings with Abstract Meaning Representation (AMR)
Compared with the original textual input, AMR is a structured semantic representation that presents the core concepts and relations in a sentence explicitly and unambiguously.
Experiment results show that retrofitting multilingual sentence embeddings with AMR leads to better state-of-the-art performance on both semantic similarity and transfer tasks.
arXiv Detail & Related papers (2022-10-18T11:37:36Z) - RuArg-2022: Argument Mining Evaluation [69.87149207721035]
This paper is a report of the organizers on the first competition of argumentation analysis systems dealing with Russian language texts.
A corpus containing 9,550 sentences (comments on social media posts) on three topics related to the COVID-19 pandemic was prepared.
The system that won the first place in both tasks used the NLI (Natural Language Inference) variant of the BERT architecture.
arXiv Detail & Related papers (2022-06-18T17:13:37Z) - IAM: A Comprehensive and Large-Scale Dataset for Integrated Argument
Mining Tasks [59.457948080207174]
In this work, we introduce a comprehensive and large dataset named IAM, which can be applied to a series of argument mining tasks.
Near 70k sentences in the dataset are fully annotated based on their argument properties.
We propose two new integrated argument mining tasks associated with the debate preparation process: (1) claim extraction with stance classification (CESC) and (2) claim-evidence pair extraction (CEPE)
arXiv Detail & Related papers (2022-03-23T08:07:32Z) - Reinforcement Learning-based Dialogue Guided Event Extraction to Exploit
Argument Relations [70.35379323231241]
This paper presents a better approach for event extraction by explicitly utilizing the relationships of event arguments.
We employ reinforcement learning and incremental learning to extract multiple arguments via a multi-turned, iterative process.
Experimental results show that our approach consistently outperforms seven state-of-the-art event extraction methods.
arXiv Detail & Related papers (2021-06-23T13:24:39Z) - Great Service! Fine-grained Parsing of Implicit Arguments [7.785534704637891]
We show that certain types of implicit arguments are more difficult to parse than others.
This work will facilitate a better understanding of implicit and underspecified language, by incorporating it holistically into meaning representations.
arXiv Detail & Related papers (2021-06-04T15:50:35Z) - The Discussion Tracker Corpus of Collaborative Argumentation [2.800857580710507]
The Discussion Tracker corpus was collected in American high school English classes.
The corpus consists of 29 multi-party discussions of English literature transcribed from 985 minutes of audio.
arXiv Detail & Related papers (2020-05-22T18:27:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.