Overview of the VLSP 2023 -- ComOM Shared Task: A Data Challenge for
Comparative Opinion Mining from Vietnamese Product Reviews
- URL: http://arxiv.org/abs/2402.13613v2
- Date: Mon, 4 Mar 2024 21:20:57 GMT
- Title: Overview of the VLSP 2023 -- ComOM Shared Task: A Data Challenge for
Comparative Opinion Mining from Vietnamese Product Reviews
- Authors: Hoang-Quynh Le, Duy-Cat Can, Khanh-Vinh Nguyen and Mai-Vu Tran
- Abstract summary: This paper presents a comprehensive overview of the Comparative Opinion Mining from Vietnamese Product Reviews shared task (ComOM)
The primary objective of this shared task is to advance the field of natural language processing by developing techniques that proficiently extract comparative opinions from Vietnamese product reviews.
We construct a human-annotated dataset comprising $120$ documents, encompassing $7427$ non-comparative sentences and $2468$ comparisons within $1798$ sentences.
- Score: 0.6827423171182151
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents a comprehensive overview of the Comparative Opinion
Mining from Vietnamese Product Reviews shared task (ComOM), held as part of the
10$^{th}$ International Workshop on Vietnamese Language and Speech Processing
(VLSP 2023). The primary objective of this shared task is to advance the field
of natural language processing by developing techniques that proficiently
extract comparative opinions from Vietnamese product reviews. Participants are
challenged to propose models that adeptly extract a comparative "quintuple"
from a comparative sentence, encompassing Subject, Object, Aspect, Predicate,
and Comparison Type Label. We construct a human-annotated dataset comprising
$120$ documents, encompassing $7427$ non-comparative sentences and $2468$
comparisons within $1798$ sentences. Participating models undergo evaluation
and ranking based on the Exact match macro-averaged quintuple F1 score.
Related papers
- GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human [71.42669028683741]
We present a shared task on binary machine generated text detection conducted as a part of the GenAI workshop at COLING 2025.
The task consists of two subtasks: Monolingual (English) and Multilingual.
We provide a comprehensive overview of the data, a summary of the results, detailed descriptions of the participating systems, and an in-depth analysis of submissions.
arXiv Detail & Related papers (2025-01-19T11:11:55Z) - ViBidirectionMT-Eval: Machine Translation for Vietnamese-Chinese and Vietnamese-Lao language pair [0.0]
The paper presents the results of the VLSP 2022-2023 Machine Translation Shared Tasks.
The tasks were organized as part of the 9th, 10th annual workshop on Vietnamese Language and Speech Processing.
The objective of the shared task was to build machine translation systems, specifically targeting Vietnamese-Chinese and Vietnamese-Lao translation.
arXiv Detail & Related papers (2025-01-15T06:40:26Z) - Navigating Text-to-Image Generative Bias across Indic Languages [53.92640848303192]
This research investigates biases in text-to-image (TTI) models for the Indic languages widely spoken across India.
It evaluates and compares the generative performance and cultural relevance of leading TTI models in these languages against their performance in English.
arXiv Detail & Related papers (2024-08-01T04:56:13Z) - Narrative Action Evaluation with Prompt-Guided Multimodal Interaction [60.281405999483]
Narrative action evaluation (NAE) aims to generate professional commentary that evaluates the execution of an action.
NAE is a more challenging task because it requires both narrative flexibility and evaluation rigor.
We propose a prompt-guided multimodal interaction framework to facilitate the interaction between different modalities of information.
arXiv Detail & Related papers (2024-04-22T17:55:07Z) - Unveiling Comparative Sentiments in Vietnamese Product Reviews: A
Sequential Classification Framework [2.235716381266672]
We propose an approach that consists of solving three sequential sub-tasks: identifying comparative sentence, extracting comparative elements, and classifying comparison types.
Our method is ranked fifth at the Vietnamese Language and Speech Processing (VLSP) 2023 challenge on Comparative Opinion Mining (ComOM) from Vietnamese Product Reviews.
arXiv Detail & Related papers (2024-01-02T08:58:01Z) - ComOM at VLSP 2023: A Dual-Stage Framework with BERTology and Unified
Multi-Task Instruction Tuning Model for Vietnamese Comparative Opinion Mining [0.6522338519818377]
The ComOM shared task aims to extract comparative opinions from product reviews in Vietnamese language.
We propose a two-stage system based on fine-tuning a BERTology model for the CSI task and unified multi-task instruction tuning for the CEE task.
Experimental results show that our approach outperforms the other competitors and has achieved the top score on the official private test.
arXiv Detail & Related papers (2023-12-14T14:44:59Z) - Overview of the VLSP 2022 -- Abmusu Shared Task: A Data Challenge for
Vietnamese Abstractive Multi-document Summarization [0.6827423171182151]
The goal of Abmusu shared task is to develop summarization systems that could create abstractive summaries automatically for a set of documents on a topic.
We build a human-annotated dataset of 1,839 documents in 600 clusters, collected from Vietnamese news in 8 categories.
Models are evaluated and ranked in terms of textttROUGE2-F1 score, the most typical evaluation metric for document summarization problem.
arXiv Detail & Related papers (2023-11-27T04:01:13Z) - ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich
Document Images [198.35937007558078]
The competition opened on 30th December, 2022 and closed on 24th March, 2023.
There are 35 participants and 91 valid submissions received for Track 1, and 15 participants and 26 valid submissions received for Track 2.
According to the performance of the submissions, we believe there is still a large gap on the expected information extraction performance for complex and zero-shot scenarios.
arXiv Detail & Related papers (2023-06-05T22:20:52Z) - RuArg-2022: Argument Mining Evaluation [69.87149207721035]
This paper is a report of the organizers on the first competition of argumentation analysis systems dealing with Russian language texts.
A corpus containing 9,550 sentences (comments on social media posts) on three topics related to the COVID-19 pandemic was prepared.
The system that won the first place in both tasks used the NLI (Natural Language Inference) variant of the BERT architecture.
arXiv Detail & Related papers (2022-06-18T17:13:37Z) - IAM: A Comprehensive and Large-Scale Dataset for Integrated Argument
Mining Tasks [59.457948080207174]
In this work, we introduce a comprehensive and large dataset named IAM, which can be applied to a series of argument mining tasks.
Near 70k sentences in the dataset are fully annotated based on their argument properties.
We propose two new integrated argument mining tasks associated with the debate preparation process: (1) claim extraction with stance classification (CESC) and (2) claim-evidence pair extraction (CEPE)
arXiv Detail & Related papers (2022-03-23T08:07:32Z) - COMPARE: A Taxonomy and Dataset of Comparison Discussions in Peer
Reviews [9.838034994804124]
We present a dataset of comparison discussions in peer reviews of research papers in the domain of experimental deep learning.
We build a taxonomy of categories in comparison discussions and present a detailed annotation scheme to analyze this.
Overall, we annotate 117 reviews covering 1,800 sentences and report a maximum F1 Score of 0.49.
arXiv Detail & Related papers (2021-08-09T21:24:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.