"Sharks are not the threat humans are": Argument Component Segmentation
in School Student Essays
- URL: http://arxiv.org/abs/2103.04518v1
- Date: Mon, 8 Mar 2021 02:40:07 GMT
- Title: "Sharks are not the threat humans are": Argument Component Segmentation
in School Student Essays
- Authors: Tariq Alhindi and Debanjan Ghosh
- Abstract summary: We apply a token-level classification to identify claim and premise tokens from a new corpus of argumentative essays written by middle school students.
We demonstrate that a BERT-based multi-task learning architecture (i.e., token and sentence level classification) adaptively pretrained on a relevant unlabeled dataset obtains the best results.
- Score: 3.632177840361928
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Argument mining is often addressed by a pipeline method where segmentation of
text into argumentative units is conducted first and proceeded by an argument
component identification task. In this research, we apply a token-level
classification to identify claim and premise tokens from a new corpus of
argumentative essays written by middle school students. To this end, we compare
a variety of state-of-the-art models such as discrete features and deep
learning architectures (e.g., BiLSTM networks and BERT-based architectures) to
identify the argument components. We demonstrate that a BERT-based multi-task
learning architecture (i.e., token and sentence level classification)
adaptively pretrained on a relevant unlabeled dataset obtains the best results
Related papers
- DMON: A Simple yet Effective Approach for Argument Structure Learning [33.96187185638286]
Argument structure learning (ASL) entails predicting relations between arguments.
Despite its broad utilization, ASL remains a challenging task because it involves examining the complex relationships between the sentences in a potentially unstructured discourse.
We have developed a simple yet effective approach called Dual-tower Multi-scale cOnvolution neural Network(DMON) for the ASL task.
arXiv Detail & Related papers (2024-05-02T11:56:16Z) - Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic
Segmentation [59.37587762543934]
This paper studies the problem of weakly open-vocabulary semantic segmentation (WOVSS)
Existing methods suffer from a granularity inconsistency regarding the usage of group tokens.
We propose the prototypical guidance network (PGSeg) that incorporates multi-modal regularization.
arXiv Detail & Related papers (2023-10-29T13:18:00Z) - Multiview Identifiers Enhanced Generative Retrieval [78.38443356800848]
generative retrieval generates identifier strings of passages as the retrieval target.
We propose a new type of identifier, synthetic identifiers, that are generated based on the content of a passage.
Our proposed approach performs the best in generative retrieval, demonstrating its effectiveness and robustness.
arXiv Detail & Related papers (2023-05-26T06:50:21Z) - Learning Context-aware Classifier for Semantic Segmentation [88.88198210948426]
In this paper, contextual hints are exploited via learning a context-aware classifier.
Our method is model-agnostic and can be easily applied to generic segmentation models.
With only negligible additional parameters and +2% inference time, decent performance gain has been achieved on both small and large models.
arXiv Detail & Related papers (2023-03-21T07:00:35Z) - MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text
Generation [102.20036684996248]
We propose MURMUR, a neuro-symbolic modular approach to text generation from semi-structured data with multi-step reasoning.
We conduct experiments on two data-to-text generation tasks like WebNLG and LogicNLG.
arXiv Detail & Related papers (2022-12-16T17:36:23Z) - Multi-granularity Argument Mining in Legal Texts [0.913755431537592]
We conceptualize argument mining as a token-level (i.e., word-level) classification problem.
Results show that token-level text classification identifies certain legal argument elements more accurately than sentence-level text classification.
arXiv Detail & Related papers (2022-10-17T23:28:22Z) - Perturbations and Subpopulations for Testing Robustness in Token-Based
Argument Unit Recognition [6.502694770864571]
Argument Unit Recognition and Classification aims at identifying argument units from text and classifying them as pro or against.
One of the design choices that need to be made when developing systems for this task is what the unit of classification should be: segments of tokens or full sentences.
Previous research suggests that fine-tuning language models on the token-level yields more robust results for classifying sentences compared to training on sentences directly.
We reproduce the study that originally made this claim and further investigate what exactly token-based systems learned better compared to sentence-based ones.
arXiv Detail & Related papers (2022-09-29T13:44:28Z) - RuArg-2022: Argument Mining Evaluation [69.87149207721035]
This paper is a report of the organizers on the first competition of argumentation analysis systems dealing with Russian language texts.
A corpus containing 9,550 sentences (comments on social media posts) on three topics related to the COVID-19 pandemic was prepared.
The system that won the first place in both tasks used the NLI (Natural Language Inference) variant of the BERT architecture.
arXiv Detail & Related papers (2022-06-18T17:13:37Z) - Diversity Over Size: On the Effect of Sample and Topic Sizes for Topic-Dependent Argument Mining Datasets [49.65208986436848]
We investigate the effect of Argument Mining dataset composition in few- and zero-shot settings.
Our findings show that, while fine-tuning is mandatory to achieve acceptable model performance, using carefully composed training samples and reducing the training sample size by up to almost 90% can still yield 95% of the maximum performance.
arXiv Detail & Related papers (2022-05-23T17:14:32Z) - Aspect-Based Argument Mining [2.3148470932285665]
We present the task of Aspect-Based Argument Mining (ABAM) with the essential subtasks of Aspect Term Extraction (ATE) and Nested Term Extraction (NS)
We consider aspects as the main point(s) argument units are addressing.
This information is important for further downstream tasks such as argument ranking, argument summarization and generation, as well as the search for counter-arguments on the aspect-level.
arXiv Detail & Related papers (2020-11-01T21:57:51Z) - Same Side Stance Classification Task: Facilitating Argument Stance
Classification by Fine-tuning a BERT Model [8.8896707993459]
The same side stance classification task provides a dataset of argument pairs classified by whether or not both arguments share the same stance.
We fine-tuned a pre-trained BERT model for three epochs and used the first 512 tokens of each argument to predict if two arguments share the same stance.
arXiv Detail & Related papers (2020-04-23T13:54:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.