Sister Help: Data Augmentation for Frame-Semantic Role Labeling
- URL: http://arxiv.org/abs/2109.07725v1
- Date: Thu, 16 Sep 2021 05:15:29 GMT
- Title: Sister Help: Data Augmentation for Frame-Semantic Role Labeling
- Authors: Ayush Pancholy, Miriam R. L. Petruck, Swabha Swayamdipta
- Abstract summary: We propose a data augmentation approach, which uses existing frame-specific annotation to automatically annotate other lexical units of the same frame which are unannotated.
We present experiments on frame-semantic role labeling which demonstrate the importance of this data augmentation.
- Score: 9.62264668211579
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: While FrameNet is widely regarded as a rich resource of semantics in natural
language processing, a major criticism concerns its lack of coverage and the
relative paucity of its labeled data compared to other commonly used lexical
resources such as PropBank and VerbNet. This paper reports on a pilot study to
address these gaps. We propose a data augmentation approach, which uses
existing frame-specific annotation to automatically annotate other lexical
units of the same frame which are unannotated. Our rule-based approach defines
the notion of a sister lexical unit and generates frame-specific augmented data
for training. We present experiments on frame-semantic role labeling which
demonstrate the importance of this data augmentation: we obtain a large
improvement to prior results on frame identification and argument
identification for FrameNet, utilizing both full-text and lexicographic
annotations under FrameNet. Our findings on data augmentation highlight the
value of automatic resource creation for improved models in frame-semantic
parsing.
Related papers
- Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling [81.69474860607542]
We present Openstory++, a large-scale dataset combining additional instance-level annotations with both images and text.
We also present Cohere-Bench, a pioneering benchmark framework for evaluating the image generation tasks when long multimodal context is provided.
arXiv Detail & Related papers (2024-08-07T11:20:37Z) - FS-RAG: A Frame Semantics Based Approach for Improved Factual Accuracy in Large Language Models [2.1484130681985047]
We present a novel extension to Retrieval Augmented Generation with the goal of mitigating factual inaccuracies in the output of large language models.
Our method draws on the cognitive linguistic theory of frame semantics for the indexing and retrieval of factual information relevant to helping large language models answer queries.
arXiv Detail & Related papers (2024-06-23T17:18:19Z) - Acquiring Frame Element Knowledge with Deep Metric Learning for Semantic
Frame Induction [24.486546938073907]
We propose a method that applies deep metric learning to semantic frame induction tasks.
A pre-trained language model is fine-tuned to be suitable for distinguishing frame element roles.
Experimental results on FrameNet demonstrate that our method achieves substantially better performance than existing methods.
arXiv Detail & Related papers (2023-05-23T11:02:28Z) - Self-Sufficient Framework for Continuous Sign Language Recognition [75.60327502570242]
The goal of this work is to develop self-sufficient framework for Continuous Sign Language Recognition.
These include the need for complex multi-scale features such as hands, face, and mouth for understanding, and absence of frame-level annotations.
We propose Divide and Focus Convolution (DFConv) which extracts both manual and non-manual features without the need for additional networks or annotations.
DPLR propagates non-spiky frame-level pseudo-labels by combining the ground truth gloss sequence labels with the predicted sequence.
arXiv Detail & Related papers (2023-03-21T11:42:57Z) - Query Your Model with Definitions in FrameNet: An Effective Method for
Frame Semantic Role Labeling [43.58108941071302]
Frame Semantic Role Labeling (FSRL) identifies arguments and labels them with frame roles defined in FrameNet.
We propose a query-based framework named ArGument Extractor with Definitions in FrameNet (AGED) to mitigate these problems.
arXiv Detail & Related papers (2022-12-05T05:09:12Z) - A Double-Graph Based Framework for Frame Semantic Parsing [23.552054033442545]
Frame semantic parsing is a fundamental NLP task, which consists of three subtasks: frame identification, argument identification and role classification.
Most previous studies tend to neglect relations between different subtasks and arguments and pay little attention to ontological frame knowledge.
In this paper, we propose a Knowledge-guided semanticPK with Double-graph (KID)
Our experiments show KID outperforms the previous state-of-the-art method by up to 1.7 F1-score on two FrameNet datasets.
arXiv Detail & Related papers (2022-06-18T09:39:38Z) - FactGraph: Evaluating Factuality in Summarization with Semantic Graph
Representations [114.94628499698096]
We propose FactGraph, a method that decomposes the document and the summary into structured meaning representations (MRs)
MRs describe core semantic concepts and their relations, aggregating the main content in both document and summary in a canonical form, and reducing data sparsity.
Experiments on different benchmarks for evaluating factuality show that FactGraph outperforms previous approaches by up to 15%.
arXiv Detail & Related papers (2022-04-13T16:45:33Z) - Incorporating Constituent Syntax for Coreference Resolution [50.71868417008133]
We propose a graph-based method to incorporate constituent syntactic structures.
We also explore to utilise higher-order neighbourhood information to encode rich structures in constituent trees.
Experiments on the English and Chinese portions of OntoNotes 5.0 benchmark show that our proposed model either beats a strong baseline or achieves new state-of-the-art performance.
arXiv Detail & Related papers (2022-02-22T07:40:42Z) - Improving Image Captioning with Better Use of Captions [65.39641077768488]
We present a novel image captioning architecture to better explore semantics available in captions and leverage that to enhance both image representation and caption generation.
Our models first construct caption-guided visual relationship graphs that introduce beneficial inductive bias using weakly supervised multi-instance learning.
During generation, the model further incorporates visual relationships using multi-task learning for jointly predicting word and object/predicate tag sequences.
arXiv Detail & Related papers (2020-06-21T14:10:47Z) - SF-Net: Single-Frame Supervision for Temporal Action Localization [60.202516362976645]
Single-frame supervision introduces extra temporal action signals while maintaining low annotation overhead.
We propose a unified system called SF-Net to make use of such single-frame supervision.
SF-Net significantly improves upon state-of-the-art weakly-supervised methods in terms of both segment localization and single-frame localization.
arXiv Detail & Related papers (2020-03-15T15:06:01Z) - FrameAxis: Characterizing Microframe Bias and Intensity with Word
Embedding [8.278618225536807]
We propose FrameAxis, a method for characterizing documents by identifying the most relevant semantic axes ("microframes")
FrameAxis is designed to quantitatively tease out two important dimensions of how microframes are used in the text.
We demonstrate that microframes with the highest bias and intensity well align with sentiment, topic, and partisan spectrum by applying FrameAxis to multiple datasets from restaurant reviews to political news.
arXiv Detail & Related papers (2020-02-20T08:01:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.