Semantic Frame Induction using Masked Word Embeddings and Two-Step
Clustering
- URL: http://arxiv.org/abs/2105.13466v1
- Date: Thu, 27 May 2021 22:00:33 GMT
- Title: Semantic Frame Induction using Masked Word Embeddings and Two-Step
Clustering
- Authors: Kosuke Yamada, Ryohei Sasano, Koichi Takeda
- Abstract summary: We propose a semantic frame induction method using masked word embeddings and two-step clustering.
We demonstrate that using the masked word embeddings is effective for avoiding too much reliance on the surface information of frame-evoking verbs.
- Score: 9.93359829907774
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent studies on semantic frame induction show that relatively high
performance has been achieved by using clustering-based methods with
contextualized word embeddings. However, there are two potential drawbacks to
these methods: one is that they focus too much on the superficial information
of the frame-evoking verb and the other is that they tend to divide the
instances of the same verb into too many different frame clusters. To overcome
these drawbacks, we propose a semantic frame induction method using masked word
embeddings and two-step clustering. Through experiments on the English FrameNet
data, we demonstrate that using the masked word embeddings is effective for
avoiding too much reliance on the surface information of frame-evoking verbs
and that two-step clustering can improve the number of resulting frame clusters
for the instances of the same verb.
Related papers
- Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision [87.15580604023555]
Unpair-Seg is a novel weakly-supervised open-vocabulary segmentation framework.
It learns from unpaired image-mask and image-text pairs, which can be independently and efficiently collected.
It achieves 14.6% and 19.5% mIoU on the ADE-847 and PASCAL Context-459 datasets.
arXiv Detail & Related papers (2024-02-14T06:01:44Z) - Acquiring Frame Element Knowledge with Deep Metric Learning for Semantic
Frame Induction [24.486546938073907]
We propose a method that applies deep metric learning to semantic frame induction tasks.
A pre-trained language model is fine-tuned to be suitable for distinguishing frame element roles.
Experimental results on FrameNet demonstrate that our method achieves substantially better performance than existing methods.
arXiv Detail & Related papers (2023-05-23T11:02:28Z) - Semantic Frame Induction with Deep Metric Learning [24.486546938073907]
We propose a model that uses deep metric learning to fine-tune a contextualized embedding model.
We apply the fine-tuned contextualized embeddings to perform semantic frame induction.
arXiv Detail & Related papers (2023-04-27T15:46:09Z) - Associating Spatially-Consistent Grouping with Text-supervised Semantic
Segmentation [117.36746226803993]
We introduce self-supervised spatially-consistent grouping with text-supervised semantic segmentation.
Considering the part-like grouped results, we further adapt a text-supervised model from image-level to region-level recognition.
Our method achieves 59.2% mIoU and 32.4% mIoU on Pascal VOC and Pascal Context benchmarks.
arXiv Detail & Related papers (2023-04-03T16:24:39Z) - Adaptively Clustering Neighbor Elements for Image-Text Generation [78.82346492527425]
We propose a novel Transformer-based image-to-text generation model termed as textbfACF.
ACF adaptively clusters vision patches into object regions and language words into phrases to implicitly learn object-phrase alignments.
Experiment results demonstrate the effectiveness of ACF, which outperforms most SOTA captioning and VQA models.
arXiv Detail & Related papers (2023-01-05T08:37:36Z) - Betrayed by Captions: Joint Caption Grounding and Generation for Open
Vocabulary Instance Segmentation [80.48979302400868]
We focus on open vocabulary instance segmentation to expand a segmentation model to classify and segment instance-level novel categories.
Previous approaches have relied on massive caption datasets and complex pipelines to establish one-to-one mappings between image regions and captions in nouns.
We devise a joint textbfCaption Grounding and Generation (CGG) framework, which incorporates a novel grounding loss that only focuses on matching object to improve learning efficiency.
arXiv Detail & Related papers (2023-01-02T18:52:12Z) - Sister Help: Data Augmentation for Frame-Semantic Role Labeling [9.62264668211579]
We propose a data augmentation approach, which uses existing frame-specific annotation to automatically annotate other lexical units of the same frame which are unannotated.
We present experiments on frame-semantic role labeling which demonstrate the importance of this data augmentation.
arXiv Detail & Related papers (2021-09-16T05:15:29Z) - Verb Sense Clustering using Contextualized Word Representations for
Semantic Frame Induction [9.93359829907774]
Contextualized word representations have proven useful for various natural language processing tasks.
In this paper, we focus on verbs that evoke different frames depending on the context.
We investigate how well contextualized word representations can recognize the difference of frames that the same verb evokes.
arXiv Detail & Related papers (2021-05-27T21:53:40Z) - End-to-End Speaker Diarization as Post-Processing [64.12519350944572]
Clustering-based diarization methods partition frames into clusters of the number of speakers.
Some end-to-end diarization methods can handle overlapping speech by treating the problem as multi-label classification.
We propose to use a two-speaker end-to-end diarization method as post-processing of the results obtained by a clustering-based method.
arXiv Detail & Related papers (2020-12-18T05:31:07Z) - Causal Intervention for Weakly-Supervised Semantic Segmentation [122.1846968696862]
We aim to generate better pixel-level pseudo-masks by using only image-level labels.
We propose a structural causal model to analyze the causalities among images, contexts, and class labels.
Based on it, we develop a new method: Context Adjustment (CONTA), to remove the confounding bias in image-level classification.
arXiv Detail & Related papers (2020-09-26T09:26:29Z) - FrameAxis: Characterizing Microframe Bias and Intensity with Word
Embedding [8.278618225536807]
We propose FrameAxis, a method for characterizing documents by identifying the most relevant semantic axes ("microframes")
FrameAxis is designed to quantitatively tease out two important dimensions of how microframes are used in the text.
We demonstrate that microframes with the highest bias and intensity well align with sentiment, topic, and partisan spectrum by applying FrameAxis to multiple datasets from restaurant reviews to political news.
arXiv Detail & Related papers (2020-02-20T08:01:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.