Making Better Use of Bilingual Information for Cross-Lingual AMR Parsing
- URL: http://arxiv.org/abs/2106.04814v1
- Date: Wed, 9 Jun 2021 05:14:54 GMT
- Title: Making Better Use of Bilingual Information for Cross-Lingual AMR Parsing
- Authors: Yitao Cai, Zhe Lin and Xiaojun Wan
- Abstract summary: We argue that the misprediction of concepts is due to the high relevance between English tokens and AMR concepts.
We introduce bilingual input, namely the translated texts as well as non-English texts, in order to enable the model to predict more accurate concepts.
- Score: 88.08581016329398
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Abstract Meaning Representation (AMR) is a rooted, labeled, acyclic graph
representing the semantics of natural language. As previous works show,
although AMR is designed for English at first, it can also represent semantics
in other languages. However, they find that concepts in their predicted AMR
graphs are less specific. We argue that the misprediction of concepts is due to
the high relevance between English tokens and AMR concepts. In this work, we
introduce bilingual input, namely the translated texts as well as non-English
texts, in order to enable the model to predict more accurate concepts. Besides,
we also introduce an auxiliary task, requiring the decoder to predict the
English sequences at the same time. The auxiliary task can help the decoder
understand what exactly the corresponding English tokens are. Our proposed
cross-lingual AMR parser surpasses previous state-of-the-art parser by 10.6
points on Smatch F1 score. The ablation study also demonstrates the efficacy of
our proposed modules.
Related papers
- Analyzing the Role of Semantic Representations in the Era of Large Language Models [104.18157036880287]
We investigate the role of semantic representations in the era of large language models (LLMs)
We propose an AMR-driven chain-of-thought prompting method, which we call AMRCoT.
We find that it is difficult to predict which input examples AMR may help or hurt on, but errors tend to arise with multi-word expressions.
arXiv Detail & Related papers (2024-05-02T17:32:59Z) - "You Are An Expert Linguistic Annotator": Limits of LLMs as Analyzers of
Abstract Meaning Representation [60.863629647985526]
We examine the successes and limitations of the GPT-3, ChatGPT, and GPT-4 models in analysis of sentence meaning structure.
We find that models can reliably reproduce the basic format of AMR, and can often capture core event, argument, and modifier structure.
Overall, our findings indicate that these models out-of-the-box can capture aspects of semantic structure, but there remain key limitations in their ability to support fully accurate semantic analyses or parses.
arXiv Detail & Related papers (2023-10-26T21:47:59Z) - Widely Interpretable Semantic Representation: Frameless Meaning
Representation for Broader Applicability [10.710058244695128]
This paper presents a novel semantic representation, WISeR, that overcomes challenges for Abstract Meaning Representation (AMR)
Despite its strengths, AMR is not easily applied to languages or domains without predefined semantic frames.
We create a new corpus of 1K English dialogue sentences in both WISeR and AMR WISeR.
arXiv Detail & Related papers (2023-09-12T17:44:40Z) - An AMR-based Link Prediction Approach for Document-level Event Argument
Extraction [51.77733454436013]
Recent works have introduced Abstract Meaning Representation (AMR) for Document-level Event Argument Extraction (Doc-level EAE)
This work reformulates EAE as a link prediction problem on AMR graphs.
We propose a novel graph structure, Tailored AMR Graph (TAG), which compresses less informative subgraphs and edge types, integrates span information, and highlights surrounding events in the same document.
arXiv Detail & Related papers (2023-05-30T16:07:48Z) - Retrofitting Multilingual Sentence Embeddings with Abstract Meaning
Representation [70.58243648754507]
We introduce a new method to improve existing multilingual sentence embeddings with Abstract Meaning Representation (AMR)
Compared with the original textual input, AMR is a structured semantic representation that presents the core concepts and relations in a sentence explicitly and unambiguously.
Experiment results show that retrofitting multilingual sentence embeddings with AMR leads to better state-of-the-art performance on both semantic similarity and transfer tasks.
arXiv Detail & Related papers (2022-10-18T11:37:36Z) - SBERT studies Meaning Representations: Decomposing Sentence Embeddings
into Explainable AMR Meaning Features [22.8438857884398]
We create similarity metrics that are highly effective, while also providing an interpretable rationale for their rating.
Our approach works in two steps: We first select AMR graph metrics that measure meaning similarity of sentences with respect to key semantic facets.
Second, we employ these metrics to induce Semantically Structured Sentence BERT embeddings, which are composed of different meaning aspects captured in different sub-spaces.
arXiv Detail & Related papers (2022-06-14T17:37:18Z) - Transition-based Abstract Meaning Representation Parsing with Contextual
Embeddings [0.0]
We study a way of combing two of the most successful routes to meaning of language--statistical language models and symbolic semantics formalisms--in the task of semantic parsing.
We explore the utility of incorporating pretrained context-aware word embeddings--such as BERT and RoBERTa--in the problem of parsing.
arXiv Detail & Related papers (2022-06-13T15:05:24Z) - Multilingual AMR Parsing with Noisy Knowledge Distillation [68.01173640691094]
We study multilingual AMR parsing from the perspective of knowledge distillation, where the aim is to learn and improve a multilingual AMR by using an existing English as its teacher.
We identify that noisy input and precise output are the key to successful distillation.
arXiv Detail & Related papers (2021-09-30T15:13:48Z) - Translate, then Parse! A strong baseline for Cross-Lingual AMR Parsing [10.495114898741205]
We develop models that project sentences from various languages onto their AMRs to capture their essential semantic structures.
In this paper, we revisit a simple two-step base-line, and enhance it with a strong NMT system and a strong AMR.
Our experiments show that T+P outperforms a recent state-of-the-art system across all tested languages.
arXiv Detail & Related papers (2021-06-08T17:52:48Z) - Parsing Indonesian Sentence into Abstract Meaning Representation using
Machine Learning Approach [0.0]
We develop a system that aims to parse an Indonesian sentence using a machine learning approach.
Our system consists of three steps: pair prediction, label prediction, and graph construction.
Our model achieved the SMATCH score of 0.820 for simple sentence test data.
arXiv Detail & Related papers (2021-03-05T15:01:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.