Schema-Free Dependency Parsing via Sequence Generation
- URL: http://arxiv.org/abs/2201.12407v1
- Date: Fri, 28 Jan 2022 20:32:04 GMT
- Title: Schema-Free Dependency Parsing via Sequence Generation
- Authors: Boda Lin, Zijun Yao, Jiaxin Shi, Shulin Cao, Binghao Tang, Si Li, Yong
Luo, Juanzi Li, Lei Hou
- Abstract summary: Dependency parsing aims to extract syntactic dependency structure or semantic dependency structure for sentences.
We propose to achieve universal and schema-free Dependency Parsing (DP) via Sequence Generation (SG)
- Score: 33.58097441488377
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Dependency parsing aims to extract syntactic dependency structure or semantic
dependency structure for sentences. Existing methods suffer the drawbacks of
lacking universality or highly relying on the auxiliary decoder. To remedy
these drawbacks, we propose to achieve universal and schema-free Dependency
Parsing (DP) via Sequence Generation (SG) DPSG by utilizing only the
pre-trained language model (PLM) without any auxiliary structures or parsing
algorithms. We first explore different serialization designing strategies for
converting parsing structures into sequences. Then we design dependency units
and concatenate these units into the sequence for DPSG. Thanks to the high
flexibility of the sequence generation, our DPSG can achieve both syntactic DP
and semantic DP using a single model. By concatenating the prefix to indicate
the specific schema with the sequence, our DPSG can even accomplish
multi-schemata parsing. The effectiveness of our DPSG is demonstrated by the
experiments on widely used DP benchmarks, i.e., PTB, CODT, SDP15, and
SemEval16. DPSG achieves comparable results with the first-tier methods on all
the benchmarks and even the state-of-the-art (SOTA) performance in CODT and
SemEval16. This paper demonstrates our DPSG has the potential to be a new
parsing paradigm. We will release our codes upon acceptance.
Related papers
- Differentially Private Retrieval-Augmented Generation [13.622078883013442]
Retrieval-augmented generation (RAG) is a widely used framework for reducing hallucinations in large language models (LLMs)<n>RAG poses serious privacy risks when the database contains sensitive corpora, such as medical records or legal documents.<n>We present DP-KSA, a novel privacy-preserving RAG algorithm that integrates DP using the propose-test-release paradigm.
arXiv Detail & Related papers (2026-02-16T00:52:57Z) - GS-KAN: Parameter-Efficient Kolmogorov-Arnold Networks via Sprecher-Type Shared Basis Functions [0.0]
We propose GS-KAN (Generalized Sprecher-KAN), a lightweight architecture inspired by David Sprecher's refinement of the superposition theorem.<n> GS-KAN constructs unique edge functions by applying learnable linear transformations to a single learnable, shared parent function per layer.<n>Our results demonstrate that GS-KAN outperforms both approximations and standard KAN baselines on continuous function tasks while maintaining superior parameter efficiency.
arXiv Detail & Related papers (2025-12-09T19:56:36Z) - Spanning Tree Autoregressive Visual Generation [51.7635842702602]
We present Spanning Tree Autoregressive (STAR) modeling, which can incorporate prior knowledge of images, such as center bias and locality, to maintain sampling performance.
arXiv Detail & Related papers (2025-11-21T09:45:17Z) - Every Step Counts: Decoding Trajectories as Authorship Fingerprints of dLLMs [63.82840470917859]
We show that the decoding mechanism of dLLMs can be used as a powerful tool for model attribution.<n>We propose a novel information extraction scheme called the Directed Decoding Map (DDM), which captures structural relationships between decoding steps and better reveals model-specific behaviors.
arXiv Detail & Related papers (2025-10-02T06:25:10Z) - Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs [1.088291253486435]
We propose a novel step-by-step instruction strategy, where universal part-of-speech tagging precedes the prediction of syntactic heads and dependency labels.<n>Our method achieves state-of-the-art accuracy on Universal Dependencies datasets across 17 languages without hallucination or contamination.
arXiv Detail & Related papers (2025-06-11T17:56:10Z) - GEML: A Grammar-based Evolutionary Machine Learning Approach for
Design-Pattern Detection [7.018591019975254]
Design patterns (DPs) are recognised as a good practice in software development.
The lack of appropriate documentation often hampers traceability, and their benefits are blurred among thousands of lines of code.
We propose GEML, a novel detection approach based on evolutionary machine learning using software properties of diverse nature.
arXiv Detail & Related papers (2024-01-13T11:05:24Z) - Compositional Program Generation for Few-Shot Systematic Generalization [59.57656559816271]
This study on a neuro-symbolic architecture called the Compositional Program Generator (CPG)
CPG has three key features: textitmodularity, textitcomposition, and textitabstraction, in the form of grammar rules.
It perfect achieves generalization on both the SCAN and COGS benchmarks using just 14 examples for SCAN and 22 examples for COGS.
arXiv Detail & Related papers (2023-09-28T14:33:20Z) - Generalizing DP-SGD with Shuffling and Batch Clipping [21.55827140532476]
DP-SGD implements individual clipping with random subsampling, which forces a mini-batch SGD approach.
We provide a general differential private algorithmic framework that goes beyond DP-SGD and allows any possible first order summings.
We show a $sqrtg E$ DP dependency for batch clipping with shuffling.
arXiv Detail & Related papers (2022-12-12T09:43:26Z) - Automatic Clipping: Differentially Private Deep Learning Made Easier and
Stronger [39.93710312222771]
Per-example clipping is a key algorithmic step that enables practical differential private (DP) training for deep learning models.
We propose an easy-to-use replacement, called automatic clipping, that eliminates the need to tune R for any DPs.
arXiv Detail & Related papers (2022-06-14T19:49:44Z) - Structure-aware Fine-tuning of Sequence-to-sequence Transformers for
Transition-based AMR Parsing [20.67024416678313]
We explore the integration of general pre-trained sequence-to-sequence language models and a structure-aware transition-based approach.
We propose a simplified transition set, designed to better exploit pre-trained language models for structured fine-tuning.
We show that the proposed parsing architecture retains the desirable properties of previous transition-based approaches, while being simpler and reaching the new state of the art for AMR 2.0, without the need for graph re-categorization.
arXiv Detail & Related papers (2021-10-29T04:36:31Z) - Layout-to-Image Translation with Double Pooling Generative Adversarial
Networks [76.83075646527521]
We propose a novel Double Pooing GAN (DPGAN) for generating photo-realistic and semantically-consistent results from the input layout.
We also propose a novel Double Pooling Module (DPM), which consists of the Square-shape Pooling Module (SPM) and the Rectangle-shape Pooling Module ( RPM)
arXiv Detail & Related papers (2021-08-29T19:55:14Z) - Auto-Parsing Network for Image Captioning and Visual Question Answering [101.77688388554097]
We propose an Auto-Parsing Network (APN) to discover and exploit the input data's hidden tree structures.
Specifically, we impose a Probabilistic Graphical Model (PGM) parameterized by the attention operations on each self-attention layer to incorporate sparse assumption.
arXiv Detail & Related papers (2021-08-24T08:14:35Z) - Head-driven Phrase Structure Parsing in O($n^3$) Time Complexity [48.683350567504604]
Constituent and dependency parsing, the two classic forms of syntactic parsing, have been found to benefit from joint training and decoding under a uniform formalism.
We propose an improved head scorer that helps achieve a novel performance-preserved in $O$($n3$) time complexity.
arXiv Detail & Related papers (2021-05-20T15:33:51Z) - Span-based Semantic Parsing for Compositional Generalization [53.24255235340056]
SpanBasedSP predicts a span tree over an input utterance, explicitly encoding how partial programs compose over spans in the input.
On GeoQuery, SCAN and CLOSURE, SpanBasedSP performs similarly to strong seq2seq baselines on random splits, but dramatically improves performance compared to baselines on splits that require compositional generalization.
arXiv Detail & Related papers (2020-09-13T16:42:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.