Composable Text Controls in Latent Space with ODEs
- URL: http://arxiv.org/abs/2208.00638v3
- Date: Mon, 6 Nov 2023 06:21:40 GMT
- Title: Composable Text Controls in Latent Space with ODEs
- Authors: Guangyi Liu, Zeyu Feng, Yuan Gao, Zichao Yang, Xiaodan Liang, Junwei
Bao, Xiaodong He, Shuguang Cui, Zhen Li, Zhiting Hu
- Abstract summary: This paper proposes a new, efficient approach for composable text operations in the compact latent space of text.
By connecting pretrained LMs to the latent space through efficient adaptation, we decode the sampled vectors into desired text sequences.
Experiments show that composing these operators within our approach generates or edits high-quality text.
- Score: 97.12426987887021
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Real-world text applications often involve composing a wide range of text
control operations, such as editing the text w.r.t. an attribute, manipulating
keywords and structure, and generating new text of desired properties. Prior
work typically learns/finetunes a language model (LM) to perform individual or
specific subsets of operations. Recent research has studied combining
operations in a plug-and-play manner, often with costly search or optimization
in the complex sequence space. This paper proposes a new efficient approach for
composable text operations in the compact latent space of text. The
low-dimensionality and differentiability of the text latent vector allow us to
develop an efficient sampler based on ordinary differential equations (ODEs)
given arbitrary plug-in operators (e.g., attribute classifiers). By connecting
pretrained LMs (e.g., GPT2) to the latent space through efficient adaptation, we
then decode the sampled vectors into desired text sequences. The flexible
approach permits diverse control operators (sentiment, tense, formality,
keywords, etc.) acquired using any relevant data from different domains.
Experiments show that composing these operators within our approach generates
or edits high-quality text, substantially improving over previous methods in
generation quality and efficiency.
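
As a concrete illustration of the sampling scheme described above, the sketch below composes two plug-in attribute classifiers as energy functions over a latent vector and follows the resulting gradient-flow ODE with a plain Euler integrator. This is a minimal sketch under stated assumptions, not the authors' implementation: the paper builds its sampler on a diffusion-style probability-flow ODE, whereas a simpler gradient-flow ODE stands in here, and the classifier heads, latent dimensionality, and step count are all illustrative.

```python
import torch
import torch.nn as nn

LATENT_DIM = 64  # illustrative; the paper's latent size may differ

# Plug-in operators: any differentiable energy over the latent works.
# Here, two toy attribute classifiers (say, sentiment and tense).
sentiment_clf = nn.Linear(LATENT_DIM, 2)
tense_clf = nn.Linear(LATENT_DIM, 2)

def energy(z, targets):
    """Composed energy: the sum of classifier cross-entropies.
    Composing operators amounts to adding their energies."""
    e = torch.zeros(())
    for clf, target in targets:
        labels = torch.full((z.size(0),), target)
        e = e + nn.functional.cross_entropy(clf(z), labels)
    return e

def ode_sample(targets, steps=100, step_size=0.1):
    """Euler integration of dz/dt = -grad_z E(z): a gradient-flow ODE
    that moves a random latent toward the desired attributes."""
    z = torch.randn(1, LATENT_DIM)
    for _ in range(steps):
        z = z.detach().requires_grad_(True)
        grad = torch.autograd.grad(energy(z, targets), z)[0]
        z = z - step_size * grad
    return z.detach()

# Compose "positive sentiment" (class 1) with "past tense" (class 0).
z_star = ode_sample([(sentiment_clf, 1), (tense_clf, 0)])
print(z_star.shape)  # torch.Size([1, 64])
# z_star would then be decoded into text by the adapted LM (e.g., GPT2
# connected to the latent space); decoding is out of scope for this sketch.
```

Composition here is plain addition of energies, which is what makes the operators plug-and-play: any differentiable score over the latent can be added inside `energy` without retraining the others.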
Related papers
- Towards Unified Multi-granularity Text Detection with Interactive Attention [56.79437272168507]
"Detect Any Text" is an advanced paradigm that unifies scene text detection, layout analysis, and document page detection into a cohesive, end-to-end model.
A pivotal innovation in DAT is the across-granularity interactive attention module, which significantly enhances the representation learning of text instances.
Tests demonstrate that DAT achieves state-of-the-art performance across a variety of text-related benchmarks.
arXiv Detail & Related papers (2024-05-30T07:25:23Z)
- Successor Features for Efficient Multisubject Controlled Text Generation [48.37713738712319]
We introduce SF-GEN, which is grounded in two primary concepts: successor features (SFs) and language model rectification.
SF-GEN seamlessly integrates the two to enable dynamic steering of text generation with no need to alter the LLM's parameters.
To the best of our knowledge, our research represents the first application of successor features in text generation.
arXiv Detail & Related papers (2023-11-03T00:17:08Z)
- TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision [61.186488081379]
We propose TextFormer, a query-based end-to-end text spotter with Transformer architecture.
TextFormer builds upon an image encoder and a text decoder to learn a joint semantic understanding for multi-task modeling.
It allows for mutual training and optimization of classification, segmentation, and recognition branches, resulting in deeper feature sharing.
arXiv Detail & Related papers (2023-06-06T03:37:41Z)
- MacLaSa: Multi-Aspect Controllable Text Generation via Efficient Sampling from Compact Latent Space [110.85888003111653]
Multi-aspect controllable text generation aims to generate fluent sentences that possess multiple desired attributes simultaneously.
We introduce a novel approach for multi-aspect control, namely MacLaSa, that estimates a compact latent space for multiple aspects.
We show that MacLaSa outperforms several strong baselines on attribute relevance and textual quality while maintaining a high inference speed.
arXiv Detail & Related papers (2023-05-22T07:30:35Z)
- Text Revision by On-the-Fly Representation Optimization [76.11035270753757]
Current state-of-the-art methods formulate text revision tasks as sequence-to-sequence learning problems.
We present an iterative in-place editing approach for text revision, which requires no parallel data.
It achieves performance competitive with, and sometimes better than, state-of-the-art supervised methods on text simplification.
arXiv Detail & Related papers (2022-04-15T07:38:08Z)
- Control Prefixes for Text Generation [17.682443394199375]
We propose a dynamic method, Control Prefixes, which allows conditional, input-dependent information to be included in each prompt (see the sketch after this list).
We present state-of-the-art results on several data-to-text datasets, including WebNLG.
arXiv Detail & Related papers (2021-10-15T19:32:17Z)
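
To make the input-dependent prefix idea from the Control Prefixes entry concrete, here is a minimal sketch: a small table of learned prefix vectors, selected per example by a control label and prepended to the token embeddings before a frozen LM consumes them. The sizes, the three-way label set, and the batch below are illustrative assumptions, not details from that paper; note that prefix-tuning-style methods typically inject learned key-value prefixes at every attention layer, while this sketch stays at the embedding level for brevity.

```python
import torch
import torch.nn as nn

EMB_DIM, PREFIX_LEN, NUM_LABELS = 768, 5, 3  # illustrative sizes

class ControlPrefix(nn.Module):
    """One learned prefix per control label, chosen dynamically
    per input (a sketch of the dynamic-prefix idea)."""
    def __init__(self):
        super().__init__()
        self.table = nn.Parameter(torch.randn(NUM_LABELS, PREFIX_LEN, EMB_DIM))

    def forward(self, token_embeds, label_ids):
        # token_embeds: (batch, seq, EMB_DIM); label_ids: (batch,)
        prefix = self.table[label_ids]          # (batch, PREFIX_LEN, EMB_DIM)
        return torch.cat([prefix, token_embeds], dim=1)

# Usage: prepend a hypothetical "formal" prefix (label 2) to a batch.
cp = ControlPrefix()
embeds = torch.randn(4, 12, EMB_DIM)            # stand-in token embeddings
controlled = cp(embeds, torch.tensor([2, 2, 2, 2]))
print(controlled.shape)                         # torch.Size([4, 17, 768])
# `controlled` replaces the LM's usual input embeddings; only
# `cp.table` would be trained, the LM itself stays frozen.
```

Only the prefix table carries control information, so adding a new control amounts to adding rows to the table rather than finetuning the LM.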