A Unified Framework for Pun Generation with Humor Principles
- URL: http://arxiv.org/abs/2210.13055v1
- Date: Mon, 24 Oct 2022 09:20:45 GMT
- Title: A Unified Framework for Pun Generation with Humor Principles
- Authors: Yufei Tian, Divyanshu Sheth and Nanyun Peng
- Abstract summary: We propose a unified framework to generate both homophonic and homographic puns.
We incorporate three linguistic attributes of puns to the language models: ambiguity, distinctiveness, and surprise.
- Score: 31.70470387786539
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a unified framework to generate both homophonic and homographic
puns to resolve the split-up in existing works. Specifically, we incorporate
three linguistic attributes of puns to the language models: ambiguity,
distinctiveness, and surprise. Our framework consists of three parts: 1) a
context words/phrases selector to promote the aforementioned attributes, 2) a
generation model trained on non-pun sentences to incorporate the context
words/phrases into the generation output, and 3) a label predictor that learns
the structure of puns which is used to steer the generation model at inference
time. Evaluation results on both pun types demonstrate the efficacy of our
model over strong baselines.
Related papers
- Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models [113.58052868898173]
We identify and characterize a phenomenon never discussed before, where models leak irrelevant information from the prompt into the generation in unexpected ways.
We propose an evaluation setting to detect semantic leakage both by humans and automatically, curate a diverse test suite for diagnosing this behavior, and measure significant semantic leakage in 13 flagship models.
arXiv Detail & Related papers (2024-08-12T22:30:55Z) - Not wacky vs. definitely wacky: A study of scalar adverbs in pretrained
language models [0.0]
Modern pretrained language models, such as BERT, RoBERTa and GPT-3 hold the promise of performing better on logical tasks than classic static word embeddings.
We investigate the extent to which BERT, RoBERTa, GPT-2 and GPT-3 exhibit general, human-like, knowledge of these common words.
We find that despite capturing some aspects of logical meaning, the models fall far short of human performance.
arXiv Detail & Related papers (2023-05-25T18:56:26Z) - Context-Situated Pun Generation [42.727010784168115]
We propose a new task, context-situated pun generation, where a specific context represented by a set of keywords is provided.
The task is to first identify suitable pun words that are appropriate for the context, then generate puns based on the context keywords and the identified pun words.
We show that 69% of our top retrieved pun words can be used to generate context-situated puns, and our generation module yields successful 31% of the time.
arXiv Detail & Related papers (2022-10-24T18:24:48Z) - ExPUNations: Augmenting Puns with Keywords and Explanations [88.58174386894913]
We augment an existing dataset of puns with detailed crowdsourced annotations of keywords.
This is the first humor dataset with such extensive and fine-grained annotations specifically for puns.
We propose two tasks: explanation generation to aid with pun classification and keyword-conditioned pun generation.
arXiv Detail & Related papers (2022-10-24T18:12:02Z) - AmbiPun: Generating Humorous Puns with Ambiguous Context [31.81213062995652]
Our model first produces a list of related concepts through a reverse dictionary.
We then utilize one-shot GPT3 to generate context words and then generate puns incorporating context words from both concepts.
Human evaluation shows that our method successfully generates pun 52% of the time.
arXiv Detail & Related papers (2022-05-04T00:24:11Z) - Investigating Cross-Linguistic Adjective Ordering Tendencies with a
Latent-Variable Model [66.84264870118723]
We present the first purely corpus-driven model of multi-lingual adjective ordering in the form of a latent-variable model.
We provide strong converging evidence for the existence of universal, cross-linguistic, hierarchical adjective ordering tendencies.
arXiv Detail & Related papers (2020-10-09T18:27:55Z) - "The Boating Store Had Its Best Sail Ever": Pronunciation-attentive
Contextualized Pun Recognition [80.59427655743092]
We propose Pronunciation-attentive Contextualized Pun Recognition (PCPR) to perceive human humor.
PCPR derives contextualized representation for each word in a sentence by capturing the association between the surrounding context and its corresponding phonetic symbols.
Results demonstrate that the proposed approach significantly outperforms the state-of-the-art methods in pun detection and location tasks.
arXiv Detail & Related papers (2020-04-29T20:12:20Z) - SongNet: Rigid Formats Controlled Text Generation [51.428634666559724]
We propose a simple and elegant framework named SongNet to tackle this problem.
The backbone of the framework is a Transformer-based auto-regressive language model.
A pre-training and fine-tuning framework is designed to further improve the generation quality.
arXiv Detail & Related papers (2020-04-17T01:40:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.