Decrypting Cryptic Crosswords: Semantically Complex Wordplay Puzzles as
a Target for NLP
- URL: http://arxiv.org/abs/2104.08620v1
- Date: Sat, 17 Apr 2021 18:54:00 GMT
- Title: Decrypting Cryptic Crosswords: Semantically Complex Wordplay Puzzles as
a Target for NLP
- Authors: Josh Rozner, Christopher Potts, Kyle Mahowald
- Abstract summary: Cryptic crosswords are the dominant English-language crossword variety in the United Kingdom.
We present a dataset of cryptic crossword clues that can be used as a benchmark and train a sequence-to-sequence model to solve them.
We show that performance can be substantially improved using a novel curriculum learning approach.
- Score: 5.447716844779342
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Cryptic crosswords, the dominant English-language crossword variety in the
United Kingdom, can be solved by expert humans using flexible, creative
intelligence and knowledge of language. Cryptic clues read like fluent natural
language, but they are adversarially composed of two parts: a definition and a
wordplay cipher requiring sub-word or character-level manipulations. As such,
they are a promising target for evaluating and advancing NLP systems that seek
to process language in more creative, human-like ways. We present a dataset of
cryptic crossword clues from a major newspaper that can be used as a benchmark
and train a sequence-to-sequence model to solve them. We also develop related
benchmarks that can guide development of approaches to this challenging task.
We show that performance can be substantially improved using a novel curriculum
learning approach in which the model is pre-trained on related tasks involving,
e.g, unscrambling words, before it is trained to solve cryptics. However, even
this curricular approach does not generalize to novel clue types in the way
that humans can, and so cryptic crosswords remain a challenge for NLP systems
and a potential source of future innovation.
Related papers
- A survey of neural-network-based methods utilising comparable data for finding translation equivalents [0.0]
We present the most common approaches from NLP that endeavour to automatically induce one of the essential dictionary components.
We analyse them from a lexicographic perspective since their viewpoints are crucial for improving the described methods.
This survey encourages a connection between the NLP and lexicography fields as the NLP field can benefit from lexicographic insights.
arXiv Detail & Related papers (2024-10-19T16:10:41Z) - Language Models are Crossword Solvers [1.53744306569115]
We tackle the challenge of solving crosswords with Large Language Models (LLMs)
We demonstrate that the current generation of state-of-the art (SoTA) language models show significant competence at deciphering cryptic crossword clues.
We also develop a search algorithm that builds off this performance to tackle the problem of solving full crossword grids with LLMs.
arXiv Detail & Related papers (2024-06-13T12:29:27Z) - A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus [71.77214818319054]
Natural language inference is a proxy for natural language understanding.
There is no publicly available NLI corpus for the Romanian language.
We introduce the first Romanian NLI corpus (RoNLI) comprising 58K training sentence pairs.
arXiv Detail & Related papers (2024-05-20T08:41:15Z) - LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language
Models [56.25156596019168]
This paper introduces the LMRL-Gym benchmark for evaluating multi-turn RL for large language models (LLMs)
Our benchmark consists of 8 different language tasks, which require multiple rounds of language interaction and cover a range of tasks in open-ended dialogue and text games.
arXiv Detail & Related papers (2023-11-30T03:59:31Z) - Italian Crossword Generator: Enhancing Education through Interactive
Word Puzzles [9.84767617576152]
We develop a comprehensive system for generating and verifying crossword clues.
A dataset of clue-answer pairs was compiled to fine-tune the models.
For generating crossword clues from a given text, Zero/Few-shot learning techniques were used.
arXiv Detail & Related papers (2023-11-27T11:17:29Z) - Large Language Models are Fixated by Red Herrings: Exploring Creative
Problem Solving and Einstellung Effect using the Only Connect Wall Dataset [4.789429120223149]
The quest for human imitative AI has been an enduring topic in AI research since its inception.
Creative problem solving in humans is a well-studied topic in cognitive neuroscience.
Only Connect Wall segment essentially mimics Mednick's Remote Associates Test (RAT) formulation with built-in, deliberate red herrings.
arXiv Detail & Related papers (2023-06-19T21:14:57Z) - Prompting Language Models for Linguistic Structure [73.11488464916668]
We present a structured prompting approach for linguistic structured prediction tasks.
We evaluate this approach on part-of-speech tagging, named entity recognition, and sentence chunking.
We find that while PLMs contain significant prior knowledge of task labels due to task leakage into the pretraining corpus, structured prompting can also retrieve linguistic structure with arbitrary labels.
arXiv Detail & Related papers (2022-11-15T01:13:39Z) - Retrieval-Augmented Multilingual Keyphrase Generation with
Retriever-Generator Iterative Training [66.64843711515341]
Keyphrase generation is the task of automatically predicting keyphrases given a piece of long text.
We call attention to a new setting named multilingual keyphrase generation.
We propose a retrieval-augmented method for multilingual keyphrase generation to mitigate the data shortage problem in non-English languages.
arXiv Detail & Related papers (2022-05-21T00:45:21Z) - Between words and characters: A Brief History of Open-Vocabulary
Modeling and Tokenization in NLP [22.772546707304766]
We show how hybrid approaches of words and characters as well as subword-based approaches based on learned segmentation have been proposed and evaluated.
We conclude that there is and likely will never be a silver bullet singular solution for all applications.
arXiv Detail & Related papers (2021-12-20T13:04:18Z) - Reinforced Iterative Knowledge Distillation for Cross-Lingual Named
Entity Recognition [54.92161571089808]
Cross-lingual NER transfers knowledge from rich-resource language to languages with low resources.
Existing cross-lingual NER methods do not make good use of rich unlabeled data in target languages.
We develop a novel approach based on the ideas of semi-supervised learning and reinforcement learning.
arXiv Detail & Related papers (2021-06-01T05:46:22Z) - On the Importance of Word Order Information in Cross-lingual Sequence
Labeling [80.65425412067464]
Cross-lingual models that fit into the word order of the source language might fail to handle target languages.
We investigate whether making models insensitive to the word order of the source language can improve the adaptation performance in target languages.
arXiv Detail & Related papers (2020-01-30T03:35:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.