Semantic Diversity in Dialogue with Natural Language Inference
        - URL: http://arxiv.org/abs/2205.01497v1
- Date: Tue, 3 May 2022 13:56:32 GMT
- Title: Semantic Diversity in Dialogue with Natural Language Inference
- Authors: Katherine Stasaski and Marti A. Hearst
- Abstract summary: This paper makes two substantial contributions to improving diversity in dialogue generation.
First, we propose a novel metric which uses Natural Language Inference (NLI) to measure the semantic diversity of a set of model responses for a conversation.
Second, we demonstrate how to iteratively improve the semantic diversity of a sampled set of responses via a new generation procedure called Diversity Threshold Generation.
- Score: 19.74618235525502
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   Generating diverse, interesting responses to chitchat conversations is a
problem for neural conversational agents. This paper makes two substantial
contributions to improving diversity in dialogue generation. First, we propose
a novel metric which uses Natural Language Inference (NLI) to measure the
semantic diversity of a set of model responses for a conversation. We evaluate
this metric using an established framework (Tevet and Berant, 2021) and find
strong evidence indicating NLI Diversity is correlated with semantic diversity.
Specifically, we show that the contradiction relation is more useful than the
neutral relation for measuring this diversity and that incorporating the NLI
model's confidence achieves state-of-the-art results. Second, we demonstrate
how to iteratively improve the semantic diversity of a sampled set of responses
via a new generation procedure called Diversity Threshold Generation, which
results in an average 137% increase in NLI Diversity compared to standard
generation procedures.
 
      
        Related papers
        - A survey of diversity quantification in natural language processing: The   why, what, where and how [2.5833049611832273]
 We survey articles in the ACL Anthology from the past 6 years, with "diversity" or "diverse" in their title.<n>We put forward a unified taxonomy of why, what on, where, and how diversity is measured in NLP.<n>We believe that this study paves the way towards a better formalization of diversity in NLP.
 arXiv  Detail & Related papers  (2025-07-28T14:12:34Z)
- Evaluating the Diversity and Quality of LLM Generated Content [72.84945252821908]
 We introduce a framework for measuring effective semantic diversity--diversity among outputs that meet quality thresholds.
Although preference-tuned models exhibit reduced lexical and syntactic diversity, they produce greater effective semantic diversity than SFT or base models.
These findings have important implications for applications that require diverse yet high-quality outputs.
 arXiv  Detail & Related papers  (2025-04-16T23:02:23Z)
- The Factuality Tax of Diversity-Intervened Text-to-Image Generation:   Benchmark and Fact-Augmented Intervention [61.80236015147771]
 We quantify the trade-off between using diversity interventions and preserving demographic factuality in T2I models.
Experiments on DoFaiR reveal that diversity-oriented instructions increase the number of different gender and racial groups.
We propose Fact-Augmented Intervention (FAI) to reflect on verbalized or retrieved factual information about gender and racial compositions of generation subjects in history.
 arXiv  Detail & Related papers  (2024-06-29T09:09:42Z)
- Improving Diversity of Commonsense Generation by Large Language Models   via In-Context Learning [28.654890118684957]
 Generative Commonsense Reasoning (GCR) requires a model to reason about a situation using commonsense knowledge.
The diversity of the generation is equally important because it reflects the model's ability to use a range of commonsense knowledge facts.
We propose a simple method that diversifies the LLM generations, while preserving their quality.
 arXiv  Detail & Related papers  (2024-04-25T17:52:39Z)
- Scaling Data Diversity for Fine-Tuning Language Models in Human   Alignment [84.32768080422349]
 Alignment with human preference prevents large language models from generating misleading or toxic content.
We propose a new formulation of prompt diversity, implying a linear correlation with the final performance of LLMs after fine-tuning.
 arXiv  Detail & Related papers  (2024-03-17T07:08:55Z)
- Improving Diversity of Demographic Representation in Large Language
  Models via Collective-Critiques and Self-Voting [19.79214899011072]
 This paper formalizes diversity of representation in generative large language models.
We present evaluation datasets and propose metrics to measure diversity in generated responses along people and culture axes.
We find that LLMs understand the notion of diversity, and that they can reason and critique their own responses for that goal.
 arXiv  Detail & Related papers  (2023-10-25T10:17:17Z)
- Diversify Question Generation with Retrieval-Augmented Style Transfer [68.00794669873196]
 We propose RAST, a framework for Retrieval-Augmented Style Transfer.
The objective is to utilize the style of diverse templates for question generation.
We develop a novel Reinforcement Learning (RL) based approach that maximizes a weighted combination of diversity reward and consistency reward.
 arXiv  Detail & Related papers  (2023-10-23T02:27:31Z)
- Diverse and Faithful Knowledge-Grounded Dialogue Generation via
  Sequential Posterior Inference [82.28542500317445]
 We present an end-to-end learning framework, termed Sequential Posterior Inference (SPI), capable of selecting knowledge and generating dialogues.
Unlike other methods, SPI does not require the inference network or assume a simple geometry of the posterior distribution.
 arXiv  Detail & Related papers  (2023-06-01T21:23:13Z)
- Pragmatically Appropriate Diversity for Dialogue Evaluation [19.74618235525502]
 Linguistic pragmatics state that a conversation's underlying speech acts can constrain the type of response which is appropriate at each turn in the conversation.
We propose the notion of Pragmatically Appropriate Diversity, defined as the extent to which a conversation creates and constrains the creation of multiple diverse responses.
 arXiv  Detail & Related papers  (2023-04-06T01:24:18Z)
- Exploring Diversity in Back Translation for Low-Resource Machine
  Translation [85.03257601325183]
 Back translation is one of the most widely used methods for improving the performance of neural machine translation systems.
Recent research has sought to enhance the effectiveness of this method by increasing the 'diversity' of the generated translations.
This work puts forward a more nuanced framework for understanding diversity in training data, splitting it into lexical diversity and syntactic diversity.
 arXiv  Detail & Related papers  (2022-06-01T15:21:16Z)
- Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase
  Generation Approach [97.38622477085188]
 We propose BTmPG (Back-Translation guided multi-round Paraphrase Generation) to improve diversity of paraphrase.
We evaluate BTmPG on two benchmark datasets.
 arXiv  Detail & Related papers  (2021-09-04T13:12:01Z)
- Informed Sampling for Diversity in Concept-to-Text NLG [8.883733362171034]
 We propose an Imitation Learning approach to explore the level of diversity that a language generation model can reliably produce.
Specifically, we augment the decoding process with a meta-classifier trained to distinguish which words at any given timestep will lead to high-quality output.
 arXiv  Detail & Related papers  (2020-04-29T17:43:24Z)
- Evaluating the Evaluation of Diversity in Natural Language Generation [43.05127848086264]
 We propose a framework for evaluating diversity metrics in natural language generation systems.
Our framework can advance the understanding of different diversity metrics, an essential step on the road towards better NLG systems.
 arXiv  Detail & Related papers  (2020-04-06T20:44:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.