Benchmark tasks for Quality-Diversity applied to Uncertain domains
- URL: http://arxiv.org/abs/2304.12454v2
- Date: Wed, 26 Apr 2023 17:46:21 GMT
- Title: Benchmark tasks for Quality-Diversity applied to Uncertain domains
- Authors: Manon Flageat and Luca Grillotti and Antoine Cully
- Abstract summary: We introduce a set of 8 easy-to-implement and lightweight tasks, split into 3 main categories.
We identify the key uncertainty properties to easily define UQD benchmark tasks.
All our tasks build on the Redundant Arm: a common QD environment that is lightweight and easily replicable.
- Score: 1.5469452301122175
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: While standard approaches to optimisation focus on producing a single
high-performing solution, Quality-Diversity (QD) algorithms allow large diverse
collections of such solutions to be found. If QD has proven promising across a
large variety of domains, it still struggles when faced with uncertain domains,
where quantification of performance and diversity are non-deterministic.
Previous work in Uncertain Quality-Diversity (UQD) has proposed methods and
metrics designed for such uncertain domains. In this paper, we propose a first
set of benchmark tasks to analyse and estimate the performance of UQD
algorithms. We identify the key uncertainty properties to easily define UQD
benchmark tasks: the uncertainty location, the type of distribution and its
parameters. By varying the nature of those key UQD components, we introduce a
set of 8 easy-to-implement and lightweight tasks, split into 3 main categories.
All our tasks build on the Redundant Arm: a common QD environment that is
lightweight and easily replicable. Each one of these tasks highlights one
specific limitation that arises when considering UQD domains. With this first
benchmark, we hope to facilitate later advances in UQD.
Related papers
- Exploring the Performance-Reproducibility Trade-off in Quality-Diversity [7.858994681440057]
Quality-Diversity (QD) algorithms have exhibited promising results across many domains and applications.
However, uncertainty in fitness and behaviour estimations of solutions remains a major challenge when QD is used in complex real-world applications.
We propose four new a-priori QD algorithms that find optimal solutions for given preferences over the trade-offs.
arXiv Detail & Related papers (2024-09-20T08:20:31Z) - KaPQA: Knowledge-Augmented Product Question-Answering [59.096607961704656]
We introduce two product question-answering (QA) datasets focused on Adobe Acrobat and Photoshop products.
We also propose a novel knowledge-driven RAG-QA framework to enhance the performance of the models in the product QA task.
arXiv Detail & Related papers (2024-07-22T22:14:56Z) - Efficient Quality-Diversity Optimization through Diverse Quality Species [3.428706362109921]
We show that a diverse population of solutions can be found without the limitation of needing an archive or defining the range of behaviors in advance.
We propose Diverse Quality Species (DQS) as an alternative to archive-based Quality-Diversity (QD) algorithms.
arXiv Detail & Related papers (2023-04-14T23:15:51Z) - Don't Bet on Luck Alone: Enhancing Behavioral Reproducibility of
Quality-Diversity Solutions in Uncertain Domains [2.639902239625779]
We introduce Archive Reproducibility Improvement Algorithm (ARIA)
ARIA is a plug-and-play approach that improves the quality of solutions present in an archive.
We show that our algorithm enhances the quality and descriptor space coverage of any given archive by at least 50%.
arXiv Detail & Related papers (2023-04-07T14:45:14Z) - Uncertain Quality-Diversity: Evaluation methodology and new methods for
Quality-Diversity in Uncertain Domains [1.827510863075184]
Quality-Diversity optimisation (QD) has proven to yield promising results across a broad set of applications.
However, QD approaches struggle in the presence of uncertainty in the environment, as it impacts their ability to quantify the true performance and novelty of solutions.
We formalise a common framework for uncertain domains: the Uncertain QD setting.
Second, we propose a new methodology to evaluate Uncertain QD approaches, relying on a new per-generation sampling budget and a set of existing and new metrics specifically designed for Uncertain QD.
Third, we propose three new Uncertain QD algorithms: Archive-sampling, Parallel-Adaptive
arXiv Detail & Related papers (2023-02-01T14:12:04Z) - IM-IAD: Industrial Image Anomaly Detection Benchmark in Manufacturing [88.35145788575348]
Image anomaly detection (IAD) is an emerging and vital computer vision task in industrial manufacturing.
The lack of a uniform IM benchmark is hindering the development and usage of IAD methods in real-world applications.
We construct a comprehensive image anomaly detection benchmark (IM-IAD), which includes 19 algorithms on seven major datasets.
arXiv Detail & Related papers (2023-01-31T01:24:45Z) - Towards QD-suite: developing a set of benchmarks for Quality-Diversity
algorithms [0.0]
Existing benchmarks are not standardized, and there is currently no MNIST equivalent for Quality-Diversity (QD)
We argue that the identification of challenges faced by QD methods and the development of targeted, challenging, scalable benchmarks is an important step.
arXiv Detail & Related papers (2022-05-06T13:33:50Z) - QAFactEval: Improved QA-Based Factual Consistency Evaluation for
Summarization [116.56171113972944]
We show that carefully choosing the components of a QA-based metric is critical to performance.
Our solution improves upon the best-performing entailment-based metric and achieves state-of-the-art performance.
arXiv Detail & Related papers (2021-12-16T00:38:35Z) - Few-shot Quality-Diversity Optimization [50.337225556491774]
Quality-Diversity (QD) optimization has been shown to be effective tools in dealing with deceptive minima and sparse rewards in Reinforcement Learning.
We show that, given examples from a task distribution, information about the paths taken by optimization in parameter space can be leveraged to build a prior population, which when used to initialize QD methods in unseen environments, allows for few-shot adaptation.
Experiments carried in both sparse and dense reward settings using robotic manipulation and navigation benchmarks show that it considerably reduces the number of generations that are required for QD optimization in these environments.
arXiv Detail & Related papers (2021-09-14T17:12:20Z) - SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep
Reinforcement Learning [102.78958681141577]
We present SUNRISE, a simple unified ensemble method, which is compatible with various off-policy deep reinforcement learning algorithms.
SUNRISE integrates two key ingredients: (a) ensemble-based weighted Bellman backups, which re-weight target Q-values based on uncertainty estimates from a Q-ensemble, and (b) an inference method that selects actions using the highest upper-confidence bounds for efficient exploration.
arXiv Detail & Related papers (2020-07-09T17:08:44Z) - Generating Diverse and Consistent QA pairs from Contexts with
Information-Maximizing Hierarchical Conditional VAEs [62.71505254770827]
We propose a conditional variational autoencoder (HCVAE) for generating QA pairs given unstructured texts as contexts.
Our model obtains impressive performance gains over all baselines on both tasks, using only a fraction of data for training.
arXiv Detail & Related papers (2020-05-28T08:26:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.