RangL: A Reinforcement Learning Competition Platform
- URL: http://arxiv.org/abs/2208.00003v1
- Date: Thu, 28 Jul 2022 09:44:21 GMT
- Title: RangL: A Reinforcement Learning Competition Platform
- Authors: Viktor Zobernig, Richard A. Saldanha, Jinke He, Erica van der Sar,
Jasper van Doorn, Jia-Chen Hua, Lachlan R. Mason, Aleksander Czechowski,
Drago Indjic, Tomasz Kosmala, Alessandro Zocca, Sandjai Bhulai, Jorge
Montalvo Arvizu, Claude Kl\"ockl, John Moriarty
- Abstract summary: RangL aims to encourage the wider uptake of reinforcement learning by supporting competitions relating to real-world dynamic decision problems.
This article describes the reusable code repository developed by the RangL team and deployed for the 2022 Pathways to Net Zero Challenge.
The winning solutions to this particular Challenge seek to optimize the UK's energy transition policy to net zero carbon emissions by 2050.
- Score: 82.1944886411643
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The RangL project hosted by The Alan Turing Institute aims to encourage the
wider uptake of reinforcement learning by supporting competitions relating to
real-world dynamic decision problems. This article describes the reusable code
repository developed by the RangL team and deployed for the 2022 Pathways to
Net Zero Challenge, supported by the UK Net Zero Technology Centre. The winning
solutions to this particular Challenge seek to optimize the UK's energy
transition policy to net zero carbon emissions by 2050. The RangL repository
includes an OpenAI Gym reinforcement learning environment and code that
supports both submission to, and evaluation in, a remote instance of the open
source EvalAI platform as well as all winning learning agent strategies. The
repository is an illustrative example of RangL's capability to provide a
reusable structure for future challenges.
Related papers
- State and Action Factorization in Power Grids [47.65236082304256]
We propose a domain-agnostic algorithm that estimates correlations between state and action components entirely based on data.
The algorithm is validated on a power grid benchmark obtained with the Grid2Op simulator.
arXiv Detail & Related papers (2024-09-03T15:00:58Z) - Benchmarking Robustness and Generalization in Multi-Agent Systems: A
Case Study on Neural MMO [50.58083807719749]
We present the results of the second Neural MMO challenge, hosted at IJCAI 2022, which received 1600+ submissions.
This competition targets robustness and generalization in multi-agent systems.
We will open-source our benchmark including the environment wrapper, baselines, a visualization tool, and selected policies for further research.
arXiv Detail & Related papers (2023-08-30T07:16:11Z) - Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the
MineRL BASALT 2022 Competition [20.922425732605756]
The BASALT challenge asks teams to compete to develop algorithms to solve tasks with hard-to-specify reward functions in Minecraft.
We describe the competition and provide an overview of the top solutions.
arXiv Detail & Related papers (2023-03-23T17:59:17Z) - Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone
fine-tuning without episodic meta-learning dominates for few-shot learning
image classification [40.901760230639496]
We describe the design of the MetaDL competition series, the datasets, the best experimental results, and the top-ranked methods in the NeurIPS 2021 challenge.
The solutions of the top participants have been open-sourced.
arXiv Detail & Related papers (2022-06-15T10:27:23Z) - MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned [60.11039031794829]
Reinforcement learning competitions advance the field by providing appropriate scope and support to develop solutions toward a specific problem.
We hosted the third edition of the MineRL ObtainDiamond competition, MineRL Diamond 2021, with a separate track in which we permitted any solution to promote the participation of newcomers.
The participants of this easier track were able to obtain a diamond, and the participants of the harder track progressed the generalizable solutions in the same task.
arXiv Detail & Related papers (2022-02-17T13:37:35Z) - Two-Stream Consensus Network: Submission to HACS Challenge 2021
Weakly-Supervised Learning Track [78.64815984927425]
The goal of weakly-supervised temporal action localization is to temporally locate and classify action of interest in untrimmed videos.
We adopt the two-stream consensus network (TSCN) as the main framework in this challenge.
Our solution ranked 2rd in this challenge, and we hope our method can serve as a baseline for future academic research.
arXiv Detail & Related papers (2021-06-21T03:36:36Z) - Towards robust and domain agnostic reinforcement learning competitions [12.731614722371376]
Reinforcement learning competitions have formed the basis for standard research benchmarks.
Despite this, a majority of challenges suffer from the same fundamental problems.
We present a new framework of competition design that promotes the development of algorithms that overcome these barriers.
arXiv Detail & Related papers (2021-06-07T16:15:46Z) - The MineRL 2020 Competition on Sample Efficient Reinforcement Learning
using Human Priors [62.9301667732188]
We propose a second iteration of the MineRL Competition.
The primary goal of the competition is to foster the development of algorithms which can efficiently leverage human demonstrations.
The competition is structured into two rounds in which competitors are provided several paired versions of the dataset and environment.
At the end of each round, competitors submit containerized versions of their learning algorithms to the AIcrowd platform.
arXiv Detail & Related papers (2021-01-26T20:32:30Z) - A Centralised Soft Actor Critic Deep Reinforcement Learning Approach to
District Demand Side Management through CityLearn [0.0]
Reinforcement learning is a promising model-free and adaptive controller for demand side management.
This paper presents the results of the algorithm that was submitted for the CityLearn Challenge.
The proposed solution secured second place in the challenge using a centralised 'Soft Actor Critic' deep reinforcement learning agent.
arXiv Detail & Related papers (2020-09-22T14:03:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.