Unlucky Explorer: A Complete non-Overlapping Map Exploration
- URL: http://arxiv.org/abs/2005.14156v1
- Date: Thu, 28 May 2020 17:19:24 GMT
- Title: Unlucky Explorer: A Complete non-Overlapping Map Exploration
- Authors: Mohammad Sina Kiarostami and Saleh Khalaj Monfared and Mohammadreza
Daneshvaramoli and Ali Oliayi and Negar Yousefian and Dara Rahmati and Saeid
Gorgin
- Abstract summary: We introduce the Maze Dash puzzle as an exploration problem where the agent must find a Hamiltonian Path visiting all the cells.
An optimization has been applied to the proposed Monte-Carlo Tree Search (MCTS) algorithm to obtain a promising result.
Our comparison indicates that the MCTS-based approach is an up-and-coming method that could cope with the test cases with small and medium sizes.
- Score: 0.949996206597248
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Nowadays, the field of Artificial Intelligence in Computer Games (AI in
Games) is going to be more alluring since computer games challenge many aspects
of AI with a wide range of problems, particularly general problems. One of
these kinds of problems is Exploration, which states that an unknown
environment must be explored by one or several agents. In this work, we have
first introduced the Maze Dash puzzle as an exploration problem where the agent
must find a Hamiltonian Path visiting all the cells. Then, we have investigated
to find suitable methods by a focus on Monte-Carlo Tree Search (MCTS) and SAT
to solve this puzzle quickly and accurately. An optimization has been applied
to the proposed MCTS algorithm to obtain a promising result. Also, since the
prefabricated test cases of this puzzle are not large enough to assay the
proposed method, we have proposed and employed a technique to generate solvable
test cases to evaluate the approaches. Eventually, the MCTS-based method has
been assessed by the auto-generated test cases and compared with our
implemented SAT approach that is considered a good rival. Our comparison
indicates that the MCTS-based approach is an up-and-coming method that could
cope with the test cases with small and medium sizes with faster run-time
compared to SAT. However, for certain discussed reasons, including the features
of the problem, tree search organization, and also the approach of MCTS in the
Simulation step, MCTS takes more time to execute in Large size scenarios.
Consequently, we have found the bottleneck for the MCTS-based method in
significant test cases that could be improved in two real-world problems.
Related papers
- LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning [56.273799410256075]
The framework combines Monte Carlo Tree Search (MCTS) with iterative Self-Refine to optimize the reasoning path.
The framework has been tested on general and advanced benchmarks, showing superior performance in terms of search efficiency and problem-solving capability.
arXiv Detail & Related papers (2024-10-03T18:12:29Z) - POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation [76.67608003501479]
We introduce and specify an evaluation protocol defining a range of domain-related metrics computed on the basics of the primary evaluation indicators.
The results of such a comparison, which involves a variety of state-of-the-art MARL, search-based, and hybrid methods, are presented.
arXiv Detail & Related papers (2024-07-20T16:37:21Z) - Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions [48.251724997889184]
We develop a benchmark called Problems with Missing and Contradictory conditions (PMC)
We introduce two novel metrics to evaluate the performance of few-shot prompting methods in these scenarios.
We propose a novel few-shot prompting method called SMT-LIB Prompting (SLP), which utilizes the SMT-LIB language to model the problems instead of solving them directly.
arXiv Detail & Related papers (2024-06-07T16:24:12Z) - Amplifying Exploration in Monte-Carlo Tree Search by Focusing on the
Unknown [19.664506834858244]
Monte-Carlo tree search (MCTS) strategically allocates computational resources to focus on promising segments of the search tree.
Our proposed methodology, denoted as AmEx-MCTS, solves this problem by introducing a novel MCTS formulation.
Our empirical evaluation demonstrates the superior performance of AmEx-MCTS, surpassing classical MCTS and related approaches by a substantial margin.
arXiv Detail & Related papers (2024-02-13T15:05:54Z) - LightZero: A Unified Benchmark for Monte Carlo Tree Search in General
Sequential Decision Scenarios [32.83545787965431]
Building agents based on tree-search planning capabilities with learned models has achieved remarkable success in classic decision-making problems, such as Go and Atari.
It has been deemed challenging or even infeasible to extend Monte Carlo Tree Search (MCTS) based algorithms to diverse real-world applications.
In this work, we introduce LightZero, the first unified benchmark for deploying MCTS/MuZero in general sequential decision scenarios.
arXiv Detail & Related papers (2023-10-12T14:18:09Z) - Monte-Carlo Tree Search for Multi-Agent Pathfinding: Preliminary Results [60.4817465598352]
We introduce an original variant of Monte-Carlo Tree Search (MCTS) tailored to multi-agent pathfinding.
Specifically, we use individual paths to assist the agents with the the goal-reaching behavior.
We also use a dedicated decomposition technique to reduce the branching factor of the tree search procedure.
arXiv Detail & Related papers (2023-07-25T12:33:53Z) - Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions [89.89612827542972]
This paper proposes a variant of Monte-Carlo tree search (MCTS) that spends more search time on harder states and less search time on simpler states adaptively.
We evaluate the performance and computations on $9 times 9$ Go board games and Atari games.
Experiments show that our method can achieve comparable performances to the original search algorithm while requiring less than $50%$ search time on average.
arXiv Detail & Related papers (2022-10-23T06:39:20Z) - Machine Learning for Online Algorithm Selection under Censored Feedback [71.6879432974126]
In online algorithm selection (OAS), instances of an algorithmic problem class are presented to an agent one after another, and the agent has to quickly select a presumably best algorithm from a fixed set of candidate algorithms.
For decision problems such as satisfiability (SAT), quality typically refers to the algorithm's runtime.
In this work, we revisit multi-armed bandit algorithms for OAS and discuss their capability of dealing with the problem.
We adapt them towards runtime-oriented losses, allowing for partially censored data while keeping a space- and time-complexity independent of the time horizon.
arXiv Detail & Related papers (2021-09-13T18:10:52Z) - Monte Carlo Tree Search: A Review of Recent Modifications and
Applications [0.17205106391379024]
Monte Carlo Tree Search (MCTS) is a powerful approach to designing game-playing bots or solving sequential decision problems.
The method relies on intelligent tree search that balances exploration and exploitation.
The method has become a state-of-the-art technique for games, however, in more complex games.
arXiv Detail & Related papers (2021-03-08T17:44:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.