Monte Carlo Tree Search for high precision manufacturing
- URL: http://arxiv.org/abs/2108.01789v1
- Date: Wed, 28 Jul 2021 14:56:17 GMT
- Title: Monte Carlo Tree Search for high precision manufacturing
- Authors: Dorina Weichert, Felix Horchler, Alexander Kister, Marcus Trost,
Johannes Hartung, Stefan Risse
- Abstract summary: We make use of an expert-based simulator and adapt the MCTS default policy to deal with the manufacturing process.
Common reasons for this are that there is no efficient simulator of the process available or there exist problems in applying MCTS to the complex rules of the process.
- Score: 55.60116686945561
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Monte Carlo Tree Search (MCTS) has shown its strength for a lot of
deterministic and stochastic examples, but literature lacks reports of
applications to real world industrial processes. Common reasons for this are
that there is no efficient simulator of the process available or there exist
problems in applying MCTS to the complex rules of the process. In this paper,
we apply MCTS for optimizing a high-precision manufacturing process that has
stochastic and partially observable outcomes. We make use of an
expert-knowledge-based simulator and adapt the MCTS default policy to deal with
the manufacturing process.
Related papers
- Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo [55.452453947359736]
We introduce a novel verification method based on Twisted Sequential Monte Carlo (TSMC)
We apply TSMC to Large Language Models by estimating the expected future rewards at partial solutions.
This approach results in a more straightforward training target that eliminates the need for step-wise human annotations.
arXiv Detail & Related papers (2024-10-02T18:17:54Z) - Learning Logic Specifications for Soft Policy Guidance in POMCP [71.69251176275638]
Partially Observable Monte Carlo Planning (POMCP) is an efficient solver for Partially Observable Markov Decision Processes (POMDPs)
POMCP suffers from sparse reward function, namely, rewards achieved only when the final goal is reached.
In this paper, we use inductive logic programming to learn logic specifications from traces of POMCP executions.
arXiv Detail & Related papers (2023-03-16T09:37:10Z) - Monte-Carlo Tree-Search for Leveraging Performance of Blackbox Job-Shop
Scheduling Heuristics [1.3764085113103217]
In manufacturing, the production is often done on out-of-the-shelf manufacturing lines.
We consider such a setting with a black-box job-shop system and an unknown scheduling that, for a given permutation of jobs, schedules the jobs for the black-box job-shop.
Here, the jobs need to enter the job-shop in the given order of the permutation, but may take different paths within the job shop, which depends on the black-box.
arXiv Detail & Related papers (2022-12-14T23:01:53Z) - Continuous Monte Carlo Graph Search [61.11769232283621]
Continuous Monte Carlo Graph Search ( CMCGS) is an extension of Monte Carlo Tree Search (MCTS) to online planning.
CMCGS takes advantage of the insight that, during planning, sharing the same action policy between several states can yield high performance.
It can be scaled up through parallelization, and it outperforms the Cross-Entropy Method (CEM) in continuous control with learned dynamics models.
arXiv Detail & Related papers (2022-10-04T07:34:06Z) - SMPL: Simulated Industrial Manufacturing and Process Control Learning
Environments [26.451888230418746]
There is little exploration of applying deep reinforcement learning to control manufacturing plants.
We develop an easy-to-use library that includes five high-fidelity simulation environments.
We benchmark online and offline, model-based and model-free reinforcement learning algorithms for comparisons of follow-up research.
arXiv Detail & Related papers (2022-06-17T15:51:35Z) - Low-variance estimation in the Plackett-Luce model via quasi-Monte Carlo
sampling [58.14878401145309]
We develop a novel approach to producing more sample-efficient estimators of expectations in the PL model.
We illustrate our findings both theoretically and empirically using real-world recommendation data from Amazon Music and the Yahoo learning-to-rank challenge.
arXiv Detail & Related papers (2022-05-12T11:15:47Z) - Towards Standardizing Reinforcement Learning Approaches for Stochastic
Production Scheduling [77.34726150561087]
reinforcement learning can be used to solve scheduling problems.
Existing studies rely on (sometimes) complex simulations for which the code is unavailable.
There is a vast array of RL designs to choose from.
standardization of model descriptions - both production setup and RL design - and validation scheme are a prerequisite.
arXiv Detail & Related papers (2021-04-16T16:07:10Z) - Monte Carlo Tree Search: A Review of Recent Modifications and
Applications [0.17205106391379024]
Monte Carlo Tree Search (MCTS) is a powerful approach to designing game-playing bots or solving sequential decision problems.
The method relies on intelligent tree search that balances exploration and exploitation.
The method has become a state-of-the-art technique for games, however, in more complex games.
arXiv Detail & Related papers (2021-03-08T17:44:15Z) - Nonintrusive Uncertainty Quantification for automotive crash problems
with VPS/Pamcrash [0.0]
surrogate models (metamodels) allow drastically reducing the computational cost of Monte Carlo process.
kernel Principal Component Analysis (kPCA) is found to be effective to simplify the model outcome description.
A benchmark crash test is used to show the efficiency of combining metamodels with kPCA.
arXiv Detail & Related papers (2021-02-15T16:59:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.