Some challenges of calibrating differentiable agent-based models
- URL: http://arxiv.org/abs/2307.01085v1
- Date: Mon, 3 Jul 2023 15:07:10 GMT
- Title: Some challenges of calibrating differentiable agent-based models
- Authors: Arnau Quera-Bofarull, Joel Dyer, Anisoara Calinescu, Michael
Wooldridge
- Abstract summary: Agent-based models (ABMs) are promising approach to modelling and reasoning about complex systems.
Their application in practice is impeded by their complexity, discrete nature, and the difficulty of performing parameter inference and optimisation tasks.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Agent-based models (ABMs) are a promising approach to modelling and reasoning
about complex systems, yet their application in practice is impeded by their
complexity, discrete nature, and the difficulty of performing parameter
inference and optimisation tasks. This in turn has sparked interest in the
construction of differentiable ABMs as a strategy for combatting these
difficulties, yet a number of challenges remain. In this paper, we discuss and
present experiments that highlight some of these challenges, along with
potential solutions.
Related papers
- Causality can systematically address the monsters under the bench(marks) [64.36592889550431]
Benchmarks are plagued by various biases, artifacts, or leakage.
Models may behave unreliably due to poorly explored failure modes.
causality offers an ideal framework to systematically address these challenges.
arXiv Detail & Related papers (2025-02-07T17:01:37Z) - BloomWise: Enhancing Problem-Solving capabilities of Large Language Models using Bloom's-Taxonomy-Inspired Prompts [59.83547898874152]
We introduce BloomWise, a new prompting technique, inspired by Bloom's taxonomy, to improve the performance of Large Language Models (LLMs)
The decision regarding the need to employ more sophisticated cognitive skills is based on self-evaluation performed by the LLM.
In extensive experiments across 4 popular math reasoning datasets, we have demonstrated the effectiveness of our proposed approach.
arXiv Detail & Related papers (2024-10-05T09:27:52Z) - The Missing Link: Allocation Performance in Causal Machine Learning [7.093692674858259]
We show how the performance of a single CATE model can vary significantly across different decision-making scenarios.
We highlight the differential influence of challenges such as distribution shifts on predictions and allocations.
arXiv Detail & Related papers (2024-07-15T14:57:40Z) - Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns? [57.80779199039929]
Large Language Models (LLMs) have demonstrated remarkable performance in solving math problems.
This paper introduces a novel benchmark, BeyondX, designed to address these limitations by incorporating problems with multiple unknowns.
Empirical study on BeyondX reveals that the performance of existing LLMs, even those fine-tuned specifically on math tasks, significantly decreases as the number of unknowns increases.
arXiv Detail & Related papers (2024-07-06T17:01:04Z) - VC Search: Bridging the Gap Between Well-Defined and Ill-Defined Problems in Mathematical Reasoning [46.25056744404318]
We develop a benchmark called Problems with Missing and Contradictory conditions ( PMC) containing over 5,000 validated ill-defined mathematical problems.
VCSEARCH improves the accuracy of identifying unsolvable problems by at least 12% across different large language models.
arXiv Detail & Related papers (2024-06-07T16:24:12Z) - Bayesian Nonparametrics: An Alternative to Deep Learning [0.5801621787540265]
This survey aims to delve into the significance of Bayesian nonparametrics, particularly in addressing complex challenges across various domains such as statistics, computer science, and electrical engineering.
We uncover the versatility and efficacy of Bayesian nonparametric methodologies, paving the way for innovative solutions to intricate challenges across diverse disciplines.
arXiv Detail & Related papers (2024-03-29T17:32:42Z) - HAZARD Challenge: Embodied Decision Making in Dynamically Changing
Environments [93.94020724735199]
HAZARD consists of three unexpected disaster scenarios, including fire, flood, and wind.
This benchmark enables us to evaluate autonomous agents' decision-making capabilities across various pipelines.
arXiv Detail & Related papers (2024-01-23T18:59:43Z) - ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent [50.508669199496474]
We develop a ReAct-style LLM agent with the ability to reason and act upon external knowledge.
We refine the agent through a ReST-like method that iteratively trains on previous trajectories.
Starting from a prompted large model and after just two iterations of the algorithm, we can produce a fine-tuned small model.
arXiv Detail & Related papers (2023-12-15T18:20:15Z) - Efficient lifting of symmetry breaking constraints for complex
combinatorial problems [9.156939957189502]
This work extends the learning framework and implementation of a model-based approach for Answer Set Programming.
In particular, we incorporate a new conflict analysis algorithm in the Inductive Logic Programming system ILASP.
arXiv Detail & Related papers (2022-05-14T20:42:13Z) - Integration of Convolutional Neural Networks in Mobile Applications [3.0280987248827085]
We study the performance of a system that integrates a Deep Learning model as a trade-off between the accuracy and the complexity.
We identify the most concerning challenges when deploying DL-based software in mobile applications.
arXiv Detail & Related papers (2021-03-11T15:27:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.