On learning agent-based models from data
- URL: http://arxiv.org/abs/2205.05052v1
- Date: Tue, 10 May 2022 17:08:26 GMT
- Title: On learning agent-based models from data
- Authors: Corrado Monti, Marco Pangallo, Gianmarco De Francisci Morales,
Francesco Bonchi
- Abstract summary: Agent-Based Models (ABMs) are used in several fields to study the evolution of complex systems from micro-level assumptions.
We propose a protocol to learn the latent micro-variables of an ABM from data.
- Score: 22.387666772159974
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Agent-Based Models (ABMs) are used in several fields to study the evolution
of complex systems from micro-level assumptions. However, ABMs typically can
not estimate agent-specific (or "micro") variables: this is a major limitation
which prevents ABMs from harnessing micro-level data availability and which
greatly limits their predictive power. In this paper, we propose a protocol to
learn the latent micro-variables of an ABM from data. The first step of our
protocol is to reduce an ABM to a probabilistic model, characterized by a
computationally tractable likelihood. This reduction follows two general design
principles: balance of stochasticity and data availability, and replacement of
unobservable discrete choices with differentiable approximations. Then, our
protocol proceeds by maximizing the likelihood of the latent variables via a
gradient-based expectation maximization algorithm. We demonstrate our protocol
by applying it to an ABM of the housing market, in which agents with different
incomes bid higher prices to live in high-income neighborhoods. We demonstrate
that the obtained model allows accurate estimates of the latent variables,
while preserving the general behavior of the ABM. We also show that our
estimates can be used for out-of-sample forecasting. Our protocol can be seen
as an alternative to black-box data assimilation methods, that forces the
modeler to lay bare the assumptions of the model, to think about the
inferential process, and to spot potential identification problems.
Related papers
- Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks [2.749593964424624]
Agent-Based Models (ABMs) are powerful tools for studying emergent properties in complex systems.<n>We propose a novel framework to learn a differentiable surrogate of any ABM by observing its generated data.<n>Our method combines diffusion models to capture behaviorality and graph neural networks to model agent interactions.
arXiv Detail & Related papers (2025-05-27T16:55:56Z) - Model-free Methods for Event History Analysis and Efficient Adjustment (PhD Thesis) [55.2480439325792]
This thesis is a series of independent contributions to statistics unified by a model-free perspective.
The first chapter elaborates on how a model-free perspective can be used to formulate flexible methods that leverage prediction techniques from machine learning.
The second chapter studies the concept of local independence, which describes whether the evolution of one process is directly influenced by another.
arXiv Detail & Related papers (2025-02-11T19:24:09Z) - Pseudo-Probability Unlearning: Towards Efficient and Privacy-Preserving Machine Unlearning [59.29849532966454]
We propose PseudoProbability Unlearning (PPU), a novel method that enables models to forget data to adhere to privacy-preserving manner.
Our method achieves over 20% improvements in forgetting error compared to the state-of-the-art.
arXiv Detail & Related papers (2024-11-04T21:27:06Z) - Considerations for Distribution Shift Robustness of Diagnostic Models in Healthcare [10.393967785465536]
In the domain of applied ML for health, it is common to predict $Y$ from $X$ without considering further information about the patient.
In this work, we highlight a data generating mechanism common to healthcare settings and discuss how recent theoretical results from the causality literature can be applied to build robust predictive models.
arXiv Detail & Related papers (2024-10-25T14:13:09Z) - Influence Functions for Scalable Data Attribution in Diffusion Models [52.92223039302037]
Diffusion models have led to significant advancements in generative modelling.
Yet their widespread adoption poses challenges regarding data attribution and interpretability.
In this paper, we aim to help address such challenges by developing an textitinfluence functions framework.
arXiv Detail & Related papers (2024-10-17T17:59:02Z) - Minimal Variance Model Aggregation: A principled, non-intrusive, and versatile integration of black box models [0.2455468619225742]
We introduce Minimal Empirical Variance Aggregation (MEVA), a data-driven framework that integrates predictions from various models.
This non-intrusive, model-agnostic approach treats the contributing models as black boxes and accommodates outputs from diverse methodologies.
arXiv Detail & Related papers (2024-09-25T18:33:21Z) - Rejection via Learning Density Ratios [50.91522897152437]
Classification with rejection emerges as a learning paradigm which allows models to abstain from making predictions.
We propose a different distributional perspective, where we seek to find an idealized data distribution which maximizes a pretrained model's performance.
Our framework is tested empirically over clean and noisy datasets.
arXiv Detail & Related papers (2024-05-29T01:32:17Z) - Variational Inference of Parameters in Opinion Dynamics Models [9.51311391391997]
This work uses variational inference to estimate the parameters of an opinion dynamics ABM.
We transform the inference process into an optimization problem suitable for automatic differentiation.
Our approach estimates both macroscopic (bounded confidence intervals and backfire thresholds) and microscopic ($200$ categorical, agent-level roles) more accurately than simulation-based and MCMC methods.
arXiv Detail & Related papers (2024-03-08T14:45:18Z) - Simplifying Complex Observation Models in Continuous POMDP Planning with
Probabilistic Guarantees and Practice [9.444784653236157]
We deal with the question of what would be the implication of using simplified observation models for planning.
Our main contribution is a novel probabilistic bound based on a statistical total variation distance of the simplified model.
Our calculations can be separated into offline and online parts, and we arrive at formal guarantees without having to access the costly model at all during planning.
arXiv Detail & Related papers (2023-11-13T20:55:02Z) - Human Trajectory Forecasting with Explainable Behavioral Uncertainty [63.62824628085961]
Human trajectory forecasting helps to understand and predict human behaviors, enabling applications from social robots to self-driving cars.
Model-free methods offer superior prediction accuracy but lack explainability, while model-based methods provide explainability but cannot predict well.
We show that BNSP-SFM achieves up to a 50% improvement in prediction accuracy, compared with 11 state-of-the-art methods.
arXiv Detail & Related papers (2023-07-04T16:45:21Z) - Predictable MDP Abstraction for Unsupervised Model-Based RL [93.91375268580806]
We propose predictable MDP abstraction (PMA)
Instead of training a predictive model on the original MDP, we train a model on a transformed MDP with a learned action space.
We theoretically analyze PMA and empirically demonstrate that PMA leads to significant improvements over prior unsupervised model-based RL approaches.
arXiv Detail & Related papers (2023-02-08T07:37:51Z) - Calibrating Agent-based Models to Microdata with Graph Neural Networks [1.4911092205861822]
Calibrating agent-based models (ABMs) to data is among the most fundamental requirements to ensure the model fulfils its desired purpose.
We propose to learn parameter posteriors associated with granular microdata directly using temporal graph neural networks.
arXiv Detail & Related papers (2022-06-15T14:41:43Z) - Probabilistic Gradient Boosting Machines for Large-Scale Probabilistic
Regression [51.770998056563094]
Probabilistic Gradient Boosting Machines (PGBM) is a method to create probabilistic predictions with a single ensemble of decision trees.
We empirically demonstrate the advantages of PGBM compared to existing state-of-the-art methods.
arXiv Detail & Related papers (2021-06-03T08:32:13Z) - Forecasting COVID-19 daily cases using phone call data [0.0]
We propose a simple Multiple Linear Regression model optimised to use call data to forecast the number of daily confirmed cases.
Our proposed approach outperforms ARIMA, ETS and a regression model without call data, evaluated by three point forecast error metrics, one prediction interval and two probabilistic forecast accuracy measures.
arXiv Detail & Related papers (2020-10-05T18:07:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.