Machine Generalize Learning in Agent-Based Models: Going Beyond Surrogate Models for Calibration in ABMs
- URL: http://arxiv.org/abs/2509.07013v1
- Date: Sat, 06 Sep 2025 18:28:00 GMT
- Title: Machine Generalize Learning in Agent-Based Models: Going Beyond Surrogate Models for Calibration in ABMs
- Authors: Sima Najafzadehkhoei, George Vega Yon, Bernardo Modenesi, Derek S. Meyer,
- Abstract summary: Calibrating agent-based epidemic models are computationally demanding.<n>We present a supervised machine learning calibrator that learns the inverse mapping from epidemic time series to SIR parameters.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Calibrating agent-based epidemic models is computationally demanding. We present a supervised machine learning calibrator that learns the inverse mapping from epidemic time series to SIR parameters. A three-layer bidirectional LSTM ingests 60-day incidence together with population size and recovery rate, and outputs transmission probability, contact rate, and R0. Training uses a composite loss with an epidemiology-motivated consistency penalty that encourages R0 \* recovery rate to equal transmission probability \* contact rate. In a 1000-scenario simulation study, we compare the calibrator with Approximate Bayesian Computation (likelihood-free MCMC). The method achieves lower error across all targets (MAE: R0 0.0616 vs 0.275; transmission 0.0715 vs 0.128; contact 1.02 vs 4.24), produces tighter predictive intervals with near nominal coverage, and reduces wall clock time from 77.4 s to 2.35 s per calibration. Although contact rate and transmission probability are partially nonidentifiable, the approach reproduces epidemic curves more faithfully than ABC, enabling fast and practical calibration. We evaluate it on SIR agent based epidemics generated with epiworldR and provide an implementation in R.
Related papers
- Suppressing Prior-Comparison Hallucinations in Radiology Report Generation via Semantically Decoupled Latent Steering [94.37535002230504]
We develop a training-free, inference-time control framework termed Semantically Decoupled Latent Steering.<n>Our approach constructs a semantic-free intervention vector via large language model (LLM)-driven semantic decomposition.<n>We show that our approach significantly reduces the probability of historical hallucinations.
arXiv Detail & Related papers (2026-02-27T04:49:01Z) - Efficient Variance-reduced Estimation from Generative EHR Models: The SCOPE and REACH Estimators [1.0765331200875645]
Generative models trained using tokenized electronic health record timelines show promise for clinical outcome prediction.<n>Existing approaches suffer from three key limitations: sparse estimate distributions that poorly differentiate patient risk levels, extreme computational costs, and high sampling variance.<n>We propose two new estimators: the Sum of Conditional Outcome Probability Estimator (SCOPE) and Risk Estimation from Anticipated Conditional Hazards (REACH), that leverage next-token probability distributions discarded by standard Monte Carlo.
arXiv Detail & Related papers (2026-02-03T16:49:44Z) - RAPT: Model-Predictive Out-of-Distribution Detection and Failure Diagnosis for Sim-to-Real Humanoid Robots [1.5765892172285598]
We present RAPT, a lightweight, self-supervised deployment-time monitor for 50Hz humanoid control.<n>RAPT learns a probabilistic-temporal manifold of nominal execution from simulation and evaluates execution-time predictive deviation.<n>We evaluate RAPT on a Unitree G1 humanoid across four complex tasks in simulation and on physical hardware.
arXiv Detail & Related papers (2026-02-02T01:04:55Z) - Residual GRU+MHSA: A Lightweight Hybrid Recurrent Attention Model for Cardiovascular Disease Detection [1.267904597444312]
We propose Residual GRU with Multi-Head Self-Attention, a compact deep learning architecture for clinical records.<n>We evaluate the model on the UCI Heart Disease dataset using 5-fold stratified cross-validation.<n>The proposed model achieves an accuracy of 0.861, macro-F1 of 0.860, ROC-AUC of 0.908, and PR-AUC of 0.904, outperforming all baselines.
arXiv Detail & Related papers (2025-12-16T16:33:59Z) - Transparent Early ICU Mortality Prediction with Clinical Transformer and Per-Case Modality Attribution [42.85462513661566]
We present a lightweight, transparent multimodal ensemble that fuses physiological time-series measurements with unstructured clinical notes from the first 48 hours of an ICU stay.<n>A logistic regression model combines predictions from two modality-specific models: a bidirectional LSTM for vitals and a finetuned ClinicalModernBERT transformer for notes.<n>On the MIMIC-III benchmark, our late-fusion ensemble improves discrimination over the best single model while maintaining well-calibrated predictions.
arXiv Detail & Related papers (2025-11-19T20:11:49Z) - Agentic World Modeling for 6G: Near-Real-Time Generative State-Space Reasoning [70.56067503630486]
We argue that sixth-generation (6G) intelligence is not fluent token prediction but calibrated the capacity to imagine and choose.<n>We show that WM-MS3M cuts mean absolute error (MAE) by 1.69% versus MS3M with 32% fewer parameters and similar latency, and achieves 35-80% lower root mean squared error (RMSE) than attention/hybrid baselines with 2.3-4.1x faster inference.
arXiv Detail & Related papers (2025-11-04T17:22:22Z) - Generalizable Diabetes Risk Stratification via Hybrid Machine Learning Models [0.0]
Diabetes affects over 537 million people worldwide and is projected to reach 783 million by 2045.<n>We compare two hybrid classifiers and assess their generalizability on an external cohort.
arXiv Detail & Related papers (2025-09-24T21:18:52Z) - Organ-Agents: Virtual Human Physiology Simulator via LLMs [66.40796430669158]
Organ-Agents is a multi-agent framework that simulates human physiology via LLM-driven agents.<n>We curated data from 7,134 sepsis patients and 7,895 controls, generating high-resolution trajectories across 9 systems and 125 variables.<n>Organ-Agents achieved high simulation accuracy on 4,509 held-out patients, with per-system MSEs 0.16 and robustness across SOFA-based severity strata.
arXiv Detail & Related papers (2025-08-20T01:58:45Z) - Continuous sPatial-Temporal Deformable Image Registration (CPT-DIR) for motion modelling in radiotherapy: beyond classic voxel-based methods [28.526710503853877]
Current deformable image registration (DIR) implementations rely on discrete volumetric motion representation.<n>We propose a novel approach using implicit neural representation (INR) for continuous modeling of patient anatomical motion.<n>CPT-DIR significantly enhances registration and accuracy, automation, and speed.
arXiv Detail & Related papers (2024-05-01T10:26:08Z) - Neural parameter calibration and uncertainty quantification for epidemic
forecasting [0.0]
We apply a novel and powerful computational method to the problem of learning probability densities on contagion parameters.
Using a neural network, we calibrate an ODE model to data of the spread of COVID-19 in Berlin in 2020.
We show convergence of our method to the true posterior on a simplified SIR model of epidemics, and also demonstrate our method's learning capabilities on a reduced dataset.
arXiv Detail & Related papers (2023-12-05T21:34:59Z) - Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion [56.38386580040991]
Consistency Trajectory Model (CTM) is a generalization of Consistency Models (CM)
CTM enables the efficient combination of adversarial training and denoising score matching loss to enhance performance.
Unlike CM, CTM's access to the score function can streamline the adoption of established controllable/conditional generation methods.
arXiv Detail & Related papers (2023-10-01T05:07:17Z) - Continuous time recurrent neural networks: overview and application to
forecasting blood glucose in the intensive care unit [56.801856519460465]
Continuous time autoregressive recurrent neural networks (CTRNNs) are a deep learning model that account for irregular observations.
We demonstrate the application of these models to probabilistic forecasting of blood glucose in a critical care setting.
arXiv Detail & Related papers (2023-04-14T09:39:06Z) - Distributionally Robust Models with Parametric Likelihood Ratios [123.05074253513935]
Three simple ideas allow us to train models with DRO using a broader class of parametric likelihood ratios.
We find that models trained with the resulting parametric adversaries are consistently more robust to subpopulation shifts when compared to other DRO approaches.
arXiv Detail & Related papers (2022-04-13T12:43:12Z) - Bayesian logistic regression for online recalibration and revision of
risk prediction models with performance guarantees [6.709991492637819]
We introduce two procedures for continual recalibration or revision of an underlying prediction model.
We perform empirical evaluation via simulations and a real-world study predicting COPD risk.
We derive "Type I and II" regret bounds, which guarantee the procedures are non-inferior to a static model and competitive with an oracle logistic reviser.
arXiv Detail & Related papers (2021-10-13T17:03:21Z) - Comparative Analysis of Machine Learning Approaches to Analyze and
Predict the Covid-19 Outbreak [10.307715136465056]
We present a comparative analysis of various machine learning (ML) approaches in predicting the COVID-19 outbreak in the epidemiological domain.
The results reveal the advantages of ML algorithms for supporting decision making of evolving short term policies.
arXiv Detail & Related papers (2021-02-11T11:57:33Z) - A Time-dependent SIR model for COVID-19 with Undetectable Infected
Persons [5.315136504175843]
We propose a time-dependent susceptible-infected-recovered (SIR) model that tracks 2 time series.
Using the data provided by China, we show that the one-day prediction errors for the numbers of confirmed cases are almost in 3%.
Also, the turning point, defined as the day that the transmission rate is less than the recovering rate can be accurately predicted.
arXiv Detail & Related papers (2020-02-28T23:44:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.