Related papers: Supply Chain Optimization via Generative Simulation and Iterative Decision Policies

Supply Chain Optimization via Generative Simulation and Iterative Decision Policies

URL: http://arxiv.org/abs/2507.07355v1
Date: Thu, 10 Jul 2025 00:41:15 GMT
Title: Supply Chain Optimization via Generative Simulation and Iterative Decision Policies
Authors: Haoyue Bai, Haoyu Wang, Nanxu Gong, Xinyuan Wang, Wangyang Ying, Haifeng Chen, Yanjie Fu,
Abstract summary: Sim-to-Dec is a framework combining an efficient simulator with an intelligent decision-making algorithm.<n>Experiments conducted on three real-world datasets demonstrate that Sim-to-Dec significantly improves timely delivery rates and profit.
Score: 39.67447490193419
License: http://creativecommons.org/licenses/by/4.0/
Abstract: High responsiveness and economic efficiency are critical objectives in supply chain transportation, both of which are influenced by strategic decisions on shipping mode. An integrated framework combining an efficient simulator with an intelligent decision-making algorithm can provide an observable, low-risk environment for transportation strategy design. An ideal simulation-decision framework must (1) generalize effectively across various settings, (2) reflect fine-grained transportation dynamics, (3) integrate historical experience with predictive insights, and (4) maintain tight integration between simulation feedback and policy refinement. We propose Sim-to-Dec framework to satisfy these requirements. Specifically, Sim-to-Dec consists of a generative simulation module, which leverages autoregressive modeling to simulate continuous state changes, reducing dependence on handcrafted domain-specific rules and enhancing robustness against data fluctuations; and a history-future dual-aware decision model, refined iteratively through end-to-end optimization with simulator interactions. Extensive experiments conducted on three real-world datasets demonstrate that Sim-to-Dec significantly improves timely delivery rates and profit.

Related papers

MR-LDM -- The Merge-Reactive Longitudinal Decision Model: Game Theoretic Human Decision Modeling for Interactive Sim Agents [0.9883562565157391]
We aim to improve the simulation of the highway merge scenario by targeting a game theoretic model for tactical decision-making.<n>We couple this with an underlying dynamics model to have a unified decision and dynamics model that can capture more realistic interactions.
arXiv Detail & Related papers (2025-07-15T20:41:00Z)
From Abstraction to Reality: DARPA's Vision for Robust Sim-to-Real Autonomy [6.402441477393285]
TIAMAT aims to address rapid and robust transfer of autonomy technologies across dynamic and complex environments.<n>Current methods for simulation-to-reality (sim-to-real) transfer often rely on high-fidelity simulations.<n>TIAMAT's approaches aim to achieve abstract-to-real transfer for effective and rapid real-world adaptation.
arXiv Detail & Related papers (2025-03-14T02:06:10Z)
Safety-Critical Traffic Simulation with Adversarial Transfer of Driving Intentions [11.633051537198687]
IntSim is a strategy that explicitly decouples the driving intentions of surrounding actors from their motion planning.<n>IntSim achieves state-of-the-art performance in simulating realistic safety-critical scenarios.
arXiv Detail & Related papers (2025-03-07T06:59:27Z)
LoopSR: Looping Sim-and-Real for Lifelong Policy Adaptation of Legged Robots [20.715834172041763]
We propose LoopSR, a lifelong policy adaptation framework that continuously refines RL policies in the post-deployment stage.<n>LoopSR employs a transformer-based encoder to map real-world trajectories into a latent space.<n>Autoencoder architecture and contrastive learning methods are adopted to enhance feature extraction of real-world dynamics.
arXiv Detail & Related papers (2024-09-26T16:02:25Z)
SAFE-SIM: Safety-Critical Closed-Loop Traffic Simulation with Diffusion-Controllable Adversaries [94.84458417662407]
We introduce SAFE-SIM, a controllable closed-loop safety-critical simulation framework. Our approach yields two distinct advantages: 1) generating realistic long-tail safety-critical scenarios that closely reflect real-world conditions, and 2) providing controllable adversarial behavior for more comprehensive and interactive evaluations. We validate our framework empirically using the nuScenes and nuPlan datasets across multiple planners, demonstrating improvements in both realism and controllability.
arXiv Detail & Related papers (2023-12-31T04:14:43Z)
Reinforcement Learning with Human Feedback for Realistic Traffic Simulation [53.85002640149283]
Key element of effective simulation is the incorporation of realistic traffic models that align with human knowledge. This study identifies two main challenges: capturing the nuances of human preferences on realism and the unification of diverse traffic simulation models.
arXiv Detail & Related papers (2023-09-01T19:29:53Z)
AdaptSim: Task-Driven Simulation Adaptation for Sim-to-Real Transfer [10.173835871228718]
AdaptSim aims to optimize task performance in target (real) environments. First, we meta-learn an adaptation policy in simulation using reinforcement learning. We then perform iterative real-world adaptation by inferring new simulation parameter distributions for policy training.
arXiv Detail & Related papers (2023-02-09T19:10:57Z)
Analyzing and Enhancing Closed-loop Stability in Reactive Simulation [25.27603440925488]
We propose a new reactive simulation framework to bridge the human behavior gap between simulation and real-world traffic scenarios. We first propose a new reactive simulation framework, where the smoothness and consistency of the simulated state sequences are crucial factors to stability. We then incorporate the kinematic vehicle model into the framework to improve the closed-loop stability of the reactive simulation.
arXiv Detail & Related papers (2022-08-09T06:31:35Z)
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization [60.73540999409032]
We show that expressive autoregressive dynamics models generate different dimensions of the next state and reward sequentially conditioned on previous dimensions. We also show that autoregressive dynamics models are useful for offline policy optimization by serving as a way to enrich the replay buffer.
arXiv Detail & Related papers (2021-04-28T16:48:44Z)
Multi-intersection Traffic Optimisation: A Benchmark Dataset and a Strong Baseline [85.9210953301628]
Control of traffic signals is fundamental and critical to alleviate traffic congestion in urban areas. Because of the high complexity of modelling the problem, experimental settings of current works are often inconsistent. We propose a novel and strong baseline model based on deep reinforcement learning with the encoder-decoder structure.
arXiv Detail & Related papers (2021-01-24T03:55:39Z)
TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors [74.67698916175614]
We propose TrafficSim, a multi-agent behavior model for realistic traffic simulation. In particular, we leverage an implicit latent variable model to parameterize a joint actor policy. We show TrafficSim generates significantly more realistic and diverse traffic scenarios as compared to a diverse set of baselines.
arXiv Detail & Related papers (2021-01-17T00:29:30Z)

This list is automatically generated from the titles and abstracts of the papers in this site.