Related papers: Multi-source Plume Tracing via Multi-Agent Reinforcement Learning

Multi-source Plume Tracing via Multi-Agent Reinforcement Learning

URL: http://arxiv.org/abs/2505.08825v1
Date: Mon, 12 May 2025 21:33:15 GMT
Title: Multi-source Plume Tracing via Multi-Agent Reinforcement Learning
Authors: Pedro Antonio Alarcon Granadeno, Theodore Chambers, Jane Cleland-Huang,
Abstract summary: Industrial catastrophes like the Bhopal disaster demonstrate the need for rapid and reliable plume tracing algorithms.<n>Traditional methods, such as gradient-based or biologically inspired approaches, often fail in realistic, turbulent conditions.<n>We present a Multi-Agent Reinforcement Learning (MARL) algorithm designed for localizing multiple airborne pollution sources.
Score: 41.03292974500013
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Industrial catastrophes like the Bhopal disaster (1984) and the Aliso Canyon gas leak (2015) demonstrate the urgent need for rapid and reliable plume tracing algorithms to protect public health and the environment. Traditional methods, such as gradient-based or biologically inspired approaches, often fail in realistic, turbulent conditions. To address these challenges, we present a Multi-Agent Reinforcement Learning (MARL) algorithm designed for localizing multiple airborne pollution sources using a swarm of small uncrewed aerial systems (sUAS). Our method models the problem as a Partially Observable Markov Game (POMG), employing a Long Short-Term Memory (LSTM)-based Action-specific Double Deep Recurrent Q-Network (ADDRQN) that uses full sequences of historical action-observation pairs, effectively approximating latent states. Unlike prior work, we use a general-purpose simulation environment based on the Gaussian Plume Model (GPM), incorporating realistic elements such as a three-dimensional environment, sensor noise, multiple interacting agents, and multiple plume sources. The incorporation of action histories as part of the inputs further enhances the adaptability of our model in complex, partially observable environments. Extensive simulations show that our algorithm significantly outperforms conventional approaches. Specifically, our model allows agents to explore only 1.29\% of the environment to successfully locate pollution sources.

Related papers

Towards Operational Automated Greenhouse Gas Plume Detection [0.15556354682377155]
This work reviews and addresses several key obstacles in the field: data and label quality control, prevention of biases, and correctly aligned modeling objectives.<n>We demonstrate through rigorous experiments using multicampaign data from airborne and spaceborne instruments that are able to achieve operational performance detection.<n>We provide analysis-ready data, models, and source code for deployment and work to define a set of best practices.
arXiv Detail & Related papers (2025-05-27T22:22:54Z)
Open-set Anomaly Segmentation in Complex Scenarios [88.11076112792992]
This paper introduces ComsAmy, a benchmark for open-set anomaly segmentation in complex scenarios.<n>ComsAmy encompasses a wide spectrum of adverse weather conditions, dynamic driving environments, and diverse anomaly types.<n>We propose a novel energy-entropy learning (EEL) strategy that integrates the complementary information from energy and entropy.
arXiv Detail & Related papers (2025-04-28T12:00:10Z)
Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation [13.073844945948132]
Atmospheric turbulence is a major source of image degradation in long-range imaging systems.<n>Many deep learning-based turbulence mitigation (TM) methods have been proposed, but they are slow, memory-hungry, and do not generalize well.<n>We present a new TM method based on two concepts: (1) A turbulence mitigation network based on the Selective State Space Model (MambaTM) and (2) Learned Latent Phase Distortion (LPD)<n>Our proposed method exceeds current state-of-the-art networks on various synthetic and real-world TM benchmarks with significantly faster inference speed.
arXiv Detail & Related papers (2025-04-03T15:33:18Z)
Whenever, Wherever: Towards Orchestrating Crowd Simulations with Spatio-Temporal Spawn Dynamics [65.72663487116439]
We propose nTPP-GMM that models spawn-temporal spawn dynamics using Neural Temporal Point Processes.<n>We evaluate our approach by simulations of three diverse real-world datasets with nTPP-GMM.
arXiv Detail & Related papers (2025-03-20T18:46:41Z)
Multi-Agent Path Finding in Continuous Spaces with Projected Diffusion Models [57.45019514036948]
Multi-Agent Path Finding (MAPF) is a fundamental problem in robotics.<n>This work proposes a novel approach that integrates constrained optimization with diffusion models for MAPF in continuous spaces.
arXiv Detail & Related papers (2024-12-23T21:27:19Z)
Efficient Unsupervised Domain Adaptation Regression for Spatial-Temporal Sensor Fusion [6.963971634605796]
Low-cost, distributed sensor networks in environmental and biomedical domains have enabled continuous, large-scale health monitoring.<n>These systems often face challenges related to degraded data quality caused by sensor drift, noise, and insufficient calibration.<n>Traditional machine learning methods for sensor fusion and calibration rely on extensive feature engineering.<n>We propose a novel unsupervised domain adaptation (UDA) method tailored for regression tasks.
arXiv Detail & Related papers (2024-11-11T12:20:57Z)
A SAM-guided Two-stream Lightweight Model for Anomaly Detection [44.73985145110819]
We propose a SAM-guided Two-stream Lightweight Model for unsupervised anomaly detection (STLM) Our experiments conducted on MVTec AD benchmark show that STLM, with about 16M parameters and achieving an inference time in 20ms, competes effectively with state-of-the-art methods.
arXiv Detail & Related papers (2024-02-29T13:29:10Z)
Surrogate Model for Geological CO2 Storage and Its Use in Hierarchical MCMC History Matching [0.0]
We extend the recently introduced recurrent R-U-Net surrogate model to treat geomodel realizations drawn from a wide range of geological scenarios. We show that, using observed data from monitoring wells in synthetic true' models, geological uncertainty is reduced substantially.
arXiv Detail & Related papers (2023-08-11T18:29:28Z)
Multi-Agent Reinforcement Learning for Adaptive Mesh Refinement [17.72127385405445]
We present a novel formulation of adaptive mesh refinement (AMR) as a fully-cooperative Markov game. We design a novel deep multi-agent reinforcement learning algorithm called Value Decomposition Graph Network (VDGN) We show that VDGN policies significantly outperform error threshold-based policies in global error and cost metrics.
arXiv Detail & Related papers (2022-11-02T00:41:32Z)
Reduced-order modeling for parameterized large-eddy simulations of atmospheric pollutant dispersion [0.0]
Large-eddy simulations (LES) have the potential to accurately represent pollutant concentration spatial variability. LES become prohibitively costly to deploy to understand how plume flow and tracer dispersion change with various atmospheric and source parameters. We propose a non-intrusive reduced-order model combining proper decomposition (POD) and Gaussian process regression (GPR) to predict LES field statistics of interest associated with tracer concentrations.
arXiv Detail & Related papers (2022-08-02T15:06:22Z)
Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation [93.52573037053449]
H-MARL (Hallucinated Multi-Agent Reinforcement Learning) learns successful equilibrium policies after a few interactions with the environment. We demonstrate our approach experimentally on an autonomous driving simulation benchmark.
arXiv Detail & Related papers (2022-03-14T17:24:03Z)
TurbuGAN: An Adversarial Learning Approach to Spatially-Varying Multiframe Blind Deconvolution with Applications to Imaging Through Turbulence [9.156939957189504]
We present a self-supervised and self-calibrating multi-shot approach to imaging through atmospheric turbulence, called TurbuGAN. Our approach requires no paired training data, adapts itself to the distribution of the turbulence, leverages domain-specific data priors, outperforms existing approaches, and can generalize from tens to tens of thousands of measurements.
arXiv Detail & Related papers (2022-03-13T21:32:34Z)
Provable RL with Exogenous Distractors via Multistep Inverse Dynamics [85.52408288789164]
Real-world applications of reinforcement learning (RL) require the agent to deal with high-dimensional observations such as those generated from a megapixel camera. Prior work has addressed such problems with representation learning, through which the agent can provably extract endogenous, latent state information from raw observations. However, such approaches can fail in the presence of temporally correlated noise in the observations.
arXiv Detail & Related papers (2021-10-17T15:21:27Z)

This list is automatically generated from the titles and abstracts of the papers in this site.