GUTS: Generalized Uncertainty-Aware Thompson Sampling for Multi-Agent
  Active Search
- URL: http://arxiv.org/abs/2304.02075v1
- Date: Tue, 4 Apr 2023 18:58:16 GMT
- Title: GUTS: Generalized Uncertainty-Aware Thompson Sampling for Multi-Agent
  Active Search
- Authors: Nikhil Angad Bakshi, Tejus Gupta, Ramina Ghods, Jeff Schneider
- Abstract summary: The Generalized Uncertainty-aware Thompson Sampling (GUTS) algorithm is suitable for deployment on heterogeneous multi-robot systems for active search in large unstructured environments.
We conduct field tests using our multi-robot system in an unstructured environment with a search area of 75,000 sq. m.
- Score: 5.861092453610268
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Robotic solutions for quick disaster response are essential to ensure minimal
loss of life, especially when the search area is too dangerous or too vast for
human rescuers. We model this problem as an asynchronous multi-agent
active-search task where each robot aims to efficiently seek objects of
interest (OOIs) in an unknown environment. This formulation addresses the
requirement that search missions should focus on quick recovery of OOIs rather
than full coverage of the search region. Previous approaches fail to accurately
model sensing uncertainty, account for occlusions due to foliage or terrain, or
consider the requirement for heterogeneous search teams and robustness to
hardware and communication failures. We present the Generalized
Uncertainty-aware Thompson Sampling (GUTS) algorithm, which addresses these
issues and is suitable for deployment on heterogeneous multi-robot systems for
active search in large unstructured environments. We show through simulation
experiments that GUTS consistently outperforms existing methods such as
parallelized Thompson Sampling and exhaustive search, recovering all OOIs in
80% of all runs. In contrast, existing approaches recover all OOIs in less than
40% of all runs. We conduct field tests using our multi-robot system in an
unstructured environment with a search area of approximately 75,000 sq. m. Our
system demonstrates robustness to various failure modes, achieving full
recovery of OOIs (where feasible) in every field run, and significantly
outperforming our baseline.
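
The abstract describes asynchronous multi-agent active search driven by Thompson sampling. As a rough illustration only, the sketch below shows the general idea on a toy 1-D grid. It is not the GUTS algorithm: it assumes an independent Gaussian posterior per cell, a single noise level shared by all agents, and no occlusion or heterogeneous sensing. The function name `thompson_active_search` and all of its parameters are hypothetical.

```python
import numpy as np


def thompson_active_search(truth, n_agents=3, n_steps=60, noise_std=0.3, seed=0):
    """Toy asynchronous Thompson-sampling search over a 1-D grid of cells.

    truth[i] is 1.0 if cell i holds an object of interest (OOI), else 0.0.
    Simplifying assumption: each cell has an independent Gaussian posterior.
    """
    rng = np.random.default_rng(seed)
    n_cells = truth.size
    mean = np.zeros(n_cells)           # posterior mean per cell
    precision = np.ones(n_cells)       # posterior precision; prior is N(0, 1)
    noise_precision = 1.0 / noise_std**2
    history = []                       # (agent, cell) pairs in the order sensed

    for step in range(n_steps):
        # Agents act one at a time on the shared posterior; none waits for
        # the others, which mimics asynchronous operation in this toy setup.
        agent = step % n_agents

        # Thompson sampling: draw one plausible world from the posterior and
        # sense the cell that looks best in that sampled world.
        sample = rng.normal(mean, 1.0 / np.sqrt(precision))
        cell = int(np.argmax(sample))

        # Noisy measurement of the chosen cell.
        y = truth[cell] + rng.normal(0.0, noise_std)

        # Conjugate Gaussian update for the sensed cell only.
        new_precision = precision[cell] + noise_precision
        mean[cell] = (precision[cell] * mean[cell] + noise_precision * y) / new_precision
        precision[cell] = new_precision
        history.append((agent, cell))

    return mean, precision, history
```

Because every decision uses a fresh posterior sample, agents sharing the same beliefs still tend to pick different cells, which is the property that makes Thompson sampling attractive for decentralized teams. Thresholding the returned posterior mean (for example at 0.5) gives a simple estimate of which cells contain OOIs.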
 
      
Related papers
        - Benchmarking Deep Search over Heterogeneous Enterprise Data [73.55304268238474]
 We present a new benchmark for evaluating a form of retrieval-augmented generation (RAG).
RAG requires source-aware, multi-hop reasoning over diverse, sparse, but related sources.
We build it using a synthetic data pipeline that simulates business across product planning, development, and support stages.
 arXiv  Detail & Related papers  (2025-06-29T08:34:59Z)
- MMSearch-R1: Incentivizing LMMs to Search [49.889749277236376]
 We present MMSearch-R1, the first end-to-end reinforcement learning framework that enables on-demand, multi-turn search in real-world Internet environments.
Our framework integrates both image and text search tools, allowing the model to reason about when and how to invoke them guided by an outcome-based reward with a search penalty.
 arXiv  Detail & Related papers  (2025-06-25T17:59:42Z)
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis [89.99161034065614]
 Retrieval-augmented generation (RAG) systems have advanced large language models (LLMs) in complex deep search scenarios.
Existing approaches face critical limitations: they lack high-quality training trajectories and suffer from distributional mismatches.
This paper introduces SimpleDeepSearcher, a framework that bridges the gap through strategic data engineering rather than complex training paradigms.
 arXiv  Detail & Related papers  (2025-05-22T16:05:02Z)
- Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions [89.35345649303451]
 Generative search engines have the potential to transform how people seek information online.
But responses generated by existing generative search engines backed by large language models (LLMs) may not always be accurate.
Retrieval-augmented generation exacerbates safety concerns, since adversaries may successfully evade the entire system.
 arXiv  Detail & Related papers  (2024-02-25T11:22:19Z)
- From Simulations to Reality: Enhancing Multi-Robot Exploration for Urban
  Search and Rescue [46.377510400989536]
 We present a novel hybrid algorithm for efficient multi-robot exploration in unknown environments with limited communication and no global positioning information.
We redefine the local best and global best positions to suit scenarios without continuous target information.
The presented work holds promise for enhancing multi-robot exploration in scenarios with limited information and communication capabilities.
 arXiv  Detail & Related papers  (2023-11-28T17:05:25Z)
- Adversarial Search and Tracking with Multiagent Reinforcement Learning
  in Sparsely Observable Environment [7.195547595036644]
 We study a search and tracking (S&T) problem where a team of dynamic search agents must collaborate to track an adversarial, evasive agent.
This problem is challenging for both model-based searching and reinforcement learning (RL) methods since the adversary exhibits reactionary and deceptive evasive behaviors in a large space leading to sparse detections for the search agents.
We propose a novel Multi-Agent RL (MARL) framework that leverages the estimated adversary location from our learnable filtering model.
 arXiv  Detail & Related papers  (2023-06-20T05:31:13Z)
- Factorization of Multi-Agent Sampling-Based Motion Planning [72.42734061131569]
 Modern robotics often involves multiple embodied agents operating within a shared environment.
Standard sampling-based algorithms can be used to search for solutions in the robots' joint space.
We integrate the concept of factorization into sampling-based algorithms, which requires only minimal modifications to existing methods.
We present a general implementation of a factorized SBA, derive an analytical gain in terms of sample complexity for PRM*, and showcase empirical results for RRG.
 arXiv  Detail & Related papers  (2023-04-01T15:50:18Z)
- Heuristic-free Optimization of Force-Controlled Robot Search Strategies
  in Stochastic Environments [13.622757453459748]
 Even relatively simple peg-in-hole tasks are typically subject to variations, requiring search motions to find relevant features such as holes.
This paper introduces an automatic, data-driven and conditioning-free approach to optimize search strategies.
We evaluate our approach on two different industrial robots in the context of spiral and probe search for THT electronics assembly.
 arXiv  Detail & Related papers  (2022-07-15T15:16:08Z)
- Loss Function Discovery for Object Detection via Convergence-Simulation
  Driven Search [101.73248560009124]
 We propose an effective convergence-simulation driven evolutionary search algorithm, CSE-Autoloss, for speeding up the search progress.
We conduct extensive evaluations of loss function search on popular detectors and validate the good generalization capability of searched losses.
Our experiments show that the best-discovered loss function combinations outperform default combinations by 1.1% and 0.8% in terms of mAP for two-stage and one-stage detectors.
 arXiv  Detail & Related papers  (2021-02-09T08:34:52Z)
- Multi-Agent Active Search using Realistic Depth-Aware Noise Model [8.520962086877548]
 Active search for objects of interest in an unknown environment has many robotics applications including search and rescue, detecting gas leaks or locating animal poachers.
Existing algorithms often prioritize the location accuracy of objects of interest while other practical issues such as the reliability of object detection as a function of distance and lines of sight remain largely ignored.
We present an algorithm called Noise-Aware Thompson Sampling (NATS) that addresses these issues for multiple ground-based robots performing active search considering two sources of sensory information from monocular optical imagery and depth maps.
 arXiv  Detail & Related papers  (2020-11-09T23:20:55Z)
- Batch Exploration with Examples for Scalable Robotic Reinforcement
  Learning [63.552788688544254]
 Batch Exploration with Examples (BEE) explores relevant regions of the state-space guided by a modest number of human provided images of important states.
BEE is able to tackle challenging vision-based manipulation tasks both in simulation and on a real Franka robot.
 arXiv  Detail & Related papers  (2020-10-22T17:49:25Z)
- Asynchronous Multi Agent Active Search [6.587280549237275]
 We propose two distinct active search algorithms called SPATS (Sparse Parallel Asynchronous Thompson Sampling) and LATSI (LAplace Thompson Sampling with Information gain).
We consider that targets are sparsely located around the environment in keeping with compressive sensing assumptions.
We provide simulation results as well as theoretical analysis to demonstrate the efficacy of our proposed algorithms.
 arXiv  Detail & Related papers  (2020-06-25T22:17:20Z)
- AutoOD: Automated Outlier Detection via Curiosity-guided Search and
  Self-imitation Learning [72.99415402575886]
 Outlier detection is an important data mining task with numerous practical applications.
We propose AutoOD, an automated outlier detection framework, which aims to search for an optimal neural network model.
 Experimental results on various real-world benchmark datasets demonstrate that the deep model identified by AutoOD achieves the best performance.
 arXiv  Detail & Related papers  (2020-06-19T18:57:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     