Towards Optimal Correlational Object Search
- URL: http://arxiv.org/abs/2110.09991v1
- Date: Tue, 19 Oct 2021 14:03:43 GMT
- Title: Towards Optimal Correlational Object Search
- Authors: Kaiyu Zheng, Rohan Chitnis, Yoonchang Sung, George Konidaris, Stefanie
Tellex
- Abstract summary: Correlational Object Search POMDP can be solved to produce search strategies that use correlational information.
We conduct experiments using AI2-THOR, a realistic simulator of household environments, as well as YOLOv5, a widely-used object detector.
- Score: 25.355936023640506
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In realistic applications of object search, robots will need to locate target
objects in complex environments while coping with unreliable sensors,
especially for small or hard-to-detect objects. In such settings, correlational
information can be valuable for planning efficiently: when looking for a fork,
the robot could start by locating the easier-to-detect refrigerator, since
forks would probably be found nearby. Previous approaches to object search with
correlational information typically resort to ad-hoc or greedy search
strategies. In this paper, we propose the Correlational Object Search POMDP
(COS-POMDP), which can be solved to produce search strategies that use
correlational information. COS-POMDPs contain a correlation-based observation
model that allows us to avoid the exponential blow-up of maintaining a joint
belief about all objects, while preserving the optimal solution to this naive,
exponential POMDP formulation. We propose a hierarchical planning algorithm to
scale up COS-POMDP for practical domains. We conduct experiments using
AI2-THOR, a realistic simulator of household environments, as well as YOLOv5, a
widely-used object detector. Our results show that, particularly for
hard-to-detect objects, such as scrub brush and remote control, our method
offers the most robust performance compared to baselines that ignore
correlations as well as a greedy, next-best view approach.
Related papers
- A Modern Take on Visual Relationship Reasoning for Grasp Planning [10.543168383800532]
We present a modern take on visual relational reasoning for grasp planning.
We introduce D3GD, a novel testbed that includes bin picking scenes with up to 35 objects from 97 distinct categories.
We also propose D3G, a new end-to-end transformer-based dependency graph generation model.
arXiv Detail & Related papers (2024-09-03T16:30:48Z) - Language-Conditioned Observation Models for Visual Object Search [12.498575839909334]
We bridge the gap in realistic object search by posing the problem as a partially observable Markov decision process (POMDP)
We incorporate the neural network's outputs into our language-conditioned observation model (LCOM) to represent dynamically changing sensor noise.
We demonstrate our method on a Boston Dynamics Spot robot, enabling it to handle complex natural language object descriptions and efficiently find objects in a room-scale environment.
arXiv Detail & Related papers (2023-09-13T19:30:53Z) - Scale-aware Automatic Augmentation for Object Detection [63.087930708444695]
We propose Scale-aware AutoAug to learn data augmentation policies for object detection.
In experiments, Scale-aware AutoAug yields significant and consistent improvement on various object detectors.
arXiv Detail & Related papers (2021-03-31T17:11:14Z) - Batch Exploration with Examples for Scalable Robotic Reinforcement
Learning [63.552788688544254]
Batch Exploration with Examples (BEE) explores relevant regions of the state-space guided by a modest number of human provided images of important states.
BEE is able to tackle challenging vision-based manipulation tasks both in simulation and on a real Franka robot.
arXiv Detail & Related papers (2020-10-22T17:49:25Z) - DecAug: Augmenting HOI Detection via Decomposition [54.65572599920679]
Current algorithms suffer from insufficient training samples and category imbalance within datasets.
We propose an efficient and effective data augmentation method called DecAug for HOI detection.
Experiments show that our method brings up to 3.3 mAP and 1.6 mAP improvements on V-COCO and HICODET dataset.
arXiv Detail & Related papers (2020-10-02T13:59:05Z) - POMP: Pomcp-based Online Motion Planning for active visual search in
indoor environments [89.43830036483901]
We focus on the problem of learning an optimal policy for Active Visual Search (AVS) of objects in known indoor environments with an online setup.
Our POMP method uses as input the current pose of an agent and a RGB-D frame.
We validate our method on the publicly available AVD benchmark, achieving an average success rate of 0.76 with an average path length of 17.1.
arXiv Detail & Related papers (2020-09-17T08:23:50Z) - SoDA: Multi-Object Tracking with Soft Data Association [75.39833486073597]
Multi-object tracking (MOT) is a prerequisite for a safe deployment of self-driving cars.
We propose a novel approach to MOT that uses attention to compute track embeddings that encode dependencies between observed objects.
arXiv Detail & Related papers (2020-08-18T03:40:25Z) - Visuomotor Mechanical Search: Learning to Retrieve Target Objects in
Clutter [43.668395529368354]
We present a novel Deep RL procedure that combines teacher-aided exploration, ii) a critic with privileged information, andiii) mid-level representations.
Our approach trains faster and converges to more efficient uncovering solutions than baselines and ablations, and that our uncovering policies lead to an average improvement in the graspability of the target object.
arXiv Detail & Related papers (2020-08-13T18:23:00Z) - AutoOD: Automated Outlier Detection via Curiosity-guided Search and
Self-imitation Learning [72.99415402575886]
Outlier detection is an important data mining task with numerous practical applications.
We propose AutoOD, an automated outlier detection framework, which aims to search for an optimal neural network model.
Experimental results on various real-world benchmark datasets demonstrate that the deep model identified by AutoOD achieves the best performance.
arXiv Detail & Related papers (2020-06-19T18:57:51Z) - Semantic Linking Maps for Active Visual Object Search [14.573513188682183]
We exploit background knowledge about common spatial relations between landmark and target objects.
We propose an active visual object search strategy method through our introduction of the Semantic Linking Maps (SLiM) model.
Based on SLiM, we describe a hybrid search strategy that selects the next best view pose for searching for the target object.
arXiv Detail & Related papers (2020-06-18T18:59:44Z) - Learning hierarchical relationships for object-goal navigation [7.074818959144171]
We present Memory-utilized Joint hierarchical Object Learning for Navigation in Indoor Rooms (MJOLNIR)
MJOLNIR is a target-driven navigation algorithm, which considers the inherent relationship between target objects, and the more salient contextual objects occurring in its surrounding.
Our model learns to converge much faster than other algorithms, without suffering from the well-known overfitting problem.
arXiv Detail & Related papers (2020-03-15T04:01:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.