Object-Category Aware Reinforcement Learning
- URL: http://arxiv.org/abs/2210.07802v1
- Date: Thu, 13 Oct 2022 11:31:32 GMT
- Title: Object-Category Aware Reinforcement Learning
- Authors: Qi Yi, Rui Zhang, Shaohui Peng, Jiaming Guo, Xing Hu, Zidong Du,
Xishan Zhang, Qi Guo, and Yunji Chen
- Abstract summary: We propose a novel framework named Object-Category Aware Reinforcement Learning (OCARL)
OCARL uses the category information of objects to facilitate both perception and reasoning.
Our experiments show that OCARL can improve both the sample efficiency and generalization in the OORL domain.
- Score: 18.106722478831113
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object-oriented reinforcement learning (OORL) is a promising way to improve
the sample efficiency and generalization ability over standard RL. Recent works
that try to solve OORL tasks without additional feature engineering mainly
focus on learning the object representations and then solving tasks via
reasoning based on these object representations. However, none of these works
tries to explicitly model the inherent similarity between different object
instances of the same category. Objects of the same category should share
similar functionalities; therefore, the category is the most critical property
of an object. Following this insight, we propose a novel framework named
Object-Category Aware Reinforcement Learning (OCARL), which utilizes the
category information of objects to facilitate both perception and reasoning.
OCARL consists of three parts: (1) Category-Aware Unsupervised Object Discovery
(UOD), which discovers the objects as well as their corresponding categories;
(2) Object-Category Aware Perception, which encodes the category information
and is also robust to the incompleteness of (1) at the same time; (3)
Object-Centric Modular Reasoning, which adopts multiple independent and
object-category-specific networks when reasoning based on objects. Our
experiments show that OCARL can improve both the sample efficiency and
generalization in the OORL domain.
Related papers
- Learning Dynamic Attribute-factored World Models for Efficient
Multi-object Reinforcement Learning [6.447052211404121]
In many reinforcement learning tasks, the agent has to learn to interact with many objects of different types and generalize to unseen combinations and numbers of objects.
Recent works have shown the benefits of object-factored representations and hierarchical abstractions for improving sample efficiency.
We introduce the Dynamic Attribute FacTored RL (DAFT-RL) framework to exploit the benefits of factorization in terms of object attributes.
arXiv Detail & Related papers (2023-07-18T12:41:28Z) - Cycle Consistency Driven Object Discovery [75.60399804639403]
We introduce a method that explicitly optimize the constraint that each object in a scene should be associated with a distinct slot.
By integrating these consistency objectives into various existing slot-based object-centric methods, we showcase substantial improvements in object-discovery performance.
Our results suggest that the proposed approach not only improves object discovery, but also provides richer features for downstream tasks.
arXiv Detail & Related papers (2023-06-03T21:49:06Z) - Universal Instance Perception as Object Discovery and Retrieval [90.96031157557806]
UNI reformulates diverse instance perception tasks into a unified object discovery and retrieval paradigm.
It can flexibly perceive different types of objects by simply changing the input prompts.
UNI shows superior performance on 20 challenging benchmarks from 10 instance-level tasks.
arXiv Detail & Related papers (2023-03-12T14:28:24Z) - Category-Aware Transformer Network for Better Human-Object Interaction
Detection [20.857034771924997]
We study the issue of promoting transformer-based HOI detectors by initializing the Object Query with category-aware semantic information.
Specifically, the Object Query would be represented via category priors represented by an external object detection model to yield better performance.
A HOI detection model equipped with our idea outperforms the baseline by a large margin to achieve a new state-of-the-art result.
arXiv Detail & Related papers (2022-04-11T07:21:24Z) - Complex-Valued Autoencoders for Object Discovery [62.26260974933819]
We propose a distributed approach to object-centric representations: the Complex AutoEncoder.
We show that this simple and efficient approach achieves better reconstruction performance than an equivalent real-valued autoencoder on simple multi-object datasets.
We also show that it achieves competitive unsupervised object discovery performance to a SlotAttention model on two datasets, and manages to disentangle objects in a third dataset where SlotAttention fails - all while being 7-70 times faster to train.
arXiv Detail & Related papers (2022-04-05T09:25:28Z) - Disentangling What and Where for 3D Object-Centric Representations
Through Active Inference [4.088019409160893]
We propose an active inference agent that can learn novel object categories over time.
We show that our agent is able to learn representations for many object categories in an unsupervised way.
We validate our system in an end-to-end fashion where the agent is able to search for an object at a given pose from a pixel-based rendering.
arXiv Detail & Related papers (2021-08-26T12:49:07Z) - Learning Open-World Object Proposals without Learning to Classify [110.30191531975804]
We propose a classification-free Object Localization Network (OLN) which estimates the objectness of each region purely by how well the location and shape of a region overlaps with any ground-truth object.
This simple strategy learns generalizable objectness and outperforms existing proposals on cross-category generalization.
arXiv Detail & Related papers (2021-08-15T14:36:02Z) - Closing the Generalization Gap in One-Shot Object Detection [92.82028853413516]
We show that the key to strong few-shot detection models may not lie in sophisticated metric learning approaches, but instead in scaling the number of categories.
Future data annotation efforts should therefore focus on wider datasets and annotate a larger number of categories.
arXiv Detail & Related papers (2020-11-09T09:31:17Z) - Look-into-Object: Self-supervised Structure Modeling for Object
Recognition [71.68524003173219]
We propose to "look into object" (explicitly yet intrinsically model the object structure) through incorporating self-supervisions.
We show the recognition backbone can be substantially enhanced for more robust representation learning.
Our approach achieves large performance gain on a number of benchmarks, including generic object recognition (ImageNet) and fine-grained object recognition tasks (CUB, Cars, Aircraft)
arXiv Detail & Related papers (2020-03-31T12:22:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.