Learning to Track Dynamic Targets in Partially Known Environments
- URL: http://arxiv.org/abs/2006.10190v1
- Date: Wed, 17 Jun 2020 22:45:24 GMT
- Title: Learning to Track Dynamic Targets in Partially Known Environments
- Authors: Heejin Jeong, Hamed Hassani, Manfred Morari, Daniel D. Lee, George J.
Pappas
- Abstract summary: We use a deep reinforcement learning approach to solve active target tracking.
In particular, we introduce Active Tracking Target Network (ATTN), a unified RL policy that is capable of solving major sub-tasks of active target tracking.
- Score: 48.49957897251128
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We solve active target tracking, one of the essential tasks in autonomous
systems, using a deep reinforcement learning (RL) approach. In this problem, an
autonomous agent is tasked with acquiring information about targets of
interest using its onboard sensors. The classical challenges in this problem
are system model dependence and the difficulty of computing
information-theoretic cost functions for a long planning horizon. RL provides
solutions for these challenges as the length of its effective planning horizon
does not affect the computational complexity, and it removes an algorithm's
strong dependence on system models. In particular, we introduce Active
Tracking Target Network (ATTN), a unified RL policy that is capable of solving
major sub-tasks of active target tracking -- in-sight tracking, navigation, and
exploration. The policy shows robust behavior for tracking agile and anomalous
targets with a partially known target model. Additionally, the same policy is
able to navigate in obstacle environments to reach distant targets as well as
explore the environment when targets are positioned in unexpected locations.
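As a rough illustration of the pieces this abstract names (not ATTN itself; the Kalman belief filter and negative-entropy reward below are standard choices, and all names are ours), a Python sketch of how an RL tracking policy can be rewarded for information gain:

```python
import numpy as np

# Sketch only: the RL policy would map (target belief, obstacle observations)
# to a motion command; shown here are the belief update and the
# information-theoretic reward it is trained against.

def kalman_predict(mu, Sigma, A, Q):
    # Propagate the target belief through a (partially known) motion model.
    return A @ mu, A @ Sigma @ A.T + Q

def kalman_update(mu, Sigma, z, H, R):
    # Fuse an onboard-sensor measurement z into the target belief.
    S = H @ Sigma @ H.T + R
    K = Sigma @ H.T @ np.linalg.inv(S)
    mu = mu + K @ (z - H @ mu)
    Sigma = (np.eye(len(mu)) - K @ H) @ Sigma
    return mu, Sigma

def tracking_reward(Sigma):
    # Negative entropy of the Gaussian belief: reducing uncertainty about
    # the target yields higher reward, with no fixed planning horizon.
    _, logdet = np.linalg.slogdet(2.0 * np.pi * np.e * Sigma)
    return -0.5 * logdet
```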
Related papers
- Diffusion-Reinforcement Learning Hierarchical Motion Planning in Adversarial Multi-agent Games [6.532258098619471]
We focus on a motion planning task for an evasive target in partially observable multi-agent adversarial pursuit-evasion games (PEG).
These pursuit-evasion problems are relevant to various applications, such as search and rescue operations and surveillance robots.
We propose a hierarchical architecture that integrates a high-level diffusion model to plan global paths responsive to environment data.
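A minimal sketch of that hierarchical split (the diffusion sampler is stubbed out; interfaces are illustrative, not the paper's):

```python
import numpy as np

def sample_global_path(start, goal, horizon=32):
    # Stand-in for the high-level diffusion model, which would denoise a
    # path sample conditioned on environment data; straight-line waypoints
    # mark where that sample plugs in.
    return np.linspace(start, goal, horizon)

def low_level_action(state, waypoint, gain=1.0):
    # Stand-in for the learned low-level policy that tracks the global path.
    return gain * (waypoint - state)
```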
arXiv Detail & Related papers (2024-03-16T03:53:55Z)
- Mission-driven Exploration for Accelerated Deep Reinforcement Learning with Temporal Logic Task Specifications [11.812602599752294]
We consider robots with unknown dynamics operating in environments with unknown structure.
Our goal is to synthesize a control policy that maximizes the probability of satisfying an automaton-encoded task.
We propose a novel DRL algorithm that learns control policies notably faster than comparable methods.
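One common way to encode such an automaton task for RL, shown as a hedged sketch (the toy automaton and reward below are ours, not the paper's):

```python
# Toy deterministic automaton: visit region "a", then region "b".
ACCEPTING = {2}
DELTA = {(0, "a"): 1, (1, "b"): 2}

def step_automaton(q, label):
    # Advance the automaton state on the label observed in the environment.
    return DELTA.get((q, label), q)

def reward(q, q_next):
    # Sparse reward on acceptance; the policy is trained on the product of
    # environment state and automaton state q, and mission-driven
    # exploration would bias actions toward transitions that advance q.
    return 1.0 if q_next in ACCEPTING and q not in ACCEPTING else 0.0
```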
arXiv Detail & Related papers (2023-11-28T18:59:58Z)
- A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs [6.747468447244154]
This paper proposes a vision-based planning system that combines tracking and trajectory prediction of dynamic obstacles to achieve efficient and reliable autonomous flight.
We conduct experiments in both simulation and real-world environments, and the results indicate that our approach can successfully detect and avoid obstacles in dynamic environments in real-time.
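A minimal track-then-predict sketch under a constant-velocity assumption (the paper's tracker and predictor are not detailed in this summary):

```python
import numpy as np

def predict_obstacle(pos, vel, dt=0.1, horizon=20):
    # Roll the tracked obstacle state forward to get a predicted path.
    return np.array([pos + vel * dt * k for k in range(horizon)])

def path_is_safe(ego_path, obstacle_path, clearance=0.5):
    # Reject a candidate trajectory that comes too close to the prediction.
    dists = np.linalg.norm(ego_path - obstacle_path, axis=1)
    return bool(np.all(dists > clearance))
```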
arXiv Detail & Related papers (2023-11-21T08:09:00Z)
- Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping [94.89128390954572]
We propose a novel self-supervised learning phase on the pre-collected dataset to understand the structure and the dynamics of the model.
We evaluate our method on three continuous control tasks, and show that our model significantly outperforms existing approaches.
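A hedged sketch of how such a self-supervised phase can shape the sparse goal reward (dist_fn stands for the learned state-goal distance; the name and form are ours):

```python
def shaped_reward(sparse_r, dist_fn, state, goal, alpha=0.1):
    # Densify the sparse goal reward with a distance learned from the
    # pre-collected dataset; purely illustrative, not the paper's loss.
    return sparse_r - alpha * dist_fn(state, goal)
```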
arXiv Detail & Related papers (2023-01-05T15:07:10Z)
- Reinforcement Learning for Agile Active Target Sensing with a UAV [10.070339628481445]
This paper develops a deep reinforcement learning approach to plan informative trajectories.
It exploits its current belief of the target states and incorporates inaccurate sensor models for high-fidelity classification.
A unique characteristic of our approach is that it is robust to varying amounts of deviations from the true target distribution.
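A sketch of the belief machinery this implies: a discrete Bayes update with an imperfect sensor model, and an entropy the planner can try to reduce (illustrative, not the paper's implementation):

```python
import numpy as np

def bayes_update(belief, obs, confusion):
    # belief[c] = P(class c); confusion[c, o] = P(obs o | class c), so an
    # inaccurate sensor is captured by an off-diagonal confusion matrix.
    posterior = belief * confusion[:, obs]
    return posterior / posterior.sum()

def belief_entropy(belief):
    # The RL planner can favor measurements expected to reduce this.
    p = belief[belief > 0]
    return float(-(p * np.log(p)).sum())
```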
arXiv Detail & Related papers (2022-12-16T01:01:17Z)
- Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space [76.46113138484947]
General-purpose robots require diverse repertoires of behaviors to complete challenging tasks in real-world unstructured environments.
To address this issue, goal-conditioned reinforcement learning aims to acquire policies that can reach goals for a wide range of tasks on command.
We propose Planning to Practice, a method that makes it practical to train goal-conditioned policies for long-horizon tasks.
arXiv Detail & Related papers (2022-05-17T06:58:17Z)
- C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks [133.40619754674066]
Goal-conditioned reinforcement learning can solve tasks in a wide range of domains, including navigation and manipulation.
We propose to solve the distant goal-reaching task by using search at training time to automatically generate a curriculum of intermediate states.
The method frames learning as expectation maximization: the E-step corresponds to planning an optimal sequence of waypoints using graph search, while the M-step aims to learn a goal-conditioned policy to reach those waypoints.
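A compact sketch of that EM loop (the graph of visited states and the policy-update hook are hypothetical interfaces):

```python
import networkx as nx

def e_step(graph, start, goal):
    # Plan an optimal waypoint sequence through the graph of visited states.
    return nx.shortest_path(graph, start, goal)

def m_step(policy_update, transitions, waypoints):
    # Train the goal-conditioned policy toward the next waypoint rather
    # than the distant goal; policy_update is a hypothetical trainer hook.
    for (s, a, s_next), w in zip(transitions, waypoints):
        policy_update(s, a, s_next, goal=w)
```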
arXiv Detail & Related papers (2021-10-22T22:05:31Z)
- Learning Target Candidate Association to Keep Track of What Not to Track [100.80610986625693]
We propose to keep track of distractor objects in order to continue tracking the target.
To tackle the problem of lacking ground-truth correspondences between distractor objects in visual tracking, we propose a training strategy that combines partial annotations with self-supervision.
Our tracker sets a new state-of-the-art on six benchmarks, achieving an AUC score of 67.2% on LaSOT and a +6.1% absolute gain on the OxUvA long-term dataset.
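A toy version of candidate association across frames, matching detections by appearance-embedding similarity so distractors keep stable identities (the paper's learned association network is not specified here):

```python
import numpy as np

def associate(prev_emb, cur_emb, thresh=0.7):
    # Greedy cosine-similarity matching; rows are assumed L2-normalized.
    sim = prev_emb @ cur_emb.T
    matches, used = {}, set()
    for i in np.argsort(-sim.max(axis=1)):  # most confident tracks first
        for j in np.argsort(-sim[i]):
            if j not in used and sim[i, j] >= thresh:
                matches[int(i)] = int(j)
                used.add(int(j))
                break
    return matches
```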
arXiv Detail & Related papers (2021-03-30T17:58:02Z)
- Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals [8.98526174345299]
This paper introduces a notion of imaginary object goals.
For a given manipulation task, the object of interest is first trained to reach a desired target position on its own.
The object policy is then leveraged to build a predictive model of plausible object trajectories.
The proposed algorithm, Follow the Object, has been evaluated on 7 MuJoCo environments.
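A sketch of how the learned object policy can supply imagined sub-goals (object_policy returning the next object pose is a hypothetical interface):

```python
def imagined_goals(object_policy, object_state, steps=10):
    # Roll the object policy forward to get a plausible object trajectory,
    # then hand the poses to the manipulator as a curriculum of sub-goals.
    goals, s = [], object_state
    for _ in range(steps):
        s = object_policy(s)
        goals.append(s)
    return goals
```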
arXiv Detail & Related papers (2020-08-05T12:19:14Z)
- Goal-Aware Prediction: Learning to Model What Matters [105.43098326577434]
One of the fundamental challenges in using a learned forward dynamics model is the mismatch between the objective of the learned model and that of the downstream planner or policy.
We propose to direct prediction towards task relevant information, enabling the model to be aware of the current task and encouraging it to only model relevant quantities of the state space.
We find that our method more effectively models the relevant parts of the scene conditioned on the goal, and as a result outperforms standard task-agnostic dynamics models and model-free reinforcement learning.
arXiv Detail & Related papers (2020-07-14T16:42:59Z)
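In the spirit of that idea, a hedged sketch of a dynamics model that receives the goal alongside the state, so capacity can concentrate on goal-relevant quantities (architecture and dimensions are illustrative):

```python
import torch
import torch.nn as nn

class GoalAwareDynamics(nn.Module):
    # Predicts the next state (or goal-relevant features of it) from
    # (state, action, goal); conditioning on the goal is what lets the
    # model ignore task-irrelevant parts of the scene.
    def __init__(self, state_dim, action_dim, goal_dim, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim + goal_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, state_dim),
        )

    def forward(self, state, action, goal):
        return self.net(torch.cat([state, action, goal], dim=-1))
```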
This list is automatically generated from the titles and abstracts of the papers on this site.
This site does not guarantee the quality of the information shown and is not responsible for any consequences of its use.