Moving object detection for visual odometry in a dynamic environment
based on occlusion accumulation
- URL: http://arxiv.org/abs/2009.08746v1
- Date: Fri, 18 Sep 2020 11:01:46 GMT
- Title: Moving object detection for visual odometry in a dynamic environment
based on occlusion accumulation
- Authors: Haram Kim, Pyojin Kim, H. Jin Kim
- Abstract summary: We propose a moving object detection algorithm that uses RGB-D images.
The proposed algorithm does not require estimating a background model.
We use dense visual odometry (DVO) as a VO method with a bi-square regression weight.
- Score: 31.143322364794894
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Detection of moving objects is an essential capability in dealing with
dynamic environments. Most moving object detection algorithms have been
designed for color images without depth. For robotic navigation where real-time
RGB-D data is often readily available, utilization of the depth information
would be beneficial for obstacle recognition.
Here, we propose a simple moving object detection algorithm that uses RGB-D
images. The proposed algorithm does not require estimating a background model.
Instead, it uses an occlusion model which enables us to estimate the camera
pose on a background confused with moving objects that dominate the scene. The
proposed algorithm allows to separate the moving object detection and visual
odometry (VO) so that an arbitrary robust VO method can be employed in a
dynamic situation with a combination of moving object detection, whereas other
VO algorithms for a dynamic environment are inseparable. In this paper, we use
dense visual odometry (DVO) as a VO method with a bi-square regression weight.
Experimental results show the segmentation accuracy and the performance
improvement of DVO in the situations. We validate our algorithm in public
datasets and our dataset which also publicly accessible.
Related papers
- MV-ROPE: Multi-view Constraints for Robust Category-level Object Pose and Size Estimation [23.615122326731115]
We propose a novel solution that makes use of RGB video streams.
Our framework consists of three modules: a scale-aware monocular dense SLAM solution, a lightweight object pose predictor, and an object-level pose graph.
Our experimental results demonstrate that when utilizing public dataset sequences with high-quality depth information, the proposed method exhibits comparable performance to state-of-the-art RGB-D methods.
arXiv Detail & Related papers (2023-08-17T08:29:54Z) - SalienDet: A Saliency-based Feature Enhancement Algorithm for Object
Detection for Autonomous Driving [160.57870373052577]
We propose a saliency-based OD algorithm (SalienDet) to detect unknown objects.
Our SalienDet utilizes a saliency-based algorithm to enhance image features for object proposal generation.
We design a dataset relabeling approach to differentiate the unknown objects from all objects in training sample set to achieve Open-World Detection.
arXiv Detail & Related papers (2023-05-11T16:19:44Z) - DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object
Detection and Tracking [67.34803048690428]
We propose to model Dynamic Objects in RecurrenT (DORT) to tackle this problem.
DORT extracts object-wise local volumes for motion estimation that also alleviates the heavy computational burden.
It is flexible and practical that can be plugged into most camera-based 3D object detectors.
arXiv Detail & Related papers (2023-03-29T12:33:55Z) - Adaptive Multi-source Predictor for Zero-shot Video Object Segmentation [68.56443382421878]
We propose a novel adaptive multi-source predictor for zero-shot video object segmentation (ZVOS)
In the static object predictor, the RGB source is converted to depth and static saliency sources, simultaneously.
Experiments show that the proposed model outperforms the state-of-the-art methods on three challenging ZVOS benchmarks.
arXiv Detail & Related papers (2023-03-18T10:19:29Z) - Multitask AET with Orthogonal Tangent Regularity for Dark Object
Detection [84.52197307286681]
We propose a novel multitask auto encoding transformation (MAET) model to enhance object detection in a dark environment.
In a self-supervision manner, the MAET learns the intrinsic visual structure by encoding and decoding the realistic illumination-degrading transformation.
We have achieved the state-of-the-art performance using synthetic and real-world datasets.
arXiv Detail & Related papers (2022-05-06T16:27:14Z) - Visual Odometry for RGB-D Cameras [3.655021726150368]
This paper develops a quick and accurate approach to visual odometry of a moving RGB-D camera navigating on a static environment.
The proposed algorithm uses SURF as feature extractor, RANSAC to filter the results and Minimum Mean Square to estimate the rigid transformation of six parameters between successive video frames.
arXiv Detail & Related papers (2022-03-28T21:49:12Z) - Space Non-cooperative Object Active Tracking with Deep Reinforcement
Learning [1.212848031108815]
We propose an end-to-end active visual tracking method based on DQN algorithm, named as DRLAVT.
It can guide the chasing spacecraft approach to arbitrary space non-cooperative target merely relied on color or RGBD images.
It significantly outperforms position-based visual servoing baseline algorithm that adopts state-of-the-art 2D monocular tracker, SiamRPN.
arXiv Detail & Related papers (2021-12-18T06:12:24Z) - You Better Look Twice: a new perspective for designing accurate
detectors with reduced computations [56.34005280792013]
BLT-net is a new low-computation two-stage object detection architecture.
It reduces computations by separating objects from background using a very lite first-stage.
Resulting image proposals are then processed in the second-stage by a highly accurate model.
arXiv Detail & Related papers (2021-07-21T12:39:51Z) - DS-Net: Dynamic Spatiotemporal Network for Video Salient Object
Detection [78.04869214450963]
We propose a novel dynamic temporal-temporal network (DSNet) for more effective fusion of temporal and spatial information.
We show that the proposed method achieves superior performance than state-of-the-art algorithms.
arXiv Detail & Related papers (2020-12-09T06:42:30Z) - Object Detection in the Context of Mobile Augmented Reality [16.49070406578342]
We propose a novel approach that combines the geometric information from VIO with semantic information from object detectors to improve the performance of object detection on mobile devices.
Our approach includes three components: (1) an image orientation correction method, (2) a scale-based filtering approach, and (3) an online semantic map.
The results show that our approach can improve on the accuracy of generic object detectors by 12% on our dataset.
arXiv Detail & Related papers (2020-08-15T05:15:00Z) - Object-oriented SLAM using Quadrics and Symmetry Properties for Indoor
Environments [11.069661312755034]
This paper proposes a sparse object-level SLAM algorithm based on an RGB-D camera.
A quadric representation is used as a landmark to compactly model objects, including their position, orientation, and occupied space.
Experiments have shown that compared with the state-of-art algorithm, especially on the forward trajectory of mobile robots, the proposed algorithm significantly improves the accuracy and convergence speed of quadric reconstruction.
arXiv Detail & Related papers (2020-04-11T04:15:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.