D-InLoc++: Indoor Localization in Dynamic Environments
- URL: http://arxiv.org/abs/2209.10185v1
- Date: Wed, 21 Sep 2022 08:35:32 GMT
- Title: D-InLoc++: Indoor Localization in Dynamic Environments
- Authors: Martina Dubenova, Anna Zderadickova, Ondrej Kafka, Tomas Pajdla,
Michal Polic
- Abstract summary: We show that the movable objects incorporate non-negligible localization error and present a new method to predict the six-degree-of-freedom (6DoF) pose more robustly.
The masks of dynamic objects are employed in the relative pose estimation step and in the final sorting of camera pose proposal.
- Score: 2.9398911304923447
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Most state-of-the-art localization algorithms rely on robust relative pose
estimation and geometry verification to obtain moving object agnostic camera
poses in complex indoor environments. However, this approach is prone to
mistakes if a scene contains repetitive structures, e.g., desks, tables, boxes,
or moving people. We show that the movable objects incorporate non-negligible
localization error and present a new straightforward method to predict the
six-degree-of-freedom (6DoF) pose more robustly. We equipped the localization
pipeline InLoc with real-time instance segmentation network YOLACT++. The masks
of dynamic objects are employed in the relative pose estimation step and in the
final sorting of camera pose proposal. At first, we filter out the matches
laying on masks of the dynamic objects. Second, we skip the comparison of query
and synthetic images on the area related to the moving object. This procedure
leads to a more robust localization. Lastly, we describe and improve the
mistakes caused by gradient-based comparison between synthetic and query images
and publish a new pipeline for simulation of environments with movable objects
from the Matterport scans. All the codes are available on
github.com/dubenma/D-InLocpp .
Related papers
- V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting [1.3493547928462395]
Simultaneous localization and mapping (SLAM) in highly dynamic environments is challenging due to the correlation between moving objects and the camera pose.
We propose a robust method, V3D-SLAM, to remove moving objects via two lightweight re-evaluation stages.
Our experiment on the TUM RGB-D benchmark on dynamic sequences with ground-truth camera trajectories showed that our methods outperform the most recent state-of-the-art SLAM methods.
arXiv Detail & Related papers (2024-10-15T21:08:08Z) - Multi-Model 3D Registration: Finding Multiple Moving Objects in
Cluttered Point Clouds [23.923838486208524]
We investigate a variation of the 3D registration problem, named multi-model 3D registration.
In the multi-model registration problem, we are given two point clouds picturing a set of objects at different poses.
We want to simultaneously reconstruct how all objects moved between the two point clouds.
arXiv Detail & Related papers (2024-02-16T18:01:43Z) - ROAM: Robust and Object-Aware Motion Generation Using Neural Pose
Descriptors [73.26004792375556]
This paper shows that robustness and generalisation to novel scene objects in 3D object-aware character synthesis can be achieved by training a motion model with as few as one reference object.
We leverage an implicit feature representation trained on object-only datasets, which encodes an SE(3)-equivariant descriptor field around the object.
We demonstrate substantial improvements in 3D virtual character motion and interaction quality and robustness to scenarios with unseen objects.
arXiv Detail & Related papers (2023-08-24T17:59:51Z) - DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object
Detection and Tracking [67.34803048690428]
We propose to model Dynamic Objects in RecurrenT (DORT) to tackle this problem.
DORT extracts object-wise local volumes for motion estimation that also alleviates the heavy computational burden.
It is flexible and practical that can be plugged into most camera-based 3D object detectors.
arXiv Detail & Related papers (2023-03-29T12:33:55Z) - InstMove: Instance Motion for Object-centric Video Segmentation [70.16915119724757]
In this work, we study the instance-level motion and present InstMove, which stands for Instance Motion for Object-centric Video.
In comparison to pixel-wise motion, InstMove mainly relies on instance-level motion information that is free from image feature embeddings.
With only a few lines of code, InstMove can be integrated into current SOTA methods for three different video segmentation tasks.
arXiv Detail & Related papers (2023-03-14T17:58:44Z) - Robust Change Detection Based on Neural Descriptor Fields [53.111397800478294]
We develop an object-level online change detection approach that is robust to partially overlapping observations and noisy localization results.
By associating objects via shape code similarity and comparing local object-neighbor spatial layout, our proposed approach demonstrates robustness to low observation overlap and localization noises.
arXiv Detail & Related papers (2022-08-01T17:45:36Z) - Discovering Objects that Can Move [55.743225595012966]
We study the problem of object discovery -- separating objects from the background without manual labels.
Existing approaches utilize appearance cues, such as color, texture, and location, to group pixels into object-like regions.
We choose to focus on dynamic objects -- entities that can move independently in the world.
arXiv Detail & Related papers (2022-03-18T21:13:56Z) - TwistSLAM: Constrained SLAM in Dynamic Environment [0.0]
We present TwistSLAM, a semantic, dynamic, stereo SLAM system that can track dynamic objects in the scene.
Our algorithm creates clusters of points according to their semantic class.
It uses the static parts of the environment to robustly localize the camera and tracks the remaining objects.
arXiv Detail & Related papers (2022-02-24T22:08:45Z) - Combining Local and Global Pose Estimation for Precise Tracking of
Similar Objects [2.861848675707602]
We present a multi-object 6D detection and tracking pipeline for potentially similar and non-textured objects.
A new network architecture, trained solely with synthetic images, allows simultaneous pose estimation of multiple objects.
We show how the system can be used in a real AR assistance application within the field of construction.
arXiv Detail & Related papers (2022-01-31T14:36:57Z) - TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and
Reconstruction [57.1209039399599]
We propose a map representation that allows maintaining a single volume for the entire scene and all the objects therein.
In a multiple dynamic object tracking and reconstruction scenario, our representation allows maintaining accurate reconstruction of surfaces even while they become temporarily occluded by other objects moving in their proximity.
We evaluate the proposed TSDF++ formulation on a public synthetic dataset and demonstrate its ability to preserve reconstructions of occluded surfaces when compared to the standard TSDF map representation.
arXiv Detail & Related papers (2021-05-16T16:15:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.