Fully Convolutional Geometric Features for Category-level Object
Alignment
- URL: http://arxiv.org/abs/2103.04494v1
- Date: Mon, 8 Mar 2021 00:31:56 GMT
- Title: Fully Convolutional Geometric Features for Category-level Object
Alignment
- Authors: Qiaojun Feng, Nikolay Atanasov
- Abstract summary: This paper focuses on pose registration of different object instances from the same category.
Our approach transforms instances of the same category to a normalized canonical coordinate frame and uses metric learning to train fully convolutional geometric features.
- Score: 12.741811850885309
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper focuses on pose registration of different object instances from
the same category. This is required in online object mapping because object
instances detected at test time usually differ from the training instances. Our
approach transforms instances of the same category to a normalized canonical
coordinate frame and uses metric learning to train fully convolutional
geometric features. The resulting model is able to generate pairs of matching
points between the instances, allowing category-level registration. Evaluation
on both synthetic and real-world data shows that our method provides robust
features, leading to accurate alignment of instances with different shapes.
Related papers
- Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation [38.03793706479096]
Category-level 6D object pose estimation aims to estimate the rotation, translation and size of unseen instances within specific categories.
We propose a novel Instance-Adaptive and Geometric-Aware Keypoint Learning method for category-level 6D object pose estimation (AG-Pose)
The proposed AG-Pose outperforms state-of-the-art methods by a large margin without category-specific shape priors.
arXiv Detail & Related papers (2024-03-28T16:02:03Z) - Not All Instances Contribute Equally: Instance-adaptive Class
Representation Learning for Few-Shot Visual Recognition [94.04041301504567]
Few-shot visual recognition refers to recognize novel visual concepts from a few labeled instances.
We propose a novel metric-based meta-learning framework termed instance-adaptive class representation learning network (ICRL-Net) for few-shot visual recognition.
arXiv Detail & Related papers (2022-09-07T10:00:18Z) - Learning to Complete Object Shapes for Object-level Mapping in Dynamic
Scenes [30.500198859451434]
We propose a novel object-level mapping system that can simultaneously segment, track, and reconstruct objects in dynamic scenes.
It can further predict and complete their full geometries by conditioning on reconstructions from depth inputs and a category-level shape prior.
We evaluate its effectiveness by quantitatively and qualitatively testing it in both synthetic and real-world sequences.
arXiv Detail & Related papers (2022-08-09T22:56:33Z) - On Hyperbolic Embeddings in 2D Object Detection [76.12912000278322]
We study whether a hyperbolic geometry better matches the underlying structure of the object classification space.
We incorporate a hyperbolic classifier in two-stage, keypoint-based, and transformer-based object detection architectures.
We observe categorical class hierarchies emerging in the structure of the classification space, resulting in lower classification errors and boosting the overall object detection performance.
arXiv Detail & Related papers (2022-03-15T16:43:40Z) - Frame Averaging for Equivariant Shape Space Learning [85.42901997467754]
A natural way to incorporate symmetries in shape space learning is to ask that the mapping to the shape space (encoder) and mapping from the shape space (decoder) are equivariant to the relevant symmetries.
We present a framework for incorporating equivariance in encoders and decoders by introducing two contributions.
arXiv Detail & Related papers (2021-12-03T06:41:19Z) - Discovering Relationships between Object Categories via Universal
Canonical Maps [80.07703460198198]
We tackle the problem of learning the geometry of multiple categories of deformable objects jointly.
Recent work has shown that it is possible to learn a unified dense pose predictor for several categories of related objects.
We show that improved correspondences can be learned automatically as a natural byproduct of learning category-specific dense pose predictors.
arXiv Detail & Related papers (2021-06-17T18:38:18Z) - I^3Net: Implicit Instance-Invariant Network for Adapting One-Stage
Object Detectors [64.93963042395976]
Implicit Instance-Invariant Network (I3Net) is tailored for adapting one-stage detectors.
I3Net implicitly learns instance-invariant features via exploiting the natural characteristics of deep features in different layers.
Experiments reveal that I3Net exceeds the state-of-the-art performance on benchmark datasets.
arXiv Detail & Related papers (2021-03-25T11:14:36Z) - CORSAIR: Convolutional Object Retrieval and Symmetry-AIded Registration [14.79639149658596]
We develop and approach for fully Convolutional Object Retrieval and Symmetry-AIded Registration.
We present results from synthetic and real-world datasets with different object categories to verify the robustness of our method.
arXiv Detail & Related papers (2021-03-11T19:12:48Z) - Canonical 3D Deformer Maps: Unifying parametric and non-parametric
methods for dense weakly-supervised category reconstruction [79.98689027127855]
We propose a new representation of the 3D shape of common object categories that can be learned from a collection of 2D images of independent objects.
Our method builds in a novel way on concepts from parametric deformation models, non-parametric 3D reconstruction, and canonical embeddings.
It achieves state-of-the-art results in dense 3D reconstruction on public in-the-wild datasets of faces, cars, and birds.
arXiv Detail & Related papers (2020-08-28T15:44:05Z) - OS2D: One-Stage One-Shot Object Detection by Matching Anchor Features [14.115782214599015]
One-shot object detection consists in detecting objects defined by a single demonstration.
We build the one-stage system that performs localization and recognition jointly.
Experimental evaluation on several challenging domains shows that our method can detect unseen classes.
arXiv Detail & Related papers (2020-03-15T11:39:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.