Keypoint Autoencoders: Learning Interest Points of Semantics
- URL: http://arxiv.org/abs/2008.04502v1
- Date: Tue, 11 Aug 2020 03:43:18 GMT
- Title: Keypoint Autoencoders: Learning Interest Points of Semantics
- Authors: Ruoxi Shi, Zhengrong Xue, Xinyang Li
- Abstract summary: We propose Keypoint Autoencoder, an unsupervised learning method for detecting keypoints.
We encourage selecting sparse semantic keypoints by enforcing the reconstruction from keypoints to the original point cloud.
A downstream task of classifying shape with sparse keypoints is conducted to demonstrate the distinctiveness of our selected keypoints.
- Score: 4.551313396927381
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Understanding point clouds is of great importance. Many previous methods
focus on detecting salient keypoints to identity structures of point clouds.
However, existing methods neglect the semantics of points selected, leading to
poor performance on downstream tasks. In this paper, we propose Keypoint
Autoencoder, an unsupervised learning method for detecting keypoints. We
encourage selecting sparse semantic keypoints by enforcing the reconstruction
from keypoints to the original point cloud. To make sparse keypoint selection
differentiable, Soft Keypoint Proposal is adopted by calculating weighted
averages among input points. A downstream task of classifying shape with sparse
keypoints is conducted to demonstrate the distinctiveness of our selected
keypoints. Semantic Accuracy and Semantic Richness are proposed and our method
gives competitive or even better performance than state of the arts on these
two metrics.
Related papers
- GMM-IKRS: Gaussian Mixture Models for Interpretable Keypoint Refinement and Scoring [9.322937309882022]
Keypoints come with a score permitting to rank them according to their quality.
While learned keypoints often exhibit better properties than handcrafted ones, their scores are not easily interpretable.
We propose a framework that can refine, and at the same time characterize with an interpretable score, the keypoints extracted by any method.
arXiv Detail & Related papers (2024-08-30T09:39:59Z) - Meta-Point Learning and Refining for Category-Agnostic Pose Estimation [46.98479393474727]
Category-agnostic pose estimation (CAPE) aims to predict keypoints for arbitrary classes given a few support images annotated with keypoints.
We propose a novel framework for CAPE based on such potential keypoints (named as meta-points)
arXiv Detail & Related papers (2024-03-20T14:54:33Z) - SC3K: Self-supervised and Coherent 3D Keypoints Estimation from Rotated,
Noisy, and Decimated Point Cloud Data [17.471342278936365]
We propose a new method to infer keypoints from arbitrary object categories in practical scenarios where point cloud data (PCD) are noisy, down-sampled and arbitrarily rotated.
We achieve these desiderata by proposing a new self-supervised training strategy for keypoints estimation.
We compare the keypoints estimated by the proposed approach with those of the state-of-the-art unsupervised approaches.
arXiv Detail & Related papers (2023-08-10T08:10:01Z) - D-Net: Learning for Distinctive Point Clouds by Self-Attentive Point
Searching and Learnable Feature Fusion [48.57170130169045]
We propose D-Net to learn for distinctive point clouds based on a self-attentive point searching and a learnable feature fusion.
To generate a compact feature representation for each distinctive point set, a stacked self-gated convolution is proposed to extract the distinctive features.
The results show that the learned distinction distribution of a point cloud is highly consistent with objects of the same class and different from objects of other classes.
arXiv Detail & Related papers (2023-05-10T02:19:00Z) - Point-Teaching: Weakly Semi-Supervised Object Detection with Point
Annotations [81.02347863372364]
We present Point-Teaching, a weakly semi-supervised object detection framework.
Specifically, we propose a Hungarian-based point matching method to generate pseudo labels for point annotated images.
We propose a simple-yet-effective data augmentation, termed point-guided copy-paste, to reduce the impact of the unmatched points.
arXiv Detail & Related papers (2022-06-01T07:04:38Z) - From Keypoints to Object Landmarks via Self-Training Correspondence: A
novel approach to Unsupervised Landmark Discovery [37.78933209094847]
This paper proposes a novel paradigm for the unsupervised learning of object landmark detectors.
We validate our method on a variety of difficult datasets, including LS3D, BBCPose, Human3.6M and PennAction.
arXiv Detail & Related papers (2022-05-31T15:44:29Z) - Object Localization under Single Coarse Point Supervision [107.46800858130658]
We propose a POL method using coarse point annotations, relaxing the supervision signals from accurate key points to freely spotted points.
CPR constructs point bags, selects semantic-correlated points, and produces semantic center points through multiple instance learning (MIL)
In this way, CPR defines a weakly supervised evolution procedure, which ensures training high-performance object localizer under coarse point supervision.
arXiv Detail & Related papers (2022-03-17T14:14:11Z) - Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species [28.307200505494126]
We propose a versatile Few-shot Keypoint Detection (FSKD) pipeline, which can detect a varying number of keypoints of different kinds.
Our FSKD involves main and auxiliary keypoint representation learning, similarity learning, and keypoint localization.
We show the effectiveness of our FSKD on (i) novel keypoint detection for unseen species, and (ii) few-shot Fine-Grained Visual Recognition (FGVR) and (iii) Semantic Alignment (SA) downstream tasks.
arXiv Detail & Related papers (2021-12-12T08:39:47Z) - Accurate Grid Keypoint Learning for Efficient Video Prediction [87.71109421608232]
Keypoint-based video prediction methods can consume substantial computing resources in training and deployment.
In this paper, we design a new grid keypoint learning framework, aiming at a robust and explainable intermediate keypoint representation for long-term efficient video prediction.
Our method outperforms the state-ofthe-art video prediction methods while saves 98% more than computing resources.
arXiv Detail & Related papers (2021-07-28T05:04:30Z) - Unsupervised Learning of Category-Specific Symmetric 3D Keypoints from
Point Sets [71.84892018102465]
This paper aims at learning category-specific 3D keypoints, in an unsupervised manner, using a collection of misaligned 3D point clouds of objects from an unknown category.
To the best of our knowledge, this is the first work on learning such keypoints directly from 3D point clouds.
arXiv Detail & Related papers (2020-03-17T10:28:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.