Conditional Link Prediction of Category-Implicit Keypoint Detection
- URL: http://arxiv.org/abs/2011.14462v1
- Date: Sun, 29 Nov 2020 23:00:37 GMT
- Title: Conditional Link Prediction of Category-Implicit Keypoint Detection
- Authors: Ellen Yi-Ge, Rui Fan, Zechun Liu, Zhiqiang Shen
- Abstract summary: We propose an end-to-end category-implicit Keypoint and Link Prediction Network (KLPNet)
In our KLPNet, a novel Conditional Link Prediction Graph is proposed for link prediction among keypoints that are contingent on a predefined category.
Experiments conducted on three publicly available benchmarks demonstrate that our KLPNet consistently outperforms all other state-of-the-art approaches.
- Score: 26.400925420154245
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Keypoints of objects reflect their concise abstractions, while the
corresponding connection links (CL) build the skeleton by detecting the
intrinsic relations between keypoints. Existing approaches are typically
computationally-intensive, inapplicable for instances belonging to multiple
classes, and/or infeasible to simultaneously encode connection information. To
address the aforementioned issues, we propose an end-to-end category-implicit
Keypoint and Link Prediction Network (KLPNet), which is the first approach for
simultaneous semantic keypoint detection (for multi-class instances) and CL
rejuvenation. In our KLPNet, a novel Conditional Link Prediction Graph is
proposed for link prediction among keypoints that are contingent on a
predefined category. Furthermore, a Cross-stage Keypoint Localization Module
(CKLM) is introduced to explore feature aggregation for coarse-to-fine keypoint
localization. Comprehensive experiments conducted on three publicly available
benchmarks demonstrate that our KLPNet consistently outperforms all other
state-of-the-art approaches. Furthermore, the experimental results of CL
prediction also show the effectiveness of our KLPNet with respect to occlusion
problems.
Related papers
- C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection [98.34703790782254]
We introduce Category Common Prompt CLIP, which integrates the category common prompt into the text encoder to inject category-related concepts into the image encoder.
Our method achieves a 12.41% improvement in detection accuracy compared to the original CLIP, without introducing additional parameters during testing.
arXiv Detail & Related papers (2024-08-19T02:14:25Z) - SCAPE: A Simple and Strong Category-Agnostic Pose Estimator [6.705257644513057]
Category-Agnostic Pose Estimation (CAPE) aims to localize keypoints on an object of any category given few exemplars in an in-context manner.
We introduce two key modules: a global keypoint feature perceptor to inject global semantic information into support keypoints, and a keypoint attention refiner to enhance inter-node correlation between keypoints.
SCAPE outperforms prior arts by 2.2 and 1.3 PCK under 1-shot and 5-shot settings with faster inference speed and lighter model capacity.
arXiv Detail & Related papers (2024-07-18T13:02:57Z) - Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning [61.73411954056032]
Key Point Analysis (KPA) continues to be a significant and unresolved issue within the field of argument mining.
We propose a novel approach for KPA with pairwise generation and graph partitioning.
arXiv Detail & Related papers (2024-04-17T13:44:29Z) - Rethinking Few-shot 3D Point Cloud Semantic Segmentation [62.80639841429669]
This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS)
We focus on two significant issues in the state-of-the-art: foreground leakage and sparse point distribution.
To address these issues, we introduce a standardized FS-PCS setting, upon which a new benchmark is built.
arXiv Detail & Related papers (2024-03-01T15:14:47Z) - Open-Vocabulary Animal Keypoint Detection with Semantic-feature Matching [74.75284453828017]
Open-Vocabulary Keypoint Detection (OVKD) task is innovatively designed to use text prompts for identifying arbitrary keypoints across any species.
We have developed a novel framework named Open-Vocabulary Keypoint Detection with Semantic-feature Matching (KDSM)
This framework combines vision and language models, creating an interplay between language features and local keypoint visual features.
arXiv Detail & Related papers (2023-10-08T07:42:41Z) - SC3K: Self-supervised and Coherent 3D Keypoints Estimation from Rotated,
Noisy, and Decimated Point Cloud Data [17.471342278936365]
We propose a new method to infer keypoints from arbitrary object categories in practical scenarios where point cloud data (PCD) are noisy, down-sampled and arbitrarily rotated.
We achieve these desiderata by proposing a new self-supervised training strategy for keypoints estimation.
We compare the keypoints estimated by the proposed approach with those of the state-of-the-art unsupervised approaches.
arXiv Detail & Related papers (2023-08-10T08:10:01Z) - Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural
Network [52.29330138835208]
Accurately matching local features between a pair of images is a challenging computer vision task.
Previous studies typically use attention based graph neural networks (GNNs) with fully-connected graphs over keypoints within/across images.
We propose MaKeGNN, a sparse attention-based GNN architecture which bypasses non-repeatable keypoints and leverages matchable ones to guide message passing.
arXiv Detail & Related papers (2023-07-04T02:50:44Z) - From Key Points to Key Point Hierarchy: Structured and Expressive
Opinion Summarization [9.567280503274226]
Key Point Analysis (KPA) has been recently proposed for deriving fine-grained insights from collections of textual comments.
We introduce the task of organizing a given set of key points into a hierarchy, according to their specificity.
We develop ThinkP, a high quality benchmark dataset of key point hierarchies for business and product reviews.
arXiv Detail & Related papers (2023-06-06T16:45:44Z) - Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species [28.307200505494126]
We propose a versatile Few-shot Keypoint Detection (FSKD) pipeline, which can detect a varying number of keypoints of different kinds.
Our FSKD involves main and auxiliary keypoint representation learning, similarity learning, and keypoint localization.
We show the effectiveness of our FSKD on (i) novel keypoint detection for unseen species, and (ii) few-shot Fine-Grained Visual Recognition (FGVR) and (iii) Semantic Alignment (SA) downstream tasks.
arXiv Detail & Related papers (2021-12-12T08:39:47Z) - Rethinking Counting and Localization in Crowds:A Purely Point-Based
Framework [59.578339075658995]
We propose a purely point-based framework for joint crowd counting and individual localization.
We design an intuitive solution under this framework, which is called Point to Point Network (P2PNet)
arXiv Detail & Related papers (2021-07-27T11:41:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.