MD-Net: Multi-Detector for Local Feature Extraction
- URL: http://arxiv.org/abs/2208.05350v1
- Date: Wed, 10 Aug 2022 13:52:31 GMT
- Title: MD-Net: Multi-Detector for Local Feature Extraction
- Authors: Emanuele Santellani (1), Christian Sormann (1), Mattia Rossi (2),
Andreas Kuhn (2), Friedrich Fraundorfer (1) ((1) Graz University of
Technology, (2) Sony Europe B.V.)
- Abstract summary: We propose a deep feature extraction network capable of detecting a predefined number of complementary sets of keypoints at each image.
We train our network to predict the keypoints and compute the corresponding descriptors jointly.
With extensive experiments we show that our network, trained only on synthetically warped images, achieves competitive results on 3D reconstruction and re-localization tasks.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Establishing a sparse set of keypoint correspon dences between images is a
fundamental task in many computer vision pipelines. Often, this translates into
a computationally expensive nearest neighbor search, where every keypoint
descriptor at one image must be compared with all the descriptors at the
others. In order to lower the computational cost of the matching phase, we
propose a deep feature extraction network capable of detecting a predefined
number of complementary sets of keypoints at each image. Since only the
descriptors within the same set need to be compared across the different
images, the matching phase computational complexity decreases with the number
of sets. We train our network to predict the keypoints and compute the
corresponding descriptors jointly. In particular, in order to learn
complementary sets of keypoints, we introduce a novel unsupervised loss which
penalizes intersections among the different sets. Additionally, we propose a
novel descriptor-based weighting scheme meant to penalize the detection of
keypoints with non-discriminative descriptors. With extensive experiments we
show that our feature extraction network, trained only on synthetically warped
images and in a fully unsupervised manner, achieves competitive results on 3D
reconstruction and re-localization tasks at a reduced matching complexity.
Related papers
- Improving the matching of deformable objects by learning to detect
keypoints [6.4587163310833855]
We propose a novel learned keypoint detection method to increase the number of correct matches for the task of non-rigid image correspondence.
We train an end-to-end convolutional neural network (CNN) to find keypoint locations that are more appropriate to the considered descriptor.
Experiments demonstrate that our method enhances the Mean Matching Accuracy of numerous descriptors when used in conjunction with our detection method.
We also apply our method on the complex real-world task object retrieval where our detector performs on par with the finest keypoint detectors currently available for this task.
arXiv Detail & Related papers (2023-09-01T13:02:19Z) - Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural
Network [52.29330138835208]
Accurately matching local features between a pair of images is a challenging computer vision task.
Previous studies typically use attention based graph neural networks (GNNs) with fully-connected graphs over keypoints within/across images.
We propose MaKeGNN, a sparse attention-based GNN architecture which bypasses non-repeatable keypoints and leverages matchable ones to guide message passing.
arXiv Detail & Related papers (2023-07-04T02:50:44Z) - Minimum Description Length Clustering to Measure Meaningful Image
Complexity [65.268245109828]
We present a new image complexity metric through hierarchical clustering of patches.
We use the minimum description length principle to determine the number of clusters and designate certain points as outliers and, hence, correctly assign white noise a low score.
arXiv Detail & Related papers (2023-06-26T15:37:04Z) - DDM-NET: End-to-end learning of keypoint feature Detection, Description
and Matching for 3D localization [34.66510265193038]
We propose an end-to-end framework that jointly learns keypoint detection, descriptor representation and cross-frame matching.
We design a self-supervised image warping correspondence loss for both feature detection and matching.
We also propose a new loss to robustly handle both definite inlier/outlier matches and less-certain matches.
arXiv Detail & Related papers (2022-12-08T21:43:56Z) - Fast Key Points Detection and Matching for Tree-Structured Images [4.929206987094714]
This paper offers a new authentication algorithm based on image matching of nano-resolution visual identifiers with tree-shaped patterns.
The proposed algorithm is applicable to a variety of tree-structured image matching, but our focus is on dendrites, recently-developed visual identifiers.
arXiv Detail & Related papers (2022-11-07T00:22:56Z) - Two-Stream Graph Convolutional Network for Intra-oral Scanner Image
Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z) - Compositional Sketch Search [91.84489055347585]
We present an algorithm for searching image collections using free-hand sketches.
We exploit drawings as a concise and intuitive representation for specifying entire scene compositions.
arXiv Detail & Related papers (2021-06-15T09:38:09Z) - A region-based descriptor network for uniformly sampled keypoints [35.394659345406865]
Matching keypoint pairs of different images is a basic task of computer vision.
Most methods require customized extremum point schemes to obtain the coordinates of feature points with high confidence.
In this paper, we design a region-based descriptor by combining the context features of a deep network.
arXiv Detail & Related papers (2021-01-26T07:31:22Z) - Generalized Iris Presentation Attack Detection Algorithm under
Cross-Database Settings [63.90855798947425]
Presentation attacks pose major challenges to most of the biometric modalities.
We propose a generalized deep learning-based presentation attack detection network, MVANet.
It is inspired by the simplicity and success of hybrid algorithm or fusion of multiple detection networks.
arXiv Detail & Related papers (2020-10-25T22:42:27Z) - Compact Deep Aggregation for Set Retrieval [87.52470995031997]
We focus on retrieving images containing multiple faces from a large scale dataset of images.
Here the set consists of the face descriptors in each image, and given a query for multiple identities, the goal is then to retrieve, in order, images which contain all the identities.
We show that this compact descriptor has minimal loss of discriminability up to two faces per image, and degrades slowly after that.
arXiv Detail & Related papers (2020-03-26T08:43:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.