SIM2E: Benchmarking the Group Equivariant Capability of Correspondence
Matching Algorithms
- URL: http://arxiv.org/abs/2208.09896v1
- Date: Sun, 21 Aug 2022 14:47:02 GMT
- Title: SIM2E: Benchmarking the Group Equivariant Capability of Correspondence
Matching Algorithms
- Authors: Shuai Su, Zhongkai Zhao, Yixin Fei, Shuda Li, Qijun Chen, Rui Fan
- Abstract summary: This paper presents a specialized dataset dedicated to evaluating sim(2)-equivariant correspondence matching algorithms.
We compare the performance of 16 state-of-the-art (SoTA) correspondence matching approaches.
Since the subpixel accuracy achieved by CNN-based correspondence matching approaches is unsatisfactory, this specific area requires more attention in future works.
- Score: 12.892976023503818
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Correspondence matching is a fundamental problem in computer vision and
robotics applications. Solving correspondence matching problems using neural
networks has been on the rise recently. Rotation-equivariance and
scale-equivariance are both critical in correspondence matching applications.
Classical correspondence matching approaches are designed to withstand scaling
and rotation transformations. However, the features extracted using
convolutional neural networks (CNNs) are only translation-equivariant to a
certain extent. Recently, researchers have strived to improve the
rotation-equivariance of CNNs based on group theories. Sim(2) is the group of
similarity transformations in the 2D plane. This paper presents a specialized
dataset dedicated to evaluating sim(2)-equivariant correspondence matching
algorithms. We compare the performance of 16 state-of-the-art (SoTA)
correspondence matching approaches. The experimental results demonstrate the
importance of group equivariant algorithms for correspondence matching on
various sim(2) transformation conditions. Since the subpixel accuracy achieved
by CNN-based correspondence matching approaches is unsatisfactory, this
specific area requires more attention in future works. Our dataset is publicly
available at: mias.group/SIM2E.
Related papers
- Affine Invariance in Continuous-Domain Convolutional Neural Networks [6.019182604573028]
This research studies affine invariance on continuous-domain convolutional neural networks.
We introduce a new criterion to assess the similarity of two input signals under affine transformations.
Our research could eventually extend the scope of geometrical transformations that practical deep-learning pipelines can handle.
arXiv Detail & Related papers (2023-11-13T14:17:57Z) - A2B: Anchor to Barycentric Coordinate for Robust Correspondence [25.719939636977934]
We show that geometric-invariant coordinate representations, such as barycentric coordinates, can significantly reduce mismatches between features.
We introduce DEGREE, a novel anchor-to-barycentric (A2B) coordinate encoding approach, which generates multiple affine-invariant correspondence coordinates from paired images.
arXiv Detail & Related papers (2023-06-05T10:28:53Z) - Learning Partial Correlation based Deep Visual Representation for Image
Classification [61.0532370259644]
We formulate sparse inverse covariance estimation (SICE) as a novel structured layer of CNN.
Our work obtains a partial correlation based deep visual representation and mitigates the small sample problem.
Experiments show the efficacy and superior classification performance of our model.
arXiv Detail & Related papers (2023-04-23T10:09:01Z) - Equivariant Similarity for Vision-Language Foundation Models [134.77524524140168]
This study focuses on the multimodal similarity function that is not only the major training objective but also the core delivery to support downstream tasks.
We propose EqSim, a regularization loss that can be efficiently calculated from any two matched training pairs.
Compared to the existing evaluation sets, EqBen is the first to focus on "visual-minimal change"
arXiv Detail & Related papers (2023-03-25T13:22:56Z) - Deep Neural Networks with Efficient Guaranteed Invariances [77.99182201815763]
We address the problem of improving the performance and in particular the sample complexity of deep neural networks.
Group-equivariant convolutions are a popular approach to obtain equivariant representations.
We propose a multi-stream architecture, where each stream is invariant to a different transformation.
arXiv Detail & Related papers (2023-03-02T20:44:45Z) - Equivariance versus Augmentation for Spherical Images [0.7388859384645262]
We analyze the role of rotational equivariance in convolutional neural networks (CNNs) applied to spherical images.
We compare the performance of the group equivariant networks known as S2CNNs and standard non-equivariant CNNs trained with an increasing amount of data augmentation.
arXiv Detail & Related papers (2022-02-08T16:49:30Z) - Improving the Sample-Complexity of Deep Classification Networks with
Invariant Integration [77.99182201815763]
Leveraging prior knowledge on intraclass variance due to transformations is a powerful method to improve the sample complexity of deep neural networks.
We propose a novel monomial selection algorithm based on pruning methods to allow an application to more complex problems.
We demonstrate the improved sample complexity on the Rotated-MNIST, SVHN and CIFAR-10 datasets.
arXiv Detail & Related papers (2022-02-08T16:16:11Z) - Convolutional Hough Matching Networks [39.524998833064956]
We introduce a Hough transform perspective on convolutional matching and propose an effective geometric matching algorithm, dubbed Convolutional Hough Matching (CHM)
We cast it into a trainable neural layer with a semi-isotropic high-dimensional kernel, which learns non-rigid matching with a small number of interpretable parameters.
Our method sets a new state of the art on standard benchmarks for semantic visual correspondence, proving its strong robustness to challenging intra-class variations.
arXiv Detail & Related papers (2021-03-31T06:17:03Z) - Finding Geometric Models by Clustering in the Consensus Space [61.65661010039768]
We propose a new algorithm for finding an unknown number of geometric models, e.g., homographies.
We present a number of applications where the use of multiple geometric models improves accuracy.
These include pose estimation from multiple generalized homographies; trajectory estimation of fast-moving objects.
arXiv Detail & Related papers (2021-03-25T14:35:07Z) - Group Equivariant Conditional Neural Processes [30.134634059773703]
We present the group equivariant conditional neural process (EquivCNP)
We show that EquivCNP achieves comparable performance to conventional conditional neural processes in a 1D regression task.
arXiv Detail & Related papers (2021-02-17T13:50:07Z) - 3D Correspondence Grouping with Compatibility Features [51.869670613445685]
We present a simple yet effective method for 3D correspondence grouping.
The objective is to accurately classify initial correspondences obtained by matching local geometric descriptors into inliers and outliers.
We propose a novel representation for 3D correspondences, dubbed compatibility feature (CF), to describe the consistencies within inliers and inconsistencies within outliers.
arXiv Detail & Related papers (2020-07-21T02:39:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.