Local Consensus Enhanced Siamese Network with Reciprocal Loss for
Two-view Correspondence Learning
- URL: http://arxiv.org/abs/2308.03217v1
- Date: Sun, 6 Aug 2023 22:20:09 GMT
- Title: Local Consensus Enhanced Siamese Network with Reciprocal Loss for
Two-view Correspondence Learning
- Authors: Linbo Wang, Jing Wu, Xianyong Fang, Zhengyi Liu, Chenjie Cao, Yanwei
Fu
- Abstract summary: Two-view correspondence learning usually establish an end-to-end network to jointly predict correspondence reliability and relative pose.
We propose a Local Feature Consensus (LFC) plugin block to augment the features of existing models.
We extend existing models to a Siamese network with a reciprocal loss that exploits the supervision of mutual projection.
- Score: 35.5851523517487
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent studies of two-view correspondence learning usually establish an
end-to-end network to jointly predict correspondence reliability and relative
pose. We improve such a framework from two aspects. First, we propose a Local
Feature Consensus (LFC) plugin block to augment the features of existing
models. Given a correspondence feature, the block augments its neighboring
features with mutual neighborhood consensus and aggregates them to produce an
enhanced feature. As inliers obey a uniform cross-view transformation and share
more consistent learned features than outliers, feature consensus strengthens
inlier correlation and suppresses outlier distraction, which makes output
features more discriminative for classifying inliers/outliers. Second, existing
approaches supervise network training with the ground truth correspondences and
essential matrix projecting one image to the other for an input image pair,
without considering the information from the reverse mapping. We extend
existing models to a Siamese network with a reciprocal loss that exploits the
supervision of mutual projection, which considerably promotes the matching
performance without introducing additional model parameters. Building upon
MSA-Net, we implement the two proposals and experimentally achieve
state-of-the-art performance on benchmark datasets.
Related papers
- Unifying Feature and Cost Aggregation with Transformers for Semantic and Visual Correspondence [51.54175067684008]
This paper introduces a Transformer-based integrative feature and cost aggregation network designed for dense matching tasks.
We first show that feature aggregation and cost aggregation exhibit distinct characteristics and reveal the potential for substantial benefits stemming from the judicious use of both aggregation processes.
Our framework is evaluated on standard benchmarks for semantic matching, and also applied to geometric matching, where we show that our approach achieves significant improvements compared to existing methods.
arXiv Detail & Related papers (2024-03-17T07:02:55Z) - Dual-stream contrastive predictive network with joint handcrafted
feature view for SAR ship classification [9.251342335645765]
We propose a novel dual-stream contrastive predictive network (DCPNet)
The first task is to construct positive sample pairs, guiding the core encoder to learn more general representations.
The second task is to encourage adaptive capture of the correspondence between deep features and handcrated features, achieving knowledge transfer within the model, and effectively improving the redundancy caused by the feature fusion.
arXiv Detail & Related papers (2023-11-26T05:47:01Z) - FECANet: Boosting Few-Shot Semantic Segmentation with Feature-Enhanced
Context-Aware Network [48.912196729711624]
Few-shot semantic segmentation is the task of learning to locate each pixel of a novel class in a query image with only a few annotated support images.
We propose a Feature-Enhanced Context-Aware Network (FECANet) to suppress the matching noise caused by inter-class local similarity.
In addition, we propose a novel correlation reconstruction module that encodes extra correspondence relations between foreground and background and multi-scale context semantic features.
arXiv Detail & Related papers (2023-01-19T16:31:13Z) - Recurrent Feedback Improves Recognition of Partially Occluded Objects [1.452875650827562]
We investigate if and how artificial neural networks also benefit from recurrence.
We find that classification accuracy is significantly higher for recurrent models when compared to feedforward models of matched parametric complexity.
arXiv Detail & Related papers (2021-04-21T16:18:34Z) - Online Multiple Object Tracking with Cross-Task Synergy [120.70085565030628]
We propose a novel unified model with synergy between position prediction and embedding association.
The two tasks are linked by temporal-aware target attention and distractor attention, as well as identity-aware memory aggregation model.
arXiv Detail & Related papers (2021-04-01T10:19:40Z) - Adversarial Feature Augmentation and Normalization for Visual
Recognition [109.6834687220478]
Recent advances in computer vision take advantage of adversarial data augmentation to ameliorate the generalization ability of classification models.
Here, we present an effective and efficient alternative that advocates adversarial augmentation on intermediate feature embeddings.
We validate the proposed approach across diverse visual recognition tasks with representative backbone networks.
arXiv Detail & Related papers (2021-03-22T20:36:34Z) - CoADNet: Collaborative Aggregation-and-Distribution Networks for
Co-Salient Object Detection [91.91911418421086]
Co-Salient Object Detection (CoSOD) aims at discovering salient objects that repeatedly appear in a given query group containing two or more relevant images.
One challenging issue is how to effectively capture co-saliency cues by modeling and exploiting inter-image relationships.
We present an end-to-end collaborative aggregation-and-distribution network (CoADNet) to capture both salient and repetitive visual patterns from multiple images.
arXiv Detail & Related papers (2020-11-10T04:28:11Z) - Cascaded Human-Object Interaction Recognition [175.60439054047043]
We introduce a cascade architecture for a multi-stage, coarse-to-fine HOI understanding.
At each stage, an instance localization network progressively refines HOI proposals and feeds them into an interaction recognition network.
With our carefully-designed human-centric relation features, these two modules work collaboratively towards effective interaction understanding.
arXiv Detail & Related papers (2020-03-09T17:05:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.