Epipolar-Guided Deep Object Matching for Scene Change Detection
- URL: http://arxiv.org/abs/2007.15540v1
- Date: Thu, 30 Jul 2020 15:48:40 GMT
- Title: Epipolar-Guided Deep Object Matching for Scene Change Detection
- Authors: Kento Doi, Ryuhei Hamaguchi, Shun Iwase, Rio Yokota, Yutaka Matsuo,
Ken Sakurada
- Abstract summary: This paper describes a viewpoint-robust object-based change detection network (OBJ-CDNet)
Mobile cameras capture images from different viewpoints each time due to differences in camera trajectory and shutter timing.
We introduce a deep graph matching network that establishes object correspondence between an image pair.
- Score: 23.951526610952765
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper describes a viewpoint-robust object-based change detection network
(OBJ-CDNet). Mobile cameras such as drive recorders capture images from
different viewpoints each time due to differences in camera trajectory and
shutter timing. However, previous methods for pixel-wise change detection are
vulnerable to the viewpoint differences because they assume aligned image pairs
as inputs. To cope with the difficulty, we introduce a deep graph matching
network that establishes object correspondence between an image pair. The
introduction enables us to detect object-wise scene changes without precise
image alignment. For more accurate object matching, we propose an
epipolar-guided deep graph matching network (EGMNet), which incorporates the
epipolar constraint into the deep graph matching layer used in OBJCDNet. To
evaluate our network's robustness against viewpoint differences, we created
synthetic and real datasets for scene change detection from an image pair. The
experimental results verified the effectiveness of our network.
Related papers
- Hierarchical Graph Interaction Transformer with Dynamic Token Clustering for Camouflaged Object Detection [57.883265488038134]
We propose a hierarchical graph interaction network termed HGINet for camouflaged object detection.
The network is capable of discovering imperceptible objects via effective graph interaction among the hierarchical tokenized features.
Our experiments demonstrate the superior performance of HGINet compared to existing state-of-the-art methods.
arXiv Detail & Related papers (2024-08-27T12:53:25Z) - MaskCD: A Remote Sensing Change Detection Network Based on Mask Classification [29.15203530375882]
Change (CD) from remote sensing (RS) images using deep learning has been widely investigated in the literature.
We propose MaskCD to detect changed areas by adaptively generating categorized masks from input image pairs.
It reconstructs the desired changed objects by decoding the pixel-wise representations into learnable mask proposals.
arXiv Detail & Related papers (2024-04-18T11:05:15Z) - Learning-based Relational Object Matching Across Views [63.63338392484501]
We propose a learning-based approach which combines local keypoints with novel object-level features for matching object detections between RGB images.
We train our object-level matching features based on appearance and inter-frame and cross-frame spatial relations between objects in an associative graph neural network.
arXiv Detail & Related papers (2023-05-03T19:36:51Z) - PoserNet: Refining Relative Camera Poses Exploiting Object Detections [14.611595909419297]
We use objectness regions to guide the pose estimation problem rather than explicit semantic object detections.
We propose Pose Refiner Network (PoserNet) a light-weight Graph Network to refine the approximate pair-wise relative camera poses.
We evaluate on the 7-Scenes dataset across varied sizes of graphs and show how this process can be beneficial to optimisation-based Motion Averaging algorithms.
arXiv Detail & Related papers (2022-07-19T17:58:33Z) - Pairwise Comparison Network for Remote Sensing Scene Classification [0.0]
This paper proposes a pairwise comparison network, which contains two main steps: pairwise selection and pairwise representation.
The proposed network first selects similar image pairs, and then represents the image pairs with pairwise representations.
The self-representation is introduced to highlight the informative parts of each image itself, while the mutual-representation is proposed to capture the subtle differences between image pairs.
arXiv Detail & Related papers (2022-05-17T07:31:36Z) - Complex Scene Image Editing by Scene Graph Comprehension [17.72638225034884]
We propose a two-stage method for achieving complex scene image editing by Scene Graph (SGC-Net)
In the first stage, we train a Region of Interest (RoI) prediction network that uses scene graphs and predict the locations of the target objects.
The second stage uses a conditional diffusion model to edit the image based on our RoI predictions.
arXiv Detail & Related papers (2022-03-24T05:12:54Z) - Learning Hierarchical Graph Representation for Image Manipulation
Detection [50.04902159383709]
The objective of image manipulation detection is to identify and locate the manipulated regions in the images.
Recent approaches mostly adopt the sophisticated Convolutional Neural Networks (CNNs) to capture the tampering artifacts left in the images.
We propose a hierarchical Graph Convolutional Network (HGCN-Net), which consists of two parallel branches.
arXiv Detail & Related papers (2022-01-15T01:54:25Z) - City-scale Scene Change Detection using Point Clouds [71.73273007900717]
We propose a method for detecting structural changes in a city using images captured from mounted cameras over two different times.
A direct comparison of the two point clouds for change detection is not ideal due to inaccurate geo-location information.
To circumvent this problem, we propose a deep learning-based non-rigid registration on the point clouds.
Experiments show that our method is able to detect scene changes effectively, even in the presence of viewpoint and illumination differences.
arXiv Detail & Related papers (2021-03-26T08:04:13Z) - D-Unet: A Dual-encoder U-Net for Image Splicing Forgery Detection and
Localization [108.8592577019391]
Image splicing forgery detection is a global binary classification task that distinguishes the tampered and non-tampered regions by image fingerprints.
We propose a novel network called dual-encoder U-Net (D-Unet) for image splicing forgery detection, which employs an unfixed encoder and a fixed encoder.
In an experimental comparison study of D-Unet and state-of-the-art methods, D-Unet outperformed the other methods in image-level and pixel-level detection.
arXiv Detail & Related papers (2020-12-03T10:54:02Z) - High-Order Information Matters: Learning Relation and Topology for
Occluded Person Re-Identification [84.43394420267794]
We propose a novel framework by learning high-order relation and topology information for discriminative features and robust alignment.
Our framework significantly outperforms state-of-the-art by6.5%mAP scores on Occluded-Duke dataset.
arXiv Detail & Related papers (2020-03-18T12:18:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.