Coarse-to-fine Task-driven Inpainting for Geoscience Images
- URL: http://arxiv.org/abs/2211.11059v1
- Date: Sun, 20 Nov 2022 19:14:51 GMT
- Title: Coarse-to-fine Task-driven Inpainting for Geoscience Images
- Authors: Sun Huiming and Ma Jin and Guo Qing and Song Shaoyue and Yuewei Lin
and Yu Hongkai
- Abstract summary: This paper aims to repair the occluded regions for a better geoscience task performance with the advanced visualization quality simultaneously.
Because of the complex context of geoscience images, we propose a coarse-to-fine encoder-decoder network with coarse-to-fine adversarial context discriminators to reconstruct the occluded image regions.
- Score: 1.7741871563668714
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The processing and recognition of geoscience images have wide applications.
Most of existing researches focus on understanding the high-quality geoscience
images by assuming that all the images are clear. However, in many real-world
cases, the geoscience images might contain occlusions during the image
acquisition. This problem actually implies the image inpainting problem in
computer vision and multimedia. To the best of our knowledge, all the existing
image inpainting algorithms learn to repair the occluded regions for a better
visualization quality, they are excellent for natural images but not good
enough for geoscience images by ignoring the geoscience related tasks. This
paper aims to repair the occluded regions for a better geoscience task
performance with the advanced visualization quality simultaneously, without
changing the current deployed deep learning based geoscience models. Because of
the complex context of geoscience images, we propose a coarse-to-fine
encoder-decoder network with coarse-to-fine adversarial context discriminators
to reconstruct the occluded image regions. Due to the limited data of
geoscience images, we use a MaskMix based data augmentation method to exploit
more information from limited geoscience image data. The experimental results
on three public geoscience datasets for remote sensing scene recognition,
cross-view geolocation and semantic segmentation tasks respectively show the
effectiveness and accuracy of the proposed method.
Related papers
- Swarm Intelligence in Geo-Localization: A Multi-Agent Large Vision-Language Model Collaborative Framework [51.26566634946208]
We introduce smileGeo, a novel visual geo-localization framework.
By inter-agent communication, smileGeo integrates the inherent knowledge of these agents with additional retrieved information.
Results show that our approach significantly outperforms current state-of-the-art methods.
arXiv Detail & Related papers (2024-08-21T03:31:30Z) - GeoCLIP: Clip-Inspired Alignment between Locations and Images for
Effective Worldwide Geo-localization [61.10806364001535]
Worldwide Geo-localization aims to pinpoint the precise location of images taken anywhere on Earth.
Existing approaches divide the globe into discrete geographic cells, transforming the problem into a classification task.
We propose GeoCLIP, a novel CLIP-inspired Image-to-GPS retrieval approach that enforces alignment between the image and its corresponding GPS locations.
arXiv Detail & Related papers (2023-09-27T20:54:56Z) - GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark [56.08664336835741]
We propose a GeoGraphic Language Understanding Evaluation benchmark, named GeoGLUE.
We collect data from open-released geographic resources and introduce six natural language understanding tasks.
We pro vide evaluation experiments and analysis of general baselines, indicating the effectiveness and significance of the GeoGLUE benchmark.
arXiv Detail & Related papers (2023-05-11T03:21:56Z) - CSP: Self-Supervised Contrastive Spatial Pre-Training for
Geospatial-Visual Representations [90.50864830038202]
We present Contrastive Spatial Pre-Training (CSP), a self-supervised learning framework for geo-tagged images.
We use a dual-encoder to separately encode the images and their corresponding geo-locations, and use contrastive objectives to learn effective location representations from images.
CSP significantly boosts the model performance with 10-34% relative improvement with various labeled training data sampling ratios.
arXiv Detail & Related papers (2023-05-01T23:11:18Z) - Where We Are and What We're Looking At: Query Based Worldwide Image
Geo-localization Using Hierarchies and Scenes [53.53712888703834]
We introduce an end-to-end transformer-based architecture that exploits the relationship between different geographic levels.
We achieve state of the art street level accuracy on 4 standard geo-localization datasets.
arXiv Detail & Related papers (2023-03-07T21:47:58Z) - Revisiting Near/Remote Sensing with Geospatial Attention [24.565068569913382]
This work addresses the task of overhead image segmentation when auxiliary ground-level images are available.
Recent work has shown that performing joint inference over these two modalities, often called near/remote sensing, can yield significant accuracy improvements.
We introduce the concept of geospatial attention, a geometry-aware attention mechanism that explicitly considers the geospatial relationship between the pixels in a ground-level image and a geographic location.
arXiv Detail & Related papers (2022-04-04T19:19:50Z) - Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image
Matching [102.39635336450262]
We address the problem of ground-to-satellite image geo-localization by matching a query image captured at the ground level against a large-scale database with geotagged satellite images.
Our new method is able to achieve the fine-grained location of a query image, up to pixel size precision of the satellite image.
arXiv Detail & Related papers (2022-03-26T20:10:38Z) - Visual and Object Geo-localization: A Comprehensive Survey [11.120155713865918]
Geo-localization refers to the process of determining where on earth some entity' is located.
This paper provides a comprehensive survey of geo-localization involving images, which involves either determining from where an image has been captured (Image geo-localization) or geo-locating objects within an image (Object geo-localization)
We will provide an in-depth study, including a summary of popular algorithms, a description of proposed datasets, and an analysis of performance results to illustrate the current state of each field.
arXiv Detail & Related papers (2021-12-30T20:46:53Z) - High Definition image classification in Geoscience using Machine
Learning [0.0]
HD digital photos taken with drones are widely used in the study of Geoscience.
blurry images are often taken in collected data, and it takes a lot of time and effort to distinguish clear images from blurry ones.
In this work, we apply Machine learning techniques, such as Support Vector Machine (SVM) and Neural Network (NN) to classify HD images in Geoscience as clear and blurry.
arXiv Detail & Related papers (2020-09-25T17:30:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.