Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese
Geographic Re-Ranking
- URL: http://arxiv.org/abs/2309.01606v2
- Date: Fri, 2 Feb 2024 14:15:32 GMT
- Title: Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese
Geographic Re-Ranking
- Authors: Yong Cao, Ruixue Ding, Boli Chen, Xianzhi Li, Min Chen, Daniel
Hershcovich, Pengjun Xie, and Fei Huang
- Abstract summary: Chinese geographic re-ranking task aims to find the most relevant addresses among retrieved candidates.
We propose an innovative framework, namely Geo-Encoder, to more effectively integrate Chinese geographical semantics into re-ranking pipelines.
- Score: 61.60169764507917
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Chinese geographic re-ranking task aims to find the most relevant addresses
among retrieved candidates, which is crucial for location-related services such
as navigation maps. Unlike the general sentences, geographic contexts are
closely intertwined with geographical concepts, from general spans (e.g.,
province) to specific spans (e.g., road). Given this feature, we propose an
innovative framework, namely Geo-Encoder, to more effectively integrate Chinese
geographical semantics into re-ranking pipelines. Our methodology begins by
employing off-the-shelf tools to associate text with geographical spans,
treating them as chunking units. Then, we present a multi-task learning module
to simultaneously acquire an effective attention matrix that determines chunk
contributions to extra semantic representations. Furthermore, we put forth an
asynchronous update mechanism for the proposed addition task, aiming to guide
the model capable of effectively focusing on specific chunks. Experiments on
two distinct Chinese geographic re-ranking datasets, show that the Geo-Encoder
achieves significant improvements when compared to state-of-the-art baselines.
Notably, it leads to a substantial improvement in the Hit@1 score of MGEO-BERT,
increasing it by 6.22% from 62.76 to 68.98 on the GeoTES dataset.
Related papers
- TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learning [36.725822223732635]
We propose TorchSpatial, a learning framework and benchmark for location (point) encoding.
TorchSpatial contains three key components: 1) a unified location encoding framework that consolidates 15 commonly recognized location encoders; 2) the LocBench benchmark tasks encompassing 7 geo-aware image classification and 4 geo-aware image regression datasets; and 3) a comprehensive suite of evaluation metrics to quantify geo-aware models' overall performance as well as their geographic bias, with a novel Geo-Bias Score metric.
arXiv Detail & Related papers (2024-06-21T21:33:16Z) - Learning Geospatial Region Embedding with Heterogeneous Graph [16.864563545518124]
We present GeoHG, an effective heterogeneous graph structure for learning comprehensive region embeddings for various downstream tasks.
Specifically, we tailor satellite image representation learning through geo-entity segmentation and point-of-interest (POI) integration for expressive intra-regional features.
GeoHG unifies informative spatial interdependencies and socio-environmental attributes into a powerful heterogeneous graph to encourage explicit modeling of higher-order inter-regional relationships.
arXiv Detail & Related papers (2024-05-23T03:19:02Z) - GeoLLM: Extracting Geospatial Knowledge from Large Language Models [49.20315582673223]
We present GeoLLM, a novel method that can effectively extract geospatial knowledge from large language models.
We demonstrate the utility of our approach across multiple tasks of central interest to the international community, including the measurement of population density and economic livelihoods.
Our experiments reveal that LLMs are remarkably sample-efficient, rich in geospatial information, and robust across the globe.
arXiv Detail & Related papers (2023-10-10T00:03:23Z) - GeoCLIP: Clip-Inspired Alignment between Locations and Images for
Effective Worldwide Geo-localization [61.10806364001535]
Worldwide Geo-localization aims to pinpoint the precise location of images taken anywhere on Earth.
Existing approaches divide the globe into discrete geographic cells, transforming the problem into a classification task.
We propose GeoCLIP, a novel CLIP-inspired Image-to-GPS retrieval approach that enforces alignment between the image and its corresponding GPS locations.
arXiv Detail & Related papers (2023-09-27T20:54:56Z) - GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark [56.08664336835741]
We propose a GeoGraphic Language Understanding Evaluation benchmark, named GeoGLUE.
We collect data from open-released geographic resources and introduce six natural language understanding tasks.
We pro vide evaluation experiments and analysis of general baselines, indicating the effectiveness and significance of the GeoGLUE benchmark.
arXiv Detail & Related papers (2023-05-11T03:21:56Z) - GeoNet: Benchmarking Unsupervised Adaptation across Geographies [71.23141626803287]
We study the problem of geographic robustness and make three main contributions.
First, we introduce a large-scale dataset GeoNet for geographic adaptation.
Second, we hypothesize that the major source of domain shifts arise from significant variations in scene context.
Third, we conduct an extensive evaluation of several state-of-the-art unsupervised domain adaptation algorithms and architectures.
arXiv Detail & Related papers (2023-03-27T17:59:34Z) - MGeo: Multi-Modal Geographic Pre-Training Method [49.78466122982627]
We propose a novel query-POI matching method Multi-modal Geographic language model (MGeo)
MGeo represents GC as a new modality and is able to fully extract multi-modal correlations for accurate query-POI matching.
Our proposed multi-modal pre-training method can significantly improve the query-POI matching capability of generic PTMs.
arXiv Detail & Related papers (2023-01-11T03:05:12Z) - geoGAT: Graph Model Based on Attention Mechanism for Geographic Text
Classification [14.42443348221996]
We use the method of graph convolutional neural network with attention mechanism to achieve this function.
The Macro-F Score of the geoGAT we used reached 95% on the new Chinese dataset.
arXiv Detail & Related papers (2021-01-13T09:32:15Z) - SE-KGE: A Location-Aware Knowledge Graph Embedding Model for Geographic
Question Answering and Spatial Semantic Lifting [9.949690056661218]
We propose a location-aware KG embedding model called SE-KGE.
It encodes spatial information such as point coordinates or bounding boxes of geographic entities into the KG embedding space.
We also construct a geographic knowledge graph as well as a set of geographic query-answer pairs called DBGeo to evaluate the performance of SE-KGE.
arXiv Detail & Related papers (2020-04-25T17:46:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.