Improving Toponym Resolution with Better Candidate Generation,
Transformer-based Reranking, and Two-Stage Resolution
- URL: http://arxiv.org/abs/2305.11315v1
- Date: Thu, 18 May 2023 21:52:48 GMT
- Title: Improving Toponym Resolution with Better Candidate Generation,
Transformer-based Reranking, and Two-Stage Resolution
- Authors: Zeyu Zhang and Steven Bethard
- Abstract summary: Geocoding is the task of converting location mentions in text into structured data that encodes the geospatial semantics.
We propose a new architecture for geocoding, GeoNorm.
Our proposed toponym resolution framework achieves state-of-the-art performance on multiple datasets.
- Score: 30.855736793066406
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Geocoding is the task of converting location mentions in text into structured
data that encodes the geospatial semantics. We propose a new architecture for
geocoding, GeoNorm. GeoNorm first uses information retrieval techniques to
generate a list of candidate entries from the geospatial ontology. Then it
reranks the candidate entries using a transformer-based neural network that
incorporates information from the ontology such as the entry's population. This
generate-and-rerank process is applied twice: first to resolve the less
ambiguous countries, states, and counties, and second to resolve the remaining
location mentions, using the identified countries, states, and counties as
context. Our proposed toponym resolution framework achieves state-of-the-art
performance on multiple datasets. Code and models are available at
\url{https://github.com/clulab/geonorm}.
Related papers
- CityGuessr: City-Level Video Geo-Localization on a Global Scale [54.371452373726584]
We propose a novel problem of worldwide video geolocalization with the objective of hierarchically predicting the correct city, state/province, country, and continent, given a video.
No large scale video datasets that have extensive worldwide coverage exist, to train models for solving this problem.
We introduce a new dataset, CityGuessr68k comprising of 68,269 videos from 166 cities all over the world.
arXiv Detail & Related papers (2024-11-10T03:20:00Z) - GeoCLIP: Clip-Inspired Alignment between Locations and Images for
Effective Worldwide Geo-localization [61.10806364001535]
Worldwide Geo-localization aims to pinpoint the precise location of images taken anywhere on Earth.
Existing approaches divide the globe into discrete geographic cells, transforming the problem into a classification task.
We propose GeoCLIP, a novel CLIP-inspired Image-to-GPS retrieval approach that enforces alignment between the image and its corresponding GPS locations.
arXiv Detail & Related papers (2023-09-27T20:54:56Z) - Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese
Geographic Re-Ranking [61.60169764507917]
Chinese geographic re-ranking task aims to find the most relevant addresses among retrieved candidates.
We propose an innovative framework, namely Geo-Encoder, to more effectively integrate Chinese geographical semantics into re-ranking pipelines.
arXiv Detail & Related papers (2023-09-04T13:44:50Z) - GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark [56.08664336835741]
We propose a GeoGraphic Language Understanding Evaluation benchmark, named GeoGLUE.
We collect data from open-released geographic resources and introduce six natural language understanding tasks.
We pro vide evaluation experiments and analysis of general baselines, indicating the effectiveness and significance of the GeoGLUE benchmark.
arXiv Detail & Related papers (2023-05-11T03:21:56Z) - Mordecai 3: A Neural Geoparser and Event Geocoder [5.71097144710995]
Mordecai3 is a new end-to-end text geoparser and event geolocation system.
It performs toponym resolution using a new neural ranking model to resolve a place name extracted from a document to its entry in the Geonames gazetteer.
It also performs event geocoding, the process of linking events reported in text with the place names where they are reported to occur.
arXiv Detail & Related papers (2023-03-23T21:10:04Z) - MGeo: Multi-Modal Geographic Pre-Training Method [49.78466122982627]
We propose a novel query-POI matching method Multi-modal Geographic language model (MGeo)
MGeo represents GC as a new modality and is able to fully extract multi-modal correlations for accurate query-POI matching.
Our proposed multi-modal pre-training method can significantly improve the query-POI matching capability of generic PTMs.
arXiv Detail & Related papers (2023-01-11T03:05:12Z) - Transformer Based Geocoding [0.0]
We formulate the problem of predicting a geolocation from free text as a sequence-to-sequence problem.
We obtain a geocoding model by training a T5 encoder-decoder transformer model using free text as an input and geolocation as an output.
arXiv Detail & Related papers (2023-01-02T10:13:32Z) - Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image
Matching [102.39635336450262]
We address the problem of ground-to-satellite image geo-localization by matching a query image captured at the ground level against a large-scale database with geotagged satellite images.
Our new method is able to achieve the fine-grained location of a query image, up to pixel size precision of the satellite image.
arXiv Detail & Related papers (2022-03-26T20:10:38Z) - A Review of Location Encoding for GeoAI: Methods and Applications [14.279748049042665]
A common need for artificial intelligence models in the broader geoscience is to represent and encode various types of spatial data.
One fundamental step is to encode a single point location into an embedding space.
This embedding is learning-friendly for downstream machine learning models such as support vector machines and neural networks.
arXiv Detail & Related papers (2021-11-07T05:25:49Z) - Hierarchical Attention Fusion for Geo-Localization [7.544917072241684]
We introduce a hierarchical attention fusion network using multi-scale features for geo-localization.
We extract the hierarchical feature maps from a convolutional neural network (CNN) and organically fuse the extracted features for image representations.
Our training is self-supervised using adaptive weights to control the attention of feature emphasis from each hierarchical level.
arXiv Detail & Related papers (2021-02-18T07:07:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.