Geo-Semantic-Parsing: AI-powered geoparsing by traversing semantic knowledge graphs
- URL: http://arxiv.org/abs/2503.01386v1
- Date: Mon, 03 Mar 2025 10:30:23 GMT
- Title: Geo-Semantic-Parsing: AI-powered geoparsing by traversing semantic knowledge graphs
- Authors: Leonardo Nizzoli, Marco Avvenuti, Maurizio Tesconi, Stefano Cresci,
- Abstract summary: We introduce a novel geoparsing and geotagging technique called Geo-Semantic-Parsing (GSP)<n>GSP identifies location references in free text and extracts the corresponding geographic coordinates.<n>We evaluate GSP on a well-known reference dataset including almost 10k event-related tweets.
- Score: 0.7422344184734279
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Online social networks convey rich information about geospatial facets of reality. However in most cases, geographic information is not explicit and structured, thus preventing its exploitation in real-time applications. We address this limitation by introducing a novel geoparsing and geotagging technique called Geo-Semantic-Parsing (GSP). GSP identifies location references in free text and extracts the corresponding geographic coordinates. To reach this goal, we employ a semantic annotator to identify relevant portions of the input text and to link them to the corresponding entity in a knowledge graph. Then, we devise and experiment with several efficient strategies for traversing the knowledge graph, thus expanding the available set of information for the geoparsing task. Finally, we exploit all available information for learning a regression model that selects the best entity with which to geotag the input text. We evaluate GSP on a well-known reference dataset including almost 10k event-related tweets, achieving $F1=0.66$. We extensively compare our results with those of 2 baselines and 3 state-of-the-art geoparsing techniques, achieving the best performance. On the same dataset, competitors obtain $F1 \leq 0.55$. We conclude by providing in-depth analyses of our results, showing that the overall superior performance of GSP is mainly due to a large improvement in recall, with respect to existing techniques.
Related papers
- GeoAggregator: An Efficient Transformer Model for Geo-Spatial Tabular Data [5.40483645224129]
This paper introduces GeoAggregator, an efficient and lightweight algorithm for geospatial data modeling.
We benchmark it against spatial statistical models, XGBoost, and several state-of-the-art geospatial deep learning methods.
Results demonstrate that GeoAggregators achieve the best or second-best performance compared to their competitors on nearly all datasets.
arXiv Detail & Related papers (2025-02-20T20:39:15Z) - Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework [59.42946541163632]
We introduce a comprehensive geolocation framework with three key components.<n>GeoComp, a large-scale dataset; GeoCoT, a novel reasoning method; and GeoEval, an evaluation metric.<n>We demonstrate that GeoCoT significantly boosts geolocation accuracy by up to 25% while enhancing interpretability.
arXiv Detail & Related papers (2025-02-19T14:21:25Z) - The S2 Hierarchical Discrete Global Grid as a Nexus for Data Representation, Integration, and Querying Across Geospatial Knowledge Graphs [4.358099505067763]
This paper outlines the implementation of Google's S2 Geometry within KnowWhereGraph.
Ultimately, this work demonstrates the potential of DGGS frameworks, particularly S2, for building scalable GeoKGs.
arXiv Detail & Related papers (2024-10-18T18:30:05Z) - GeoLLM: Extracting Geospatial Knowledge from Large Language Models [49.20315582673223]
We present GeoLLM, a novel method that can effectively extract geospatial knowledge from large language models.
We demonstrate the utility of our approach across multiple tasks of central interest to the international community, including the measurement of population density and economic livelihoods.
Our experiments reveal that LLMs are remarkably sample-efficient, rich in geospatial information, and robust across the globe.
arXiv Detail & Related papers (2023-10-10T00:03:23Z) - GeoCLIP: Clip-Inspired Alignment between Locations and Images for
Effective Worldwide Geo-localization [61.10806364001535]
Worldwide Geo-localization aims to pinpoint the precise location of images taken anywhere on Earth.
Existing approaches divide the globe into discrete geographic cells, transforming the problem into a classification task.
We propose GeoCLIP, a novel CLIP-inspired Image-to-GPS retrieval approach that enforces alignment between the image and its corresponding GPS locations.
arXiv Detail & Related papers (2023-09-27T20:54:56Z) - Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese
Geographic Re-Ranking [61.60169764507917]
Chinese geographic re-ranking task aims to find the most relevant addresses among retrieved candidates.
We propose an innovative framework, namely Geo-Encoder, to more effectively integrate Chinese geographical semantics into re-ranking pipelines.
arXiv Detail & Related papers (2023-09-04T13:44:50Z) - K2: A Foundation Language Model for Geoscience Knowledge Understanding
and Utilization [105.89544876731942]
Large language models (LLMs) have achieved great success in general domains of natural language processing.
We present the first-ever LLM in geoscience, K2, alongside a suite of resources developed to further promote LLM research within geoscience.
arXiv Detail & Related papers (2023-06-08T09:29:05Z) - GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark [56.08664336835741]
We propose a GeoGraphic Language Understanding Evaluation benchmark, named GeoGLUE.
We collect data from open-released geographic resources and introduce six natural language understanding tasks.
We pro vide evaluation experiments and analysis of general baselines, indicating the effectiveness and significance of the GeoGLUE benchmark.
arXiv Detail & Related papers (2023-05-11T03:21:56Z) - Are We There Yet? Evaluating State-of-the-Art Neural Network based
Geoparsers Using EUPEG as a Benchmarking Platform [2.8935588665357077]
In June 2019, a geoparsing competition, Toponym Resolution in Scientific Papers, was held.
Winning teams developed neural network based geoparsers that achieved outstanding performances.
This work performs a systematic evaluation of these state-of-the-art geoparsers using our recently developed benchmarking platform EUPEG.
arXiv Detail & Related papers (2020-07-15T03:13:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.