Related papers: Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User Input

Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User Input

URL: http://arxiv.org/abs/2404.18784v1
Date: Mon, 29 Apr 2024 15:18:33 GMT
Title: Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User Input
Authors: Tessa Masis, Brendan O'Connor,
Abstract summary: We present a method which represents real-world locations as averaged embeddings from labeled user-input location names. We show that our approach improves geo-entity linking on a global and multilingual social media dataset.
Score: 2.516307239032451
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Geo-entity linking is the task of linking a location mention to the real-world geographic location. In this paper we explore the challenging task of geo-entity linking for noisy, multilingual social media data. There are few open-source multilingual geo-entity linking tools available and existing ones are often rule-based, which break easily in social media settings, or LLM-based, which are too expensive for large-scale datasets. We present a method which represents real-world locations as averaged embeddings from labeled user-input location names and allows for selective prediction via an interpretable confidence score. We show that our approach improves geo-entity linking on a global and multilingual social media dataset, and discuss progress and problems with evaluating at different geographic granularities.

Related papers

OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence [51.0456395687016]
multimodal large language models (LLMs) have opened new frontiers in artificial intelligence. We propose a MLLM (OmniGeo) tailored to geospatial applications. By combining the strengths of natural language understanding and spatial reasoning, our model enhances the ability of instruction following and the accuracy of GeoAI systems.
arXiv Detail & Related papers (2025-03-20T16:45:48Z)
Towards a Barrier-free GeoQA Portal: Natural Language Interaction with Geospatial Data Using Multi-Agent LLMs and Semantic Search [2.9658923973538034]
We propose a GeoQA Portal using a multi-agent Large Language Model framework for seamless natural language interaction with geospatial data. Case studies, evaluations, and user tests confirm its effectiveness for non-experts, bridging GIS complexity and public access.
arXiv Detail & Related papers (2025-03-18T13:39:46Z)
Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework [59.42946541163632]
We introduce a comprehensive geolocation framework with three key components. GeoComp, a large-scale dataset; GeoCoT, a novel reasoning method; and GeoEval, an evaluation metric. We demonstrate that GeoCoT significantly boosts geolocation accuracy by up to 25% while enhancing interpretability.
arXiv Detail & Related papers (2025-02-19T14:21:25Z)
Leveraging Large Language Models to Geolocate Linguistic Variations in Social Media Posts [0.0]
We address the GeoLingIt challenge of geolocalizing tweets written in Italian by leveraging large language models (LLMs) Our approach involves fine-tuning pre-trained LLMs to simultaneously predict these geolocalization aspects. This work is conducted as part of the Large Language Models course at the Bertinoro International Spring School 2024.
arXiv Detail & Related papers (2024-07-22T20:54:35Z)
GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding [45.36562604939258]
This paper introduces GeoLM, a language model that enhances the understanding of geo-entities in natural language. We demonstrate that GeoLM exhibits promising capabilities in supporting toponym recognition, toponym linking, relation extraction, and geo-entity typing.
arXiv Detail & Related papers (2023-10-23T01:20:01Z)
GeoLLM: Extracting Geospatial Knowledge from Large Language Models [49.20315582673223]
We present GeoLLM, a novel method that can effectively extract geospatial knowledge from large language models. We demonstrate the utility of our approach across multiple tasks of central interest to the international community, including the measurement of population density and economic livelihoods. Our experiments reveal that LLMs are remarkably sample-efficient, rich in geospatial information, and robust across the globe.
arXiv Detail & Related papers (2023-10-10T00:03:23Z)
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization [61.10806364001535]
Worldwide Geo-localization aims to pinpoint the precise location of images taken anywhere on Earth. Existing approaches divide the globe into discrete geographic cells, transforming the problem into a classification task. We propose GeoCLIP, a novel CLIP-inspired Image-to-GPS retrieval approach that enforces alignment between the image and its corresponding GPS locations.
arXiv Detail & Related papers (2023-09-27T20:54:56Z)
Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese Geographic Re-Ranking [61.60169764507917]
Chinese geographic re-ranking task aims to find the most relevant addresses among retrieved candidates. We propose an innovative framework, namely Geo-Encoder, to more effectively integrate Chinese geographical semantics into re-ranking pipelines.
arXiv Detail & Related papers (2023-09-04T13:44:50Z)
GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark [56.08664336835741]
We propose a GeoGraphic Language Understanding Evaluation benchmark, named GeoGLUE. We collect data from open-released geographic resources and introduce six natural language understanding tasks. We pro vide evaluation experiments and analysis of general baselines, indicating the effectiveness and significance of the GeoGLUE benchmark.
arXiv Detail & Related papers (2023-05-11T03:21:56Z)
MGeo: Multi-Modal Geographic Pre-Training Method [49.78466122982627]
We propose a novel query-POI matching method Multi-modal Geographic language model (MGeo) MGeo represents GC as a new modality and is able to fully extract multi-modal correlations for accurate query-POI matching. Our proposed multi-modal pre-training method can significantly improve the query-POI matching capability of generic PTMs.
arXiv Detail & Related papers (2023-01-11T03:05:12Z)
Geosocial Location Classification: Associating Type to Places Based on Geotagged Social-Media Posts [22.313111311130662]
Associating type to locations can be used to enrich maps and can serve a plethora of geospatial applications. We study the problem of Geosocial Location Classification, where the type of a site, e.g., a building, is discovered based on social-media posts.
arXiv Detail & Related papers (2020-02-05T16:09:52Z)

This list is automatically generated from the titles and abstracts of the papers in this site.