Recognition of Implicit Geographic Movement in Text
- URL: http://arxiv.org/abs/2201.12799v1
- Date: Sun, 30 Jan 2022 12:22:55 GMT
- Title: Recognition of Implicit Geographic Movement in Text
- Authors: Scott Pezanowski, Prasenjit Mitra
- Abstract summary: Analyzing the geographic movement of humans, animals, and other phenomena is a growing field of research.
We created a corpus of sentences labeled as describing geographic movement or not.
We developed an iterative process employing hand labeling, crowd voting for confirmation, and machine learning to predict more labels.
- Score: 3.3241479835797123
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Analyzing the geographic movement of humans, animals, and other phenomena is
a growing field of research. This research has benefited urban planning,
logistics, animal migration understanding, and much more. Typically, the
movement is captured as precise geographic coordinates and time stamps with
Global Positioning Systems (GPS). Although some research uses computational
techniques to take advantage of implicit movement in descriptions of route
directions, hiking paths, and historical exploration routes, innovation would
accelerate with a large and diverse corpus. We created a corpus of sentences
labeled as describing geographic movement or not and including the type of
entity moving. Creating this corpus proved difficult without any comparable
corpora to start with, high human labeling costs, and since movement can at
times be interpreted differently. To overcome these challenges, we developed an
iterative process employing hand labeling, crowd voting for confirmation, and
machine learning to predict more labels. By merging advances in word embeddings
with traditional machine learning models and model ensembling, prediction
accuracy is at an acceptable level to produce a large silver-standard corpus
despite the small gold-standard corpus training set. Our corpus will likely
benefit computational processing of geography in text and spatial cognition, in
addition to detection of movement.
Related papers
- Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition [24.701574433327746]
We investigate the feasibility of probing a large language model for geography-based object knowledge.
We propose geography knowledge regularization to ensure that soft prompts trained on a source set of geographies generalize to an unseen target set.
Accuracy gains over prompting baselines on DollarStreet are up to +2.8/1.2/1.6 on target data from Africa/Asia/Americas, and +4.6 overall on the hardest classes.
arXiv Detail & Related papers (2024-01-03T01:11:16Z) - Navigation with Large Language Models: Semantic Guesswork as a Heuristic
for Planning [73.0990339667978]
Navigation in unfamiliar environments presents a major challenge for robots.
We use language models to bias exploration of novel real-world environments.
We evaluate LFG in challenging real-world environments and simulated benchmarks.
arXiv Detail & Related papers (2023-10-16T06:21:06Z) - Bridging the Gap between Human Motion and Action Semantics via Kinematic Phrases [59.32509533292653]
Motion understanding aims to establish a reliable mapping between motion and action semantics.
We propose Kinematic Phrases (KP) that take the objective kinematic facts of human motion with proper abstraction, interpretability, and generality.
Based on KP, we can unify a motion knowledge base and build a motion understanding system.
arXiv Detail & Related papers (2023-10-06T12:08:15Z) - Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese
Geographic Re-Ranking [61.60169764507917]
Chinese geographic re-ranking task aims to find the most relevant addresses among retrieved candidates.
We propose an innovative framework, namely Geo-Encoder, to more effectively integrate Chinese geographical semantics into re-ranking pipelines.
arXiv Detail & Related papers (2023-09-04T13:44:50Z) - SituatedGen: Incorporating Geographical and Temporal Contexts into
Generative Commonsense Reasoning [55.44065342292846]
Generative commonsense reasoning is the task that requires machines, given a group of keywords, to compose a single coherent sentence with commonsense plausibility.
We formalize this challenging task as SituatedGen, where machines with commonsense should generate a pair of contrastive sentences given a group of keywords including geographical or temporal entities.
Experiments show that state-of-the-art generative language models struggle to generate sentences with commonsense plausibility and still lag far behind human performance.
arXiv Detail & Related papers (2023-06-21T20:36:55Z) - GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark [56.08664336835741]
We propose a GeoGraphic Language Understanding Evaluation benchmark, named GeoGLUE.
We collect data from open-released geographic resources and introduce six natural language understanding tasks.
We pro vide evaluation experiments and analysis of general baselines, indicating the effectiveness and significance of the GeoGLUE benchmark.
arXiv Detail & Related papers (2023-05-11T03:21:56Z) - Exploring Descriptions of Movement Through Geovisual Analytics [2.813813570843999]
We present GeoMovement, a system that is based on combining machine learning and rule-based extraction of movement-related information with state-of-the-art visualization techniques.
Along with the depiction of movement, our tool can extract and present a lack of movement.
arXiv Detail & Related papers (2022-03-01T18:23:02Z) - Differentiating Geographic Movement Described in Text Documents [2.813813570843999]
We show how interpreting geographic movement described in text is challenging because of general spatial terms, linguistic constructions that make the thing(s) moving unclear.
We identify multiple important characteristics of movement descriptions that humans use to differentiate one movement description from another.
Our findings contribute towards an improved understanding of the important characteristics of the underused information about geographic movement that is in the form of text descriptions.
arXiv Detail & Related papers (2022-01-12T11:49:13Z) - UCPhrase: Unsupervised Context-aware Quality Phrase Tagging [63.86606855524567]
UCPhrase is a novel unsupervised context-aware quality phrase tagger.
We induce high-quality phrase spans as silver labels from consistently co-occurring word sequences.
We show that our design is superior to state-of-the-art pre-trained, unsupervised, and distantly supervised methods.
arXiv Detail & Related papers (2021-05-28T19:44:24Z) - Unsupervised embedding of trajectories captures the latent structure of
scientific migration [4.028844692958469]
We show the ability of the model word2vec to encode nuanced relationships between discrete locations from migration trajectories.
We show that the power of word2vec to encode migration patterns stems from its mathematical equivalence with the gravity model of mobility.
Using techniques that leverage its semantic structure, we demonstrate that embeddings can learn the rich structure that underpins scientific migration.
arXiv Detail & Related papers (2020-12-04T18:58:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.