A Transformer-based Framework for POI-level Social Post Geolocation
- URL: http://arxiv.org/abs/2211.01336v1
- Date: Wed, 26 Oct 2022 10:30:51 GMT
- Title: A Transformer-based Framework for POI-level Social Post Geolocation
- Authors: Menglin Li, Kwan Hui Lim, Teng Guo, Junhua Liu
- Abstract summary: We present a transformer-based general framework, which builds upon pre-trained language models and considers non-textual data.
We show that three variants of our proposed framework outperform multiple state-of-art baselines by a large margin in terms of accuracy and distance error metrics.
- Score: 4.027087283290081
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: POI-level geo-information of social posts is critical to many location-based
applications and services. However, the multi-modality, complexity and diverse
nature of social media data and their platforms limit the performance of
inferring such fine-grained locations and their subsequent applications. To
address this issue, we present a transformer-based general framework, which
builds upon pre-trained language models and considers non-textual data, for
social post geolocation at the POI level. To this end, inputs are categorized
to handle different social data, and an optimal combination strategy is
provided for feature representations. Moreover, a uniform representation of
hierarchy is proposed to learn temporal information, and a concatenated version
of encodings is employed to capture feature-wise positions better. Experimental
results on various social datasets demonstrate that three variants of our
proposed framework outperform multiple state-of-art baselines by a large margin
in terms of accuracy and distance error metrics.
Related papers
- POIFormer: A Transformer-Based Framework for Accurate and Scalable Point-of-Interest Attribution [3.729614737011418]
textsfPOIFormer is a novel Transformer-based framework for accurate and efficient POI attribution.<n>textsfPOIFormer enables accurate, efficient attribution in large, noisy mobility datasets.
arXiv Detail & Related papers (2025-07-12T04:37:52Z) - AdaptGOT: A Pre-trained Model for Adaptive Contextual POI Representation Learning [7.277204616781735]
We propose the AdaptGOT model, which integrates theAdaptive representation learning technique and the Geographical-Co-Occurrence-Text representation.<n>The AdaptGOT model comprises three key components: (1) contextual neighborhood generation, which integrates advanced mixed sampling techniques such as KNN, density-based, importance-based, and category-aware strategies to capture complex contextual neighborhoods; (2) an advanced GOT representation enhanced by an attention mechanism, designed to derive high-quality, customized representations and efficiently capture complex interrelations between POIs; and (3) the MoE-based adaptive encoder-decoder architecture, which ensures topological consistency and enriches contextual representation by
arXiv Detail & Related papers (2025-06-21T08:06:06Z) - LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization [58.65395773049273]
Location Preference Optimization (LPO) is a novel approach that leverages locational data to optimize interaction preferences.<n>LPO uses information entropy to predict interaction positions by focusing on zones rich in information.<n>Our code will be made publicly available soon, at https://github.com/AIDC-AI/LPO.
arXiv Detail & Related papers (2025-06-11T03:43:30Z) - Geography-Aware Large Language Models for Next POI Recommendation [21.03555605703108]
Next Point-of-Interest (POI) recommendation task aims to predict users' next destinations based on their historical movement data.<n>We propose GA-LLM (Geography-Aware Large Language Model), a novel framework that enhances Large Language Models with two specialized components.<n>Experiments on three real-world datasets demonstrate the state-of-the-art performance of GA-LLM.
arXiv Detail & Related papers (2025-05-18T03:20:20Z) - HMCGeo: IP Region Prediction Based on Hierarchical Multi-label Classification [9.993613732452122]
Fine-grained IP geolocation plays a critical role in applications such as location-based services and cybersecurity.
This paper proposes a novel hierarchical multi-label classification framework for IP region prediction, named HMCGeo.
We show that HMCGeo achieves superior performance across all geographical granularities, significantly outperforming existing IP geolocation methods.
arXiv Detail & Related papers (2025-01-26T08:58:14Z) - Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation [50.31351006532924]
Human pose estimation (HPE) has received increasing attention recently due to its wide application in motion analysis, virtual reality, healthcare, etc.
It suffers from the lack of labeled diverse real-world datasets due to the time- and labor-intensive annotation.
We introduce a novel framework that capitalizes on both representation aggregation and segregation for domain adaptive human pose estimation.
arXiv Detail & Related papers (2024-12-29T17:59:45Z) - MIP-GAF: A MLLM-annotated Benchmark for Most Important Person Localization and Group Context Understanding [12.572321050617571]
Estimating the Most Important Person (MIP) in any social event setup is a challenging problem due to contextual complexity and scarcity of labeled data.
We aim to address the problem by annotating a large-scale in-the-wild' dataset for identifying human perceptions about MIP in an image.
The proposed dataset will play a vital role in building the next-generation social situation understanding methods.
arXiv Detail & Related papers (2024-09-10T05:28:38Z) - Personalized Collaborative Fine-Tuning for On-Device Large Language Models [33.68104398807581]
We explore on-device self-supervised collaborative fine-tuning of large language models with limited local data availability.
We introduce three distinct trust-weighted gradient aggregation schemes: weight similarity-based, prediction similarity-based and validation performance-based.
Our protocols, driven by prediction and performance metrics, surpass both FedAvg and local fine-tuning methods.
arXiv Detail & Related papers (2024-04-15T12:54:31Z) - Global Point Cloud Registration Network for Large Transformations [46.7301374772952]
We present ReLaTo, an architecture that faces the cases where large transformations happen while maintaining good performance for local transformations.
This paper uses a novel Softmax pooling layer to find correspondences in a bilateral consensus manner between two point sets, sampling the most confident matches.
A target-guided denoising step is then applied to both the obtained matches and latent features, estimating the final fine registration.
arXiv Detail & Related papers (2024-03-26T18:52:48Z) - Towards Effective Next POI Prediction: Spatial and Semantic Augmentation with Remote Sensing Data [10.968721742000653]
We propose an effective deep-learning method within a two-step prediction framework.
Our method first incorporates remote sensing data, capturing pivotal environmental context.
We construct the QR-P graph for the user's historical trajectories to encapsulate historical travel knowledge.
arXiv Detail & Related papers (2024-03-22T04:22:36Z) - Multi-modal Representation Learning for Social Post Location Inference [7.911777986696313]
In this work, we propose a novel Multi-modal Representation Learning Framework (MRLF) capable of fusing different modalities of social posts for location inference.
To overcome the noisy user-generated textual content, we introduce a novel attention-based character-aware module.
The experimental results show that MRLF can make accurate location predictions and open a new door to understanding the multi-modal data of social posts for online inference tasks.
arXiv Detail & Related papers (2023-06-11T02:35:48Z) - MGeo: Multi-Modal Geographic Pre-Training Method [49.78466122982627]
We propose a novel query-POI matching method Multi-modal Geographic language model (MGeo)
MGeo represents GC as a new modality and is able to fully extract multi-modal correlations for accurate query-POI matching.
Our proposed multi-modal pre-training method can significantly improve the query-POI matching capability of generic PTMs.
arXiv Detail & Related papers (2023-01-11T03:05:12Z) - LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context
Propagation in Transformers [60.51925353387151]
We propose a novel module named Local Context Propagation (LCP) to exploit the message passing between neighboring local regions.
We use the overlap points of adjacent local regions as intermediaries, then re-weight the features of these shared points from different local regions before passing them to the next layers.
The proposed method is applicable to different tasks and outperforms various transformer-based methods in benchmarks including 3D shape classification and dense prediction tasks.
arXiv Detail & Related papers (2022-10-23T15:43:01Z) - Self-supervised Graph-based Point-of-interest Recommendation [66.58064122520747]
Next Point-of-Interest (POI) recommendation has become a prominent component in location-based e-commerce.
We propose a Self-supervised Graph-enhanced POI Recommender (S2GRec) for next POI recommendation.
In particular, we devise a novel Graph-enhanced Self-attentive layer to incorporate the collaborative signals from both global transition graph and local trajectory graphs.
arXiv Detail & Related papers (2022-10-22T17:29:34Z) - Hierarchical Local-Global Transformer for Temporal Sentence Grounding [58.247592985849124]
This paper studies the multimedia problem of temporal sentence grounding.
It aims to accurately determine the specific video segment in an untrimmed video according to a given sentence query.
arXiv Detail & Related papers (2022-08-31T14:16:56Z) - Cross-Domain Facial Expression Recognition: A Unified Evaluation
Benchmark and Adversarial Graph Learning [85.6386289476598]
We develop a novel adversarial graph representation adaptation (AGRA) framework for cross-domain holistic-local feature co-adaptation.
We conduct extensive and fair evaluations on several popular benchmarks and show that the proposed AGRA framework outperforms previous state-of-the-art methods.
arXiv Detail & Related papers (2020-08-03T15:00:31Z) - A Unified Theory of Decentralized SGD with Changing Topology and Local
Updates [70.9701218475002]
We introduce a unified convergence analysis of decentralized communication methods.
We derive universal convergence rates for several applications.
Our proofs rely on weak assumptions.
arXiv Detail & Related papers (2020-03-23T17:49:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.