POIFormer: A Transformer-Based Framework for Accurate and Scalable Point-of-Interest Attribution
- URL: http://arxiv.org/abs/2507.09137v1
- Date: Sat, 12 Jul 2025 04:37:52 GMT
- Title: POIFormer: A Transformer-Based Framework for Accurate and Scalable Point-of-Interest Attribution
- Authors: Nripsuta Ani Saxena, Shang-Ling Hsu, Mehul Shetty, Omar Alkhadra, Cyrus Shahabi, Abigail L. Horn,
- Abstract summary: textsfPOIFormer is a novel Transformer-based framework for accurate and efficient POI attribution.<n>textsfPOIFormer enables accurate, efficient attribution in large, noisy mobility datasets.
- Score: 3.729614737011418
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accurately attributing user visits to specific Points of Interest (POIs) is a foundational task for mobility analytics, personalized services, marketing and urban planning. However, POI attribution remains challenging due to GPS inaccuracies, typically ranging from 2 to 20 meters in real-world settings, and the high spatial density of POIs in urban environments, where multiple venues can coexist within a small radius (e.g., over 50 POIs within a 100-meter radius in dense city centers). Relying on proximity is therefore often insufficient for determining which POI was actually visited. We introduce \textsf{POIFormer}, a novel Transformer-based framework for accurate and efficient POI attribution. Unlike prior approaches that rely on limited spatiotemporal, contextual, or behavioral features, \textsf{POIFormer} jointly models a rich set of signals, including spatial proximity, visit timing and duration, contextual features from POI semantics, and behavioral features from user mobility and aggregated crowd behavior patterns--using the Transformer's self-attention mechanism to jointly model complex interactions across these dimensions. By leveraging the Transformer to model a user's past and future visits (with the current visit masked) and incorporating crowd-level behavioral patterns through pre-computed KDEs, \textsf{POIFormer} enables accurate, efficient attribution in large, noisy mobility datasets. Its architecture supports generalization across diverse data sources and geographic contexts while avoiding reliance on hard-to-access or unavailable data layers, making it practical for real-world deployment. Extensive experiments on real-world mobility datasets demonstrate significant improvements over existing baselines, particularly in challenging real-world settings characterized by spatial noise and dense POI clustering.
Related papers
- LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization [58.65395773049273]
Location Preference Optimization (LPO) is a novel approach that leverages locational data to optimize interaction preferences.<n>LPO uses information entropy to predict interaction positions by focusing on zones rich in information.<n>Our code will be made publicly available soon, at https://github.com/AIDC-AI/LPO.
arXiv Detail & Related papers (2025-06-11T03:43:30Z) - MobCLIP: Learning General-purpose Geospatial Representation at Scale [0.7224497621488285]
We present MobCLIP, the first nationwide general-purpose location encoder.<n>Adopting a novel CLIP-based architecture, our framework aligns 100M+ POIs, nationwide remote sensing imagery, and structured demographic statistics with a billion-edge mobility graph.<n>We show MobCLIP achieves significantly superior general-purpose predictive performances than state-of-the-art models by an average of 35%.
arXiv Detail & Related papers (2025-06-02T04:14:03Z) - Mixture-of-Experts for Personalized and Semantic-Aware Next Location Prediction [20.726107072683575]
NextLocMoE is a novel framework built upon large language models (LLMs) and structured around a dual-level Mixture-of-Experts (MoE) design.<n>Our architecture comprises two specialized modules: a Location Semantics MoE that operates at the embedding level to encode rich functional semantics of locations, and a Personalized MoE embedded within the Transformer backbone to dynamically adapt to individual user mobility patterns.
arXiv Detail & Related papers (2025-05-30T13:45:19Z) - Geography-Aware Large Language Models for Next POI Recommendation [21.03555605703108]
Next Point-of-Interest (POI) recommendation task aims to predict users' next destinations based on their historical movement data.<n>We propose GA-LLM (Geography-Aware Large Language Model), a novel framework that enhances Large Language Models with two specialized components.<n>Experiments on three real-world datasets demonstrate the state-of-the-art performance of GA-LLM.
arXiv Detail & Related papers (2025-05-18T03:20:20Z) - PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments [73.80718037070773]
We present the multi-modal Pedestrian-Focused Scene dataset, rigorously annotated in semi-structured scenes with the format of nuScenes.<n>We also propose a novel Hybrid Multi-Scale Fusion Network (HMFN) to detect pedestrians in densely populated and occluded scenarios.
arXiv Detail & Related papers (2025-02-21T09:57:53Z) - Rethinking Urban Mobility Prediction: A Super-Multivariate Time Series
Forecasting Approach [71.67506068703314]
Long-term urban mobility predictions play a crucial role in the effective management of urban facilities and services.
Traditionally, urban mobility data has been structured as videos, treating longitude and latitude as fundamental pixels.
In our research, we introduce a fresh perspective on urban mobility prediction.
Instead of oversimplifying urban mobility data as traditional video data, we regard it as a complex time series.
arXiv Detail & Related papers (2023-12-04T07:39:05Z) - Revisiting Mobility Modeling with Graph: A Graph Transformer Model for
Next Point-of-Interest Recommendation [9.863982037173443]
Next Point-of-Interest (POI) recommendation plays a crucial role in urban mobility applications.
We propose textbfunderlineMobility textbfunderlineGraph textbfunderlineTransformer (MobGT)
MobGT enables us to fully leverage graphs to capture both the spatial and temporal features in users' mobility patterns.
arXiv Detail & Related papers (2023-10-02T14:11:16Z) - Self-supervised Graph-based Point-of-interest Recommendation [66.58064122520747]
Next Point-of-Interest (POI) recommendation has become a prominent component in location-based e-commerce.
We propose a Self-supervised Graph-enhanced POI Recommender (S2GRec) for next POI recommendation.
In particular, we devise a novel Graph-enhanced Self-attentive layer to incorporate the collaborative signals from both global transition graph and local trajectory graphs.
arXiv Detail & Related papers (2022-10-22T17:29:34Z) - Conditioned Human Trajectory Prediction using Iterative Attention Blocks [70.36888514074022]
We present a simple yet effective pedestrian trajectory prediction model aimed at pedestrians positions prediction in urban-like environments.
Our model is a neural-based architecture that can run several layers of attention blocks and transformers in an iterative sequential fashion.
We show that without explicit introduction of social masks, dynamical models, social pooling layers, or complicated graph-like structures, it is possible to produce on par results with SoTA models.
arXiv Detail & Related papers (2022-06-29T07:49:48Z) - Masked Transformer for Neighhourhood-aware Click-Through Rate Prediction [74.52904110197004]
We propose Neighbor-Interaction based CTR prediction, which put this task into a Heterogeneous Information Network (HIN) setting.
In order to enhance the representation of the local neighbourhood, we consider four types of topological interaction among the nodes.
We conduct comprehensive experiments on two real world datasets and the experimental results show that our proposed method outperforms state-of-the-art CTR models significantly.
arXiv Detail & Related papers (2022-01-25T12:44:23Z) - Modelling of Bi-directional Spatio-Temporal Dependence and Users'
Dynamic Preferences for Missing POI Check-in Identification [38.51964956686177]
We develop a model, named Bi-STDDP, which can integrate bi-directional-temporal dependence and users' dynamic preferences.
Results demonstrate significant improvements of our model compared with state-of-the-art methods.
arXiv Detail & Related papers (2021-12-31T03:54:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.