Related papers: A Transformer-based Framework for POI-level Social Post Geolocation

POIFormer: A Transformer-Based Framework for Accurate and Scalable Point-of-Interest Attribution [3.729614737011418]
POIFormer is a novel Transformer-based framework for accurate and efficient POI attribution. POIFormer enables accurate, efficient attribution in large, noisy mobility datasets.
arXiv Detail & Related papers (2025-07-12T04:37:52Z)
AdaptGOT: A Pre-trained Model for Adaptive Contextual POI Representation Learning [7.277204616781735]
We propose the AdaptGOT model, which integrates the Adaptive representation learning technique and the Geographical-Co-Occurrence-Text representation. The AdaptGOT model comprises three key components: (1) contextual neighborhood generation, which integrates advanced mixed sampling techniques such as KNN, density-based, importance-based, and category-aware strategies to capture complex contextual neighborhoods; (2) an advanced GOT representation enhanced by an attention mechanism, designed to derive high-quality, customized representations and efficiently capture complex interrelations between POIs; and (3) the MoE-based adaptive encoder-decoder architecture, which ensures topological consistency and enriches contextual representation by
arXiv Detail & Related papers (2025-06-21T08:06:06Z)
LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization [58.65395773049273]
Location Preference Optimization (LPO) is a novel approach that leverages locational data to optimize interaction preferences. LPO uses information entropy to predict interaction positions by focusing on zones rich in information. Our code will be made publicly available soon, at https://github.com/AIDC-AI/LPO.
arXiv Detail & Related papers (2025-06-11T03:43:30Z)
Geography-Aware Large Language Models for Next POI Recommendation [21.03555605703108]
The next Point-of-Interest (POI) recommendation task aims to predict users' next destinations based on their historical movement data. We propose GA-LLM (Geography-Aware Large Language Model), a novel framework that enhances Large Language Models with two specialized components. Experiments on three real-world datasets demonstrate the state-of-the-art performance of GA-LLM.
arXiv Detail & Related papers (2025-05-18T03:20:20Z)
HMCGeo: IP Region Prediction Based on Hierarchical Multi-label Classification [9.993613732452122]
Fine-grained IP geolocation plays a critical role in applications such as location-based services and cybersecurity. This paper proposes a novel hierarchical multi-label classification framework for IP region prediction, named HMCGeo. We show that HMCGeo achieves superior performance across all geographical granularities, significantly outperforming existing IP geolocation methods.
arXiv Detail & Related papers (2025-01-26T08:58:14Z)
Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation [50.31351006532924]
Human pose estimation (HPE) has received increasing attention recently due to its wide application in motion analysis, virtual reality, healthcare, etc. It suffers from the lack of labeled diverse real-world datasets due to the time- and labor-intensive annotation. We introduce a novel framework that capitalizes on both representation aggregation and segregation for domain adaptive human pose estimation.
arXiv Detail & Related papers (2024-12-29T17:59:45Z)
MIP-GAF: A MLLM-annotated Benchmark for Most Important Person Localization and Group Context Understanding [12.572321050617571]
Estimating the Most Important Person (MIP) in any social event setup is a challenging problem due to contextual complexity and scarcity of labeled data. We aim to address the problem by annotating a large-scale 'in-the-wild' dataset for identifying human perceptions about MIP in an image. The proposed dataset will play a vital role in building the next-generation social situation understanding methods.
arXiv Detail & Related papers (2024-09-10T05:28:38Z)
Personalized Collaborative Fine-Tuning for On-Device Large Language Models [33.68104398807581]
We explore on-device self-supervised collaborative fine-tuning of large language models with limited local data availability. We introduce three distinct trust-weighted gradient aggregation schemes: weight similarity-based, prediction similarity-based and validation performance-based. Our protocols, driven by prediction and performance metrics, surpass both FedAvg and local fine-tuning methods.
arXiv Detail & Related papers (2024-04-15T12:54:31Z)
Global Point Cloud Registration Network for Large Transformations [46.7301374772952]
We present ReLaTo, an architecture that faces the cases where large transformations happen while maintaining good performance for local transformations. This paper uses a novel Softmax pooling layer to find correspondences in a bilateral consensus manner between two point sets, sampling the most confident matches. A target-guided denoising step is then applied to both the obtained matches and latent features, estimating the final fine registration.
arXiv Detail & Related papers (2024-03-26T18:52:48Z)
Towards Effective Next POI Prediction: Spatial and Semantic Augmentation with Remote Sensing Data [10.968721742000653]
We propose an effective deep-learning method within a two-step prediction framework. Our method first incorporates remote sensing data, capturing pivotal environmental context. We construct the QR-P graph for the user's historical trajectories to encapsulate historical travel knowledge.
arXiv Detail & Related papers (2024-03-22T04:22:36Z)
Multi-modal Representation Learning for Social Post Location Inference [7.911777986696313]
In this work, we propose a novel Multi-modal Representation Learning Framework (MRLF) capable of fusing different modalities of social posts for location inference. To overcome the noisy user-generated textual content, we introduce a novel attention-based character-aware module. The experimental results show that MRLF can make accurate location predictions and open a new door to understanding the multi-modal data of social posts for online inference tasks.
arXiv Detail & Related papers (2023-06-11T02:35:48Z)
MGeo: Multi-Modal Geographic Pre-Training Method [49.78466122982627]
We propose a novel query-POI matching method, the Multi-modal Geographic language model (MGeo). MGeo represents GC as a new modality and is able to fully extract multi-modal correlations for accurate query-POI matching. Our proposed multi-modal pre-training method can significantly improve the query-POI matching capability of generic PTMs.
arXiv Detail & Related papers (2023-01-11T03:05:12Z)
LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context Propagation in Transformers [60.51925353387151]
We propose a novel module named Local Context Propagation (LCP) to exploit the message passing between neighboring local regions. We use the overlap points of adjacent local regions as intermediaries, then re-weight the features of these shared points from different local regions before passing them to the next layers. The proposed method is applicable to different tasks and outperforms various transformer-based methods in benchmarks including 3D shape classification and dense prediction tasks.
arXiv Detail & Related papers (2022-10-23T15:43:01Z)
Self-supervised Graph-based Point-of-interest Recommendation [66.58064122520747]
Next Point-of-Interest (POI) recommendation has become a prominent component in location-based e-commerce. We propose a Self-supervised Graph-enhanced POI Recommender (S2GRec) for next POI recommendation. In particular, we devise a novel Graph-enhanced Self-attentive layer to incorporate the collaborative signals from both global transition graph and local trajectory graphs.
arXiv Detail & Related papers (2022-10-22T17:29:34Z)
Hierarchical Local-Global Transformer for Temporal Sentence Grounding [58.247592985849124]
This paper studies the multimedia problem of temporal sentence grounding. It aims to accurately determine the specific video segment in an untrimmed video according to a given sentence query.
arXiv Detail & Related papers (2022-08-31T14:16:56Z)
Cross-Domain Facial Expression Recognition: A Unified Evaluation Benchmark and Adversarial Graph Learning [85.6386289476598]
We develop a novel adversarial graph representation adaptation (AGRA) framework for cross-domain holistic-local feature co-adaptation. We conduct extensive and fair evaluations on several popular benchmarks and show that the proposed AGRA framework outperforms previous state-of-the-art methods.
arXiv Detail & Related papers (2020-08-03T15:00:31Z)
A Unified Theory of Decentralized SGD with Changing Topology and Local Updates [70.9701218475002]
We introduce a unified convergence analysis of decentralized communication methods. We derive universal convergence rates for several applications. Our proofs rely on weak assumptions.
arXiv Detail & Related papers (2020-03-23T17:49:15Z)

This list is automatically generated from the titles and abstracts of the papers on this site.