UniTraj: Learning a Universal Trajectory Foundation Model from Billion-Scale Worldwide Traces
- URL: http://arxiv.org/abs/2411.03859v2
- Date: Sat, 16 Nov 2024 06:53:43 GMT
- Title: UniTraj: Learning a Universal Trajectory Foundation Model from Billion-Scale Worldwide Traces
- Authors: Yuanshao Zhu, James Jianqiao Yu, Xiangyu Zhao, Xuetao Wei, Yuxuan Liang,
- Abstract summary: UniTraj is a task-adaptive, region-independent, and highly generalizable human trajectory foundation model.
WorldTrace is the first large-scale, high-quality, globally distributed dataset sourced from open web platforms.
UniTraj consistently outperforms existing approaches in terms of scalability and adaptability.
- Score: 33.519954227942016
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Human trajectory modeling is essential for deciphering movement patterns and supporting advanced applications across various domains. However, existing methods are often tailored to specific tasks and regions, resulting in limitations related to task specificity, regional dependency, and data quality sensitivity. Addressing these challenges requires a universal human trajectory foundation model capable of generalizing and scaling across diverse tasks and geographic contexts. To this end, we propose UniTraj, a Universal human Trajectory foundation model that is task-adaptive, region-independent, and highly generalizable. To further enhance performance, we construct WorldTrace, the first large-scale, high-quality, globally distributed dataset sourced from open web platforms, encompassing 2.45 million trajectories with billions of points across 70 countries. Through multiple resampling and masking strategies designed for pre-training, UniTraj effectively overcomes geographic and task constraints, adapting to heterogeneous data quality. Extensive experiments across multiple trajectory analysis tasks and real-world datasets demonstrate that UniTraj consistently outperforms existing approaches in terms of scalability and adaptability. These results underscore the potential of UniTraj as a versatile, robust solution for a wide range of trajectory analysis applications, with WorldTrace serving as an ideal but non-exclusive foundation for training.
Related papers
- Trajectory World Models for Heterogeneous Environments [67.27233466954814]
Heterogeneity in sensors and actuators across environments poses a significant challenge to building large-scale pre-trained world models.
We introduce UniTraj, a unified dataset comprising over one million trajectories from 80 environments, designed to scale data while preserving critical diversity.
We propose TrajWorld, a novel architecture capable of flexibly handling varying sensor and actuator information and capturing environment dynamics in-context.
arXiv Detail & Related papers (2025-02-03T13:59:08Z) - One Fits All: General Mobility Trajectory Modeling via Masked Conditional Diffusion [11.373845190033297]
Trajectory data play a crucial role in many applications, ranging from network optimization to urban planning.
Existing studies on trajectory data are task-specific, and their applicability is limited to the specific tasks on which they have been trained, such as generation, recovery, or prediction.
We propose a general trajectory modeling framework via conditional diffusion (named GenMove)
Our model significantly outperforms state-of-the-art baselines, with the highest performance exceeding 13% improvement in generation tasks.
arXiv Detail & Related papers (2025-01-23T03:13:45Z) - TrajLearn: Trajectory Prediction Learning using Deep Generative Models [4.097342535693401]
Trajectory prediction aims to estimate an entity's future path using its current position and historical movement data.
To address these challenges, we introduce TrajLearn, a novel model for trajectory prediction.
TrajLearn predicts the next $k$ steps by integrating a customized beam search for exploring multiple potential paths.
arXiv Detail & Related papers (2024-12-30T23:38:52Z) - A Wave is Worth 100 Words: Investigating Cross-Domain Transferability in Time Series [13.555837288440946]
This paper proposes a novel cross-domain pretraining method based on Wave Quantization (termed as WQ4TS)
We transfer the time series data from different domains into a common spectral latent space, and enable the model to learn the temporal pattern knowledge of different domains directly from the common space.
The proposed WQ4TS achieves the best performance on 87.5% of all tasks, and the average improvement of the metrics on all the tasks is up to 34.7%.
arXiv Detail & Related papers (2024-12-01T11:35:06Z) - Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection [64.08296187555095]
Uni$2$Det is a framework for unified and universal multi-dataset training on 3D detection.
We introduce multi-stage prompting modules for multi-dataset 3D detection.
Results on zero-shot cross-dataset transfer validate the generalization capability of our proposed method.
arXiv Detail & Related papers (2024-09-30T17:57:50Z) - Transferable Unsupervised Outlier Detection Framework for Human Semantic Trajectories [9.816270572121724]
We propose Transferable Outlier Detection for Human Semantic Trajectories (TOD4Traj) framework.
ToD4Traj first introduces a modality feature unification module to align diverse data feature representations.
A contrastive learning module is further pro-posed for identifying regular mobility patterns both temporally and across populations.
arXiv Detail & Related papers (2024-09-28T22:31:00Z) - ControlTraj: Controllable Trajectory Generation with Topology-Constrained Diffusion Model [39.0442700565278]
ControlTraj is a Controllable Trajectory generation framework with the topology-constrained diffusion model.
We develop a novel road segment autoencoder to extract fine-grained road segment embedding.
The encoded features, along with trip attributes, are subsequently merged into the proposed geographic denoising UNet architecture.
arXiv Detail & Related papers (2024-04-23T09:42:45Z) - SilverSight: A Multi-Task Chinese Financial Large Language Model Based on Adaptive Semantic Space Learning [4.540505713937026]
This study introduces an Adaptive Semantic Space Learning (ASSL) framework to enhance the performance and selection efficacy of multi-expert models.
Our research findings demonstrate that our framework can achieve results close to those obtained with full data training using only 10% of the data, while also exhibiting strong generalization capabilities.
arXiv Detail & Related papers (2024-04-07T13:02:21Z) - Federated Learning with Projected Trajectory Regularization [65.6266768678291]
Federated learning enables joint training of machine learning models from distributed clients without sharing their local data.
One key challenge in federated learning is to handle non-identically distributed data across the clients.
We propose a novel federated learning framework with projected trajectory regularization (FedPTR) for tackling the data issue.
arXiv Detail & Related papers (2023-12-22T02:12:08Z) - UnitedHuman: Harnessing Multi-Source Data for High-Resolution Human
Generation [59.77275587857252]
A holistic human dataset inevitably has insufficient and low-resolution information on local parts.
We propose to use multi-source datasets with various resolution images to jointly learn a high-resolution human generative model.
arXiv Detail & Related papers (2023-09-25T17:58:46Z) - A General Purpose Neural Architecture for Geospatial Systems [142.43454584836812]
We present a roadmap towards the construction of a general-purpose neural architecture (GPNA) with a geospatial inductive bias.
We envision how such a model may facilitate cooperation between members of the community.
arXiv Detail & Related papers (2022-11-04T09:58:57Z) - Multi-Level Branched Regularization for Federated Learning [46.771459325434535]
We propose a novel architectural regularization technique that constructs multiple auxiliary branches in each local model by grafting local and globalworks at several different levels.
We demonstrate remarkable performance gains in terms of accuracy and efficiency compared to existing methods.
arXiv Detail & Related papers (2022-07-14T13:59:26Z) - DRFLM: Distributionally Robust Federated Learning with Inter-client
Noise via Local Mixup [58.894901088797376]
federated learning has emerged as a promising approach for training a global model using data from multiple organizations without leaking their raw data.
We propose a general framework to solve the above two challenges simultaneously.
We provide comprehensive theoretical analysis including robustness analysis, convergence analysis, and generalization ability.
arXiv Detail & Related papers (2022-04-16T08:08:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.