Multi-view user representation learning for user matching without
personal information
- URL: http://arxiv.org/abs/2312.14533v1
- Date: Fri, 22 Dec 2023 08:58:42 GMT
- Title: Multi-view user representation learning for user matching without
personal information
- Authors: Hongliu Cao, Ilias El Baamrani, Eoin Thomas
- Abstract summary: We propose a similarity based multi-view information fusion to learn a better user representation from URLs.
The experimental results show that the proposed multi-view user representation learning can take advantage of the complementary information from different views.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: As the digitization of travel industry accelerates, analyzing and
understanding travelers' behaviors becomes increasingly important. However,
traveler data frequently exhibit high data sparsity due to the relatively low
frequency of user interactions with travel providers. Compounding this effect
the multiplication of devices, accounts and platforms while browsing travel
products online also leads to data dispersion. To deal with these challenges,
probabilistic traveler matching can be used. Most existing solutions for user
matching are not suitable for traveler matching as a traveler's browsing
history is typically short and URLs in the travel industry are very
heterogeneous with many tokens. To deal with these challenges, we propose the
similarity based multi-view information fusion to learn a better user
representation from URLs by treating the URLs as multi-view data. The
experimental results show that the proposed multi-view user representation
learning can take advantage of the complementary information from different
views, highlight the key information in URLs and perform significantly better
than other representation learning solutions for the user matching task.
Related papers
- TRACE: Transformer-based user Representations from Attributed Clickstream Event sequences [37.69303106863453]
We introduce TRACE, a novel approach to generate rich user embeddings from live multi-session clickstreams for real-time recommendation applications.
We demonstrate TRACE's superior performance over vanilla transformer and LLM-style architectures through extensive experiments on a large-scale travel e-commerce dataset.
arXiv Detail & Related papers (2024-09-02T23:33:19Z) - Retrieval Augmentation via User Interest Clustering [57.63883506013693]
Industrial recommender systems are sensitive to the patterns of user-item engagement.
We propose a novel approach that efficiently constructs user interest and facilitates low computational cost inference.
Our approach has been deployed in multiple products at Meta, facilitating short-form video related recommendation.
arXiv Detail & Related papers (2024-08-07T16:35:10Z) - Regularized Contrastive Partial Multi-view Outlier Detection [76.77036536484114]
We propose a novel method named Regularized Contrastive Partial Multi-view Outlier Detection (RCPMOD)
In this framework, we utilize contrastive learning to learn view-consistent information and distinguish outliers by the degree of consistency.
Experimental results on four benchmark datasets demonstrate that our proposed approach could outperform state-of-the-art competitors.
arXiv Detail & Related papers (2024-08-02T14:34:27Z) - Knowledge-Aware Multi-Intent Contrastive Learning for Multi-Behavior Recommendation [6.522900133742931]
Multi-behavioral recommendation provides users with more accurate choices based on diverse behaviors, such as view, add to cart, and purchase.
We propose a novel model: Knowledge-Aware Multi-Intent Contrastive Learning (KAMCL) model.
This model uses relationships in the knowledge graph to construct intents, aiming to mine the connections between users' multi-behaviors from the perspective of intents to achieve more accurate recommendations.
arXiv Detail & Related papers (2024-04-18T08:39:52Z) - Accelerating exploration and representation learning with offline
pre-training [52.6912479800592]
We show that exploration and representation learning can be improved by separately learning two different models from a single offline dataset.
We show that learning a state representation using noise-contrastive estimation and a model of auxiliary reward can significantly improve the sample efficiency on the challenging NetHack benchmark.
arXiv Detail & Related papers (2023-03-31T18:03:30Z) - Cross-view Graph Contrastive Representation Learning on Partially
Aligned Multi-view Data [52.491074276133325]
Multi-view representation learning has developed rapidly over the past decades and has been applied in many fields.
We propose a new cross-view graph contrastive learning framework, which integrates multi-view information to align data and learn latent representations.
Experiments conducted on several real datasets demonstrate the effectiveness of the proposed method on the clustering and classification tasks.
arXiv Detail & Related papers (2022-11-08T09:19:32Z) - Perceptual Score: What Data Modalities Does Your Model Perceive? [73.75255606437808]
We introduce the perceptual score, a metric that assesses the degree to which a model relies on the different subsets of the input features.
We find that recent, more accurate multi-modal models for visual question-answering tend to perceive the visual data less than their predecessors.
Using the perceptual score also helps to analyze model biases by decomposing the score into data subset contributions.
arXiv Detail & Related papers (2021-10-27T12:19:56Z) - What's Your Value of Travel Time? Collecting Traveler-Centered Mobility
Data via Crowdsourcing [4.297843164736973]
We build upon a different paradigm of worthwhile time in which travelers can use their travel time for other activities.
We present a new dataset, which contains data about travelers and their journeys, collected from a dedicated mobile application.
arXiv Detail & Related papers (2021-04-12T20:48:28Z) - Destination similarity based on implicit user interest [0.0]
A new similarity method is proposed to measure the destination similarity in terms of implicit user interest.
By comparing the proposed method to several other widely used similarity measures in recommender systems, the proposed method achieves a significant improvement on travel data.
arXiv Detail & Related papers (2021-02-12T18:45:23Z) - Virtual ID Discovery from E-commerce Media at Alibaba: Exploiting
Richness of User Click Behavior for Visual Search Relevance [40.98749837102654]
We propose to discover Virtual ID from user click behavior to improve visual search relevance at Alibaba.
As a totally click-data driven approach, we collect various types of click data for training deep networks without any human annotations.
Our networks are more effective to encode richer supervision and better distinguish real-shot images in terms of category and feature.
arXiv Detail & Related papers (2021-02-09T06:31:20Z) - Laplacian Denoising Autoencoder [114.21219514831343]
We propose to learn data representations with a novel type of denoising autoencoder.
The noisy input data is generated by corrupting latent clean data in the gradient domain.
Experiments on several visual benchmarks demonstrate that better representations can be learned with the proposed approach.
arXiv Detail & Related papers (2020-03-30T16:52:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.