Exploiting Latent Codes: Interactive Fashion Product Generation, Similar
Image Retrieval, and Cross-Category Recommendation using Variational
Autoencoders
- URL: http://arxiv.org/abs/2009.01053v1
- Date: Wed, 2 Sep 2020 13:27:30 GMT
- Title: Exploiting Latent Codes: Interactive Fashion Product Generation, Similar
Image Retrieval, and Cross-Category Recommendation using Variational
Autoencoders
- Authors: James-Andrew Sarmiento
- Abstract summary: Author proposes using Variational Autoencoder (VAE) to build an interactive fashion product application framework.
This pipeline is applicable in the booming industry of e-commerce enabling direct user interaction in specifying desired products.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The rise of deep learning applications in the fashion industry has fueled
advances in curating large-scale datasets to build applications for product
design, image retrieval, and recommender systems. In this paper, the author
proposes using Variational Autoencoder (VAE) to build an interactive fashion
product application framework that allows the users to generate products with
attributes according to their liking, retrieve similar styles for the same
product category, and receive content-based recommendations from other
categories. Fashion product images dataset containing eyewear, footwear, and
bags are appropriate to illustrate that this pipeline is applicable in the
booming industry of e-commerce enabling direct user interaction in specifying
desired products paired with new methods for data matching, and recommendation
systems by using VAE and exploiting its generated latent codes.
Related papers
- Multi-modal Generative Models in Recommendation System [34.45328907249946]
Many recommendation systems limit user inputs to text strings or behavior signals such as clicks and purchases.
With the advent of generative AI, users have come to expect richer levels of interactions.
We argue that future recommendation systems will benefit from a multi-modal understanding of the products.
arXiv Detail & Related papers (2024-09-17T08:55:50Z) - MMGRec: Multimodal Generative Recommendation with Transformer Model [81.61896141495144]
MMGRec aims to introduce a generative paradigm into multimodal recommendation.
We first devise a hierarchical quantization method Graph CF-RQVAE to assign Rec-ID for each item from its multimodal information.
We then train a Transformer-based recommender to generate the Rec-IDs of user-preferred items based on historical interaction sequences.
arXiv Detail & Related papers (2024-04-25T12:11:27Z) - A Review of Modern Recommender Systems Using Generative Models (Gen-RecSys) [57.30228361181045]
This survey connects key advancements in recommender systems using Generative Models (Gen-RecSys)
It covers: interaction-driven generative models; the use of large language models (LLM) and textual data for natural language recommendation; and the integration of multimodal models for generating and processing images/videos in RS.
Our work highlights necessary paradigms for evaluating the impact and harm of Gen-RecSys and identifies open challenges.
arXiv Detail & Related papers (2024-03-31T06:57:57Z) - Ada-Retrieval: An Adaptive Multi-Round Retrieval Paradigm for Sequential
Recommendations [50.03560306423678]
We propose Ada-Retrieval, an adaptive multi-round retrieval paradigm for recommender systems.
Ada-Retrieval iteratively refines user representations to better capture potential candidates in the full item space.
arXiv Detail & Related papers (2024-01-12T15:26:40Z) - Unified Vision-Language Representation Modeling for E-Commerce
Same-Style Products Retrieval [12.588713044749177]
Same-style products retrieval plays an important role in e-commerce platforms.
We propose a unified vision-language modeling method for e-commerce same-style products retrieval.
It is capable of cross-modal product-to-product retrieval, as well as style transfer and user-interactive search.
arXiv Detail & Related papers (2023-02-10T07:24:23Z) - Towards High-Order Complementary Recommendation via Logical Reasoning
Network [19.232457960085625]
We propose a logical reasoning network, LOGIREC, to learn embeddings of products.
LOGIREC is capable of capturing the asymmetric complementary relationship between products.
We also propose a hybrid network that is jointly optimized for learning a more generic product representation.
arXiv Detail & Related papers (2022-12-09T16:27:03Z) - ItemSage: Learning Product Embeddings for Shopping Recommendations at
Pinterest [60.841761065439414]
At Pinterest, we build a single set of product embeddings called ItemSage to provide relevant recommendations in all shopping use cases.
This approach has led to significant improvements in engagement and conversion metrics, while reducing both infrastructure and maintenance cost.
arXiv Detail & Related papers (2022-05-24T02:28:58Z) - Knowledge-Enhanced Hierarchical Graph Transformer Network for
Multi-Behavior Recommendation [56.12499090935242]
This work proposes a Knowledge-Enhanced Hierarchical Graph Transformer Network (KHGT) to investigate multi-typed interactive patterns between users and items in recommender systems.
KHGT is built upon a graph-structured neural architecture to capture type-specific behavior characteristics.
We show that KHGT consistently outperforms many state-of-the-art recommendation methods across various evaluation settings.
arXiv Detail & Related papers (2021-10-08T09:44:00Z) - A Retail Product Categorisation Dataset [2.538209532048867]
identification of similar products is a common sub-task.
Our goal is to boost the evaluation of machine learning methods for the prediction of the category of the retail products.
arXiv Detail & Related papers (2021-03-25T14:23:48Z) - Exploration-Exploitation Motivated Variational Auto-Encoder for
Recommender Systems [1.52292571922932]
We introduce an exploitation-exploration motivated variational auto-encoder (XploVAE) to collaborative filtering.
To facilitate personalized recommendations, we construct user-specific subgraphs, which contain the first-order proximity capturing observed user-item interactions.
A hierarchical latent space model is utilized to learn the personalized item embedding for a given user, along with the population distribution of all user subgraphs.
arXiv Detail & Related papers (2020-06-05T17:37:46Z) - Learning Diverse Fashion Collocation by Neural Graph Filtering [78.9188246136867]
We propose a novel fashion collocation framework, Neural Graph Filtering, that models a flexible set of fashion items via a graph neural network.
By applying symmetric operations on the edge vectors, this framework allows varying numbers of inputs/outputs and is invariant to their ordering.
We evaluate the proposed approach on three popular benchmarks, the Polyvore dataset, the Polyvore-D dataset, and our reorganized Amazon Fashion dataset.
arXiv Detail & Related papers (2020-03-11T16:17:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.