Exploiting Latent Codes: Interactive Fashion Product Generation, Similar
  Image Retrieval, and Cross-Category Recommendation using Variational
  Autoencoders
        - URL: http://arxiv.org/abs/2009.01053v1
- Date: Wed, 2 Sep 2020 13:27:30 GMT
- Title: Exploiting Latent Codes: Interactive Fashion Product Generation, Similar
  Image Retrieval, and Cross-Category Recommendation using Variational
  Autoencoders
- Authors: James-Andrew Sarmiento
- Abstract summary: Author proposes using Variational Autoencoder (VAE) to build an interactive fashion product application framework.
This pipeline is applicable in the booming industry of e-commerce enabling direct user interaction in specifying desired products.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   The rise of deep learning applications in the fashion industry has fueled
advances in curating large-scale datasets to build applications for product
design, image retrieval, and recommender systems. In this paper, the author
proposes using Variational Autoencoder (VAE) to build an interactive fashion
product application framework that allows the users to generate products with
attributes according to their liking, retrieve similar styles for the same
product category, and receive content-based recommendations from other
categories. Fashion product images dataset containing eyewear, footwear, and
bags are appropriate to illustrate that this pipeline is applicable in the
booming industry of e-commerce enabling direct user interaction in specifying
desired products paired with new methods for data matching, and recommendation
systems by using VAE and exploiting its generated latent codes.
 
      
        Related papers
        - Visual Product Graph: Bridging Visual Products And Composite Images For   End-to-End Style Recommendations [1.130790932059036]
 Visual Product Graph (VPG) is an online real-time retrieval system that enables navigation from individual products to composite scenes containing those products, along with complementary recommendations.<n>Our system achieves a 78.8% extremely similar@1 in end-to-end human relevance evaluations, and a 6% module engagement rate.<n>The "Ways to Style It" module, powered by the Visual Product Graph technology, is deployed in production at Pinterest.
 arXiv  Detail & Related papers  (2025-05-27T17:26:55Z)
- Universal Item Tokenization for Transferable Generative Recommendation [89.42584009980676]
 We propose UTGRec, a universal item tokenization approach for transferable Generative Recommendation.
By devising tree-structured codebooks, we discretize content representations into corresponding codes for item tokenization.
For raw content reconstruction, we employ dual lightweight decoders to reconstruct item text and images from discrete representations.
For collaborative knowledge integration, we assume that co-occurring items are similar and integrate collaborative signals through co-occurrence alignment and reconstruction.
 arXiv  Detail & Related papers  (2025-04-06T08:07:49Z)
- Multi-Modality Transformer for E-Commerce: Inferring User Purchase   Intention to Bridge the Query-Product Gap [1.2356255208135267]
 PINCER transforms initial user queries into pseudo-product representations.
We demonstrate our model's superior performance over state-of-the-art alternatives on e-commerce online retrieval.
 arXiv  Detail & Related papers  (2025-01-21T23:47:39Z)
- Multi-modal Generative Models in Recommendation System [34.45328907249946]
 Many recommendation systems limit user inputs to text strings or behavior signals such as clicks and purchases.
With the advent of generative AI, users have come to expect richer levels of interactions.
We argue that future recommendation systems will benefit from a multi-modal understanding of the products.
 arXiv  Detail & Related papers  (2024-09-17T08:55:50Z)
- MMGRec: Multimodal Generative Recommendation with Transformer Model [81.61896141495144]
 MMGRec aims to introduce a generative paradigm into multimodal recommendation.
We first devise a hierarchical quantization method Graph CF-RQVAE to assign Rec-ID for each item from its multimodal information.
We then train a Transformer-based recommender to generate the Rec-IDs of user-preferred items based on historical interaction sequences.
 arXiv  Detail & Related papers  (2024-04-25T12:11:27Z)
- A Review of Modern Recommender Systems Using Generative Models   (Gen-RecSys) [57.30228361181045]
 This survey connects key advancements in recommender systems using Generative Models (Gen-RecSys)
It covers: interaction-driven generative models; the use of large language models (LLM) and textual data for natural language recommendation; and the integration of multimodal models for generating and processing images/videos in RS.
Our work highlights necessary paradigms for evaluating the impact and harm of Gen-RecSys and identifies open challenges.
 arXiv  Detail & Related papers  (2024-03-31T06:57:57Z)
- Ada-Retrieval: An Adaptive Multi-Round Retrieval Paradigm for Sequential
  Recommendations [50.03560306423678]
 We propose Ada-Retrieval, an adaptive multi-round retrieval paradigm for recommender systems.
Ada-Retrieval iteratively refines user representations to better capture potential candidates in the full item space.
 arXiv  Detail & Related papers  (2024-01-12T15:26:40Z)
- Unified Vision-Language Representation Modeling for E-Commerce
  Same-Style Products Retrieval [12.588713044749177]
 Same-style products retrieval plays an important role in e-commerce platforms.
We propose a unified vision-language modeling method for e-commerce same-style products retrieval.
It is capable of cross-modal product-to-product retrieval, as well as style transfer and user-interactive search.
 arXiv  Detail & Related papers  (2023-02-10T07:24:23Z)
- Towards High-Order Complementary Recommendation via Logical Reasoning
  Network [19.232457960085625]
 We propose a logical reasoning network, LOGIREC, to learn embeddings of products.
 LOGIREC is capable of capturing the asymmetric complementary relationship between products.
We also propose a hybrid network that is jointly optimized for learning a more generic product representation.
 arXiv  Detail & Related papers  (2022-12-09T16:27:03Z)
- ItemSage: Learning Product Embeddings for Shopping Recommendations at
  Pinterest [60.841761065439414]
 At Pinterest, we build a single set of product embeddings called ItemSage to provide relevant recommendations in all shopping use cases.
This approach has led to significant improvements in engagement and conversion metrics, while reducing both infrastructure and maintenance cost.
 arXiv  Detail & Related papers  (2022-05-24T02:28:58Z)
- Knowledge-Enhanced Hierarchical Graph Transformer Network for
  Multi-Behavior Recommendation [56.12499090935242]
 This work proposes a Knowledge-Enhanced Hierarchical Graph Transformer Network (KHGT) to investigate multi-typed interactive patterns between users and items in recommender systems.
 KHGT is built upon a graph-structured neural architecture to capture type-specific behavior characteristics.
We show that KHGT consistently outperforms many state-of-the-art recommendation methods across various evaluation settings.
 arXiv  Detail & Related papers  (2021-10-08T09:44:00Z)
- A Retail Product Categorisation Dataset [2.538209532048867]
 identification of similar products is a common sub-task.
Our goal is to boost the evaluation of machine learning methods for the prediction of the category of the retail products.
 arXiv  Detail & Related papers  (2021-03-25T14:23:48Z)
- Exploration-Exploitation Motivated Variational Auto-Encoder for
  Recommender Systems [1.52292571922932]
 We introduce an exploitation-exploration motivated variational auto-encoder (XploVAE) to collaborative filtering.
To facilitate personalized recommendations, we construct user-specific subgraphs, which contain the first-order proximity capturing observed user-item interactions.
A hierarchical latent space model is utilized to learn the personalized item embedding for a given user, along with the population distribution of all user subgraphs.
 arXiv  Detail & Related papers  (2020-06-05T17:37:46Z)
- Learning Diverse Fashion Collocation by Neural Graph Filtering [78.9188246136867]
 We propose a novel fashion collocation framework, Neural Graph Filtering, that models a flexible set of fashion items via a graph neural network.
By applying symmetric operations on the edge vectors, this framework allows varying numbers of inputs/outputs and is invariant to their ordering.
We evaluate the proposed approach on three popular benchmarks, the Polyvore dataset, the Polyvore-D dataset, and our reorganized Amazon Fashion dataset.
 arXiv  Detail & Related papers  (2020-03-11T16:17:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.