AI Recommendation System for Enhanced Customer Experience: A Novel
Image-to-Text Method
- URL: http://arxiv.org/abs/2311.09624v1
- Date: Thu, 16 Nov 2023 07:15:44 GMT
- Title: AI Recommendation System for Enhanced Customer Experience: A Novel
Image-to-Text Method
- Authors: Mohamaed Foued Ayedi, Hiba Ben Salem, Soulaimen Hammami, Ahmed Ben
Said, Rateb Jabbar, Achraf CHabbouh
- Abstract summary: This research describes an innovative end-to-end pipeline that uses artificial intelligence to provide fine-grained visual interpretation for fashion recommendations.
When customers upload images of desired products or outfits, the system automatically generates meaningful descriptions emphasizing stylistic elements.
- Score: 2.2975420753582028
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Existing fashion recommendation systems encounter difficulties in using
visual data for accurate and personalized recommendations. This research
describes an innovative end-to-end pipeline that uses artificial intelligence
to provide fine-grained visual interpretation for fashion recommendations. When
customers upload images of desired products or outfits, the system
automatically generates meaningful descriptions emphasizing stylistic elements.
These captions guide retrieval from a global fashion product catalogue to offer
similar alternatives that fit the visual characteristics of the original image.
On a dataset of over 100,000 categorized fashion photos, the pipeline was
trained and evaluated. The F1-score for the object detection model was 0.97,
exhibiting exact fashion object recognition capabilities optimized for
recommendation. This visually aware system represents a key advancement in
customer engagement through personalized fashion recommendations
Related papers
- Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder [57.574544285878794]
Ada-Adapter is a novel framework for few-shot style personalization of diffusion models.
Our method enables efficient zero-shot style transfer utilizing a single reference image.
We demonstrate the effectiveness of our approach on various artistic styles, including flat art, 3D rendering, and logo design.
arXiv Detail & Related papers (2024-07-08T02:00:17Z) - VILA: Learning Image Aesthetics from User Comments with Vision-Language
Pretraining [53.470662123170555]
We propose learning image aesthetics from user comments, and exploring vision-language pretraining methods to learn multimodal aesthetic representations.
Specifically, we pretrain an image-text encoder-decoder model with image-comment pairs, using contrastive and generative objectives to learn rich and generic aesthetic semantics without human labels.
Our results show that our pretrained aesthetic vision-language model outperforms prior works on image aesthetic captioning over the AVA-Captions dataset.
arXiv Detail & Related papers (2023-03-24T23:57:28Z) - HIVE: Harnessing Human Feedback for Instructional Visual Editing [127.29436858998064]
We present a novel framework to harness human feedback for instructional visual editing (HIVE)
Specifically, we collect human feedback on the edited images and learn a reward function to capture the underlying user preferences.
We then introduce scalable diffusion model fine-tuning methods that can incorporate human preferences based on the estimated reward.
arXiv Detail & Related papers (2023-03-16T19:47:41Z) - End-to-End Image-Based Fashion Recommendation [5.210197476419621]
In fashion-based recommendation settings, incorporating the item image features is considered a crucial factor.
We propose a simple yet effective attribute-aware model that incorporates image features for better item representation learning.
Experiments on two image-based real-world recommender systems datasets show that the proposed model significantly outperforms all state-of-the-art image-based models.
arXiv Detail & Related papers (2022-05-05T21:14:42Z) - Single-Item Fashion Recommender: Towards Cross-Domain Recommendations [0.0]
This article first suggests a content-based fashion recommender system that uses a parallel neural network to take a single fashion item shop image as input.
Next, the same structure is enhanced to personalize the results based on user preferences.
The last contribution of this paper is a new evaluation metric for recommendation tasks called objective-guided human score.
arXiv Detail & Related papers (2021-11-01T08:15:31Z) - Attribute-aware Explainable Complementary Clothing Recommendation [37.30129304097086]
This work aims to tackle the explainability challenge in fashion recommendation tasks by proposing a novel Attribute-aware Fashion Recommender (AFRec)
AFRec recommender assesses the outfit compatibility by explicitly leveraging the extracted attribute-level representations from each item's visual feature.
The attributes serve as the bridge between two fashion items, where we quantify the affinity of a pair of items through the learned compatibility between their attributes.
arXiv Detail & Related papers (2021-07-04T14:56:07Z) - User-Guided Personalized Image Aesthetic Assessment based on Deep
Reinforcement Learning [64.07820203919283]
We propose a novel user-guided personalized image aesthetic assessment framework.
It leverages user interactions to retouch and rank images for aesthetic assessment based on deep reinforcement learning (DRL)
It generates personalized aesthetic distribution that is more in line with the aesthetic preferences of different users.
arXiv Detail & Related papers (2021-06-14T15:19:48Z) - Addressing the Cold-Start Problem in Outfit Recommendation Using Visual
Preference Modelling [51.147871738838305]
This paper attempts to address the cold-start problem for new users by leveraging a novel visual preference modelling approach.
We demonstrate the use of our approach with feature-weighted clustering to personalise occasion-oriented outfit recommendation.
arXiv Detail & Related papers (2020-08-04T10:07:09Z) - Learning Diverse Fashion Collocation by Neural Graph Filtering [78.9188246136867]
We propose a novel fashion collocation framework, Neural Graph Filtering, that models a flexible set of fashion items via a graph neural network.
By applying symmetric operations on the edge vectors, this framework allows varying numbers of inputs/outputs and is invariant to their ordering.
We evaluate the proposed approach on three popular benchmarks, the Polyvore dataset, the Polyvore-D dataset, and our reorganized Amazon Fashion dataset.
arXiv Detail & Related papers (2020-03-11T16:17:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.