A Retail Product Categorisation Dataset
- URL: http://arxiv.org/abs/2103.13864v1
- Date: Thu, 25 Mar 2021 14:23:48 GMT
- Title: A Retail Product Categorisation Dataset
- Authors: Febin Sebastian Elayanithottathil and Janis Keuper
- Abstract summary: identification of similar products is a common sub-task.
Our goal is to boost the evaluation of machine learning methods for the prediction of the category of the retail products.
- Score: 2.538209532048867
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Most eCommerce applications, like web-shops have millions of products. In
this context, the identification of similar products is a common sub-task,
which can be utilized in the implementation of recommendation systems, product
search engines and internal supply logistics. Providing this data set, our goal
is to boost the evaluation of machine learning methods for the prediction of
the category of the retail products from tuples of images and descriptions.
Related papers
- Exploring Fine-grained Retail Product Discrimination with Zero-shot Object Classification Using Vision-Language Models [50.370043676415875]
In smart retail applications, the large number of products and their frequent turnover necessitate reliable zero-shot object classification methods.
We introduce the MIMEX dataset, comprising 28 distinct product categories.
We benchmark the zero-shot object classification performance of state-of-the-art vision-language models (VLMs) on the proposed MIMEX dataset.
arXiv Detail & Related papers (2024-09-23T12:28:40Z) - PAE: LLM-based Product Attribute Extraction for E-Commerce Fashion Trends [0.6445605125467574]
This paper presents PAE, a product attribute extraction algorithm for future trend reports consisting text and images in PDF format.
Our contributions are three-fold: (a) We develop PAE, an efficient framework to extract attributes from unstructured data (text and images); (b) We provide catalog matching methodology based on BERT representations to discover the existing attributes using upcoming attribute values; (c) We conduct extensive experiments with several baselines and show that PAE is an effective, flexible and on par or superior (avg 92.5% F1-Score) framework to existing state-of-the-art for attribute value extraction
arXiv Detail & Related papers (2024-05-27T17:50:25Z) - Product Information Extraction using ChatGPT [69.12244027050454]
This paper explores the potential of ChatGPT for extracting attribute/value pairs from product descriptions.
Our results show that ChatGPT achieves a performance similar to a pre-trained language model but requires much smaller amounts of training data and computation for fine-tuning.
arXiv Detail & Related papers (2023-06-23T09:30:01Z) - Automated Extraction of Fine-Grained Standardized Product Information
from Unstructured Multilingual Web Data [66.21317300595483]
We show how recent advances in machine learning, combined with a recently published multilingual data set, enable robust product attribute extraction.
Our models can reliably predict product attributes across online shops, languages, or both.
arXiv Detail & Related papers (2023-02-23T16:26:11Z) - Identifying Substitute and Complementary Products for Assortment
Optimization with Cleora Embeddings [0.0]
The paper introduces a novel method for finding products' substitutes and complements based on the graph embedding Cleora algorithm.
It is concluded that the new approach presented here offers suitable choices of recommended products, requiring a minimal amount of additional information.
arXiv Detail & Related papers (2022-08-10T11:56:36Z) - ItemSage: Learning Product Embeddings for Shopping Recommendations at
Pinterest [60.841761065439414]
At Pinterest, we build a single set of product embeddings called ItemSage to provide relevant recommendations in all shopping use cases.
This approach has led to significant improvements in engagement and conversion metrics, while reducing both infrastructure and maintenance cost.
arXiv Detail & Related papers (2022-05-24T02:28:58Z) - Characterization of Frequent Online Shoppers using Statistical Learning
with Sparsity [54.26540039514418]
This work reports a method to learn the shopping preferences of frequent shoppers to an online gift store by combining ideas from retail analytics and statistical learning with sparsity.
arXiv Detail & Related papers (2021-11-11T05:36:39Z) - Text Classification for Predicting Multi-level Product Categories [0.0]
In an online shopping platform, a detailed classification of the products facilitates user navigation.
In this study, we focus on product title classification of the grocery products.
arXiv Detail & Related papers (2021-09-02T17:00:05Z) - An Overview of Recommender Systems and Machine Learning in Feature
Modeling and Configuration [55.67505546330206]
We give an overview of a potential new line of research which is related to the application of recommender systems and machine learning techniques.
In this paper, we give examples of the application of recommender systems and machine learning and discuss future research issues.
arXiv Detail & Related papers (2021-02-12T17:21:36Z) - Exploiting Knowledge Graphs for Facilitating Product/Service Discovery [1.2691047660244332]
This work presents a cost-effective solution for e-commerce on the Data Web by employing an unsupervised approach for data classification.
The proposed architecture describes available products in web language OWL and stores them in a triple store.
User input specifications for certain products are matched against the available product categories to generate a knowledge graph.
arXiv Detail & Related papers (2020-10-11T10:22:10Z) - Exploiting Latent Codes: Interactive Fashion Product Generation, Similar
Image Retrieval, and Cross-Category Recommendation using Variational
Autoencoders [0.0]
Author proposes using Variational Autoencoder (VAE) to build an interactive fashion product application framework.
This pipeline is applicable in the booming industry of e-commerce enabling direct user interaction in specifying desired products.
arXiv Detail & Related papers (2020-09-02T13:27:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.