Text Classification for Predicting Multi-level Product Categories
- URL: http://arxiv.org/abs/2109.01084v1
- Date: Thu, 2 Sep 2021 17:00:05 GMT
- Title: Text Classification for Predicting Multi-level Product Categories
- Authors: Hadi Jahanshahi, Ozan Ozyegen, Mucahit Cevik, Beste Bulut, Deniz
Yigit, Fahrettin F. Gonen, Ay\c{s}e Ba\c{s}ar
- Abstract summary: In an online shopping platform, a detailed classification of the products facilitates user navigation.
In this study, we focus on product title classification of the grocery products.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In an online shopping platform, a detailed classification of the products
facilitates user navigation. It also helps online retailers keep track of the
price fluctuations in a certain industry or special discounts on a specific
product category. Moreover, an automated classification system may help to
pinpoint incorrect or subjective categories suggested by an operator. In this
study, we focus on product title classification of the grocery products. We
perform a comprehensive comparison of six different text classification models
to establish a strong baseline for this task, which involves testing both
traditional and recent machine learning methods. In our experiments, we
investigate the generalizability of the trained models to the products of other
online retailers, the dynamic masking of infeasible subcategories for
pretrained language models, and the benefits of incorporating product titles in
multiple languages. Our numerical results indicate that dynamic masking of
subcategories is effective in improving prediction accuracy. In addition, we
observe that using bilingual product titles is generally beneficial, and neural
network-based models perform significantly better than SVM and XGBoost models.
Lastly, we investigate the reasons for the misclassified products and propose
future research directions to further enhance the prediction models.
Related papers
- Exploring Fine-grained Retail Product Discrimination with Zero-shot Object Classification Using Vision-Language Models [50.370043676415875]
In smart retail applications, the large number of products and their frequent turnover necessitate reliable zero-shot object classification methods.
We introduce the MIMEX dataset, comprising 28 distinct product categories.
We benchmark the zero-shot object classification performance of state-of-the-art vision-language models (VLMs) on the proposed MIMEX dataset.
arXiv Detail & Related papers (2024-09-23T12:28:40Z) - Generative Multi-modal Models are Good Class-Incremental Learners [51.5648732517187]
We propose a novel generative multi-modal model (GMM) framework for class-incremental learning.
Our approach directly generates labels for images using an adapted generative model.
Under the Few-shot CIL setting, we have improved by at least 14% accuracy over all the current state-of-the-art methods with significantly less forgetting.
arXiv Detail & Related papers (2024-03-27T09:21:07Z) - Multi-level Product Category Prediction through Text Classification [0.0]
This article investigates applying advanced machine learning models, specifically LSTM and BERT, for text classification to predict multiple categories in the retail sector.
The study demonstrates how applying data augmentation techniques and the focal loss function can significantly enhance accuracy in classifying products into multiple categories using a robust Brazilian retail dataset.
arXiv Detail & Related papers (2024-03-03T23:10:36Z) - Multi-output Headed Ensembles for Product Item Classification [0.9053163124987533]
We propose a deep learning based classification model framework for e-commerce catalogs.
We show improvements against robust industry standard baseline models.
We also propose a novel way to evaluate model performance using user sessions.
arXiv Detail & Related papers (2023-07-29T01:23:36Z) - Consistent Text Categorization using Data Augmentation in e-Commerce [1.558017967663767]
We propose a new framework for consistent text categorization.
Our goal is to improve the model's consistency while maintaining its production-level performance.
arXiv Detail & Related papers (2023-05-09T12:47:28Z) - Automated Extraction of Fine-Grained Standardized Product Information
from Unstructured Multilingual Web Data [66.21317300595483]
We show how recent advances in machine learning, combined with a recently published multilingual data set, enable robust product attribute extraction.
Our models can reliably predict product attributes across online shops, languages, or both.
arXiv Detail & Related papers (2023-02-23T16:26:11Z) - Automatically Discovering Novel Visual Categories with Self-supervised
Prototype Learning [68.63910949916209]
This paper tackles the problem of novel category discovery (NCD), which aims to discriminate unknown categories in large-scale image collections.
We propose a novel adaptive prototype learning method consisting of two main stages: prototypical representation learning and prototypical self-training.
We conduct extensive experiments on four benchmark datasets and demonstrate the effectiveness and robustness of the proposed method with state-of-the-art performance.
arXiv Detail & Related papers (2022-08-01T16:34:33Z) - Guiding Generative Language Models for Data Augmentation in Few-Shot
Text Classification [59.698811329287174]
We leverage GPT-2 for generating artificial training instances in order to improve classification performance.
Our results show that fine-tuning GPT-2 in a handful of label instances leads to consistent classification improvements.
arXiv Detail & Related papers (2021-11-17T12:10:03Z) - Learning and Evaluating Representations for Deep One-class
Classification [59.095144932794646]
We present a two-stage framework for deep one-class classification.
We first learn self-supervised representations from one-class data, and then build one-class classifiers on learned representations.
In experiments, we demonstrate state-of-the-art performance on visual domain one-class classification benchmarks.
arXiv Detail & Related papers (2020-11-04T23:33:41Z) - Counterfactual Inference for Consumer Choice Across Many Product
Categories [6.347014958509367]
We build on techniques from the machine learning literature on probabilistic models of matrix factorization.
We show that our model improves over traditional modeling approaches that consider each category in isolation.
Using held-out data, we show that our model can accurately distinguish which consumers are most price sensitive to a given product.
arXiv Detail & Related papers (2019-06-06T15:11:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.