Multi-output Headed Ensembles for Product Item Classification
- URL: http://arxiv.org/abs/2307.15858v1
- Date: Sat, 29 Jul 2023 01:23:36 GMT
- Title: Multi-output Headed Ensembles for Product Item Classification
- Authors: Hotaka Shiokawa and Pradipto Das and Arthur Toth and Justin Chiu
- Abstract summary: We propose a deep learning based classification model framework for e-commerce catalogs.
We show improvements against robust industry standard baseline models.
We also propose a novel way to evaluate model performance using user sessions.
- Score: 0.9053163124987533
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In this paper, we revisit the problem of product item classification for
large-scale e-commerce catalogs. The taxonomy of e-commerce catalogs consists
of thousands of genres to which are assigned items that are uploaded by
merchants on a continuous basis. The genre assignments by merchants are often
wrong but treated as ground truth labels in automatically generated training
sets, thus creating a feedback loop that leads to poorer model quality over
time. This problem of taxonomy classification becomes highly pronounced due to
the unavailability of sizable curated training sets.
Under such a scenario it is common to combine multiple classifiers to combat
poor generalization performance from a single classifier. We propose an
extensible deep learning based classification model framework that benefits
from the simplicity and robustness of averaging ensembles and fusion based
classifiers. We are also able to use metadata features and low-level feature
engineering to boost classification performance. We show these improvements
against robust industry standard baseline models that employ hyperparameter
optimization.
Additionally, due to continuous insertion, deletion and updates to real-world
high-volume e-commerce catalogs, assessing model performance for deployment
using A/B testing and/or manual annotation becomes a bottleneck. To this end,
we also propose a novel way to evaluate model performance using user sessions
that provides better insights in addition to traditional measures of precision
and recall.
Related papers
- Exploring Fine-grained Retail Product Discrimination with Zero-shot Object Classification Using Vision-Language Models [50.370043676415875]
In smart retail applications, the large number of products and their frequent turnover necessitate reliable zero-shot object classification methods.
We introduce the MIMEX dataset, comprising 28 distinct product categories.
We benchmark the zero-shot object classification performance of state-of-the-art vision-language models (VLMs) on the proposed MIMEX dataset.
arXiv Detail & Related papers (2024-09-23T12:28:40Z) - Generative Multi-modal Models are Good Class-Incremental Learners [51.5648732517187]
We propose a novel generative multi-modal model (GMM) framework for class-incremental learning.
Our approach directly generates labels for images using an adapted generative model.
Under the Few-shot CIL setting, we have improved by at least 14% accuracy over all the current state-of-the-art methods with significantly less forgetting.
arXiv Detail & Related papers (2024-03-27T09:21:07Z) - From Categories to Classifiers: Name-Only Continual Learning by Exploring the Web [118.67589717634281]
Continual learning often relies on the availability of extensive annotated datasets, an assumption that is unrealistically time-consuming and costly in practice.
We explore a novel paradigm termed name-only continual learning where time and cost constraints prohibit manual annotation.
Our proposed solution leverages the expansive and ever-evolving internet to query and download uncurated webly-supervised data for image classification.
arXiv Detail & Related papers (2023-11-19T10:43:43Z) - Consistent Text Categorization using Data Augmentation in e-Commerce [1.558017967663767]
We propose a new framework for consistent text categorization.
Our goal is to improve the model's consistency while maintaining its production-level performance.
arXiv Detail & Related papers (2023-05-09T12:47:28Z) - Categorizing Items with Short and Noisy Descriptions using Ensembled
Transferred Embeddings [6.282068591820945]
Ensembled Transferred Embeddings (ETE) is a novel learning framework for item categorization.
We show that ETE outperforms state-of-the-art item categorization methods on a large-scale real-world dataset provided to us by PayPal.
arXiv Detail & Related papers (2021-10-21T18:57:40Z) - Text Classification for Predicting Multi-level Product Categories [0.0]
In an online shopping platform, a detailed classification of the products facilitates user navigation.
In this study, we focus on product title classification of the grocery products.
arXiv Detail & Related papers (2021-09-02T17:00:05Z) - Active Hybrid Classification [79.02441914023811]
This paper shows how crowd and machines can support each other in tackling classification problems.
We propose an architecture that orchestrates active learning and crowd classification and combines them in a virtuous cycle.
arXiv Detail & Related papers (2021-01-21T21:09:07Z) - One vs Previous and Similar Classes Learning -- A Comparative Study [2.208242292882514]
This work proposes three learning paradigms which allow trained models to be updated without the need of retraining from scratch.
Results show that the proposed paradigms are faster than the baseline at updating, with two of them being faster at training from scratch as well, especially on larger datasets.
arXiv Detail & Related papers (2021-01-05T00:28:38Z) - Automatic Validation of Textual Attribute Values in E-commerce Catalog
by Learning with Limited Labeled Data [61.789797281676606]
We propose a novel meta-learning latent variable approach, called MetaBridge.
It can learn transferable knowledge from a subset of categories with limited labeled data.
It can capture the uncertainty of never-seen categories with unlabeled data.
arXiv Detail & Related papers (2020-06-15T21:31:05Z) - Fine-Grained Visual Classification with Efficient End-to-end
Localization [49.9887676289364]
We present an efficient localization module that can be fused with a classification network in an end-to-end setup.
We evaluate the new model on the three benchmark datasets CUB200-2011, Stanford Cars and FGVC-Aircraft.
arXiv Detail & Related papers (2020-05-11T14:07:06Z) - Learning Robust Models for e-Commerce Product Search [23.537201383165755]
Showing items that do not match search query intent degrades customer experience in e-commerce.
Mitigating the problem requires a large labeled dataset.
We develop a deep, end-to-end model that learns to effectively classify mismatches.
arXiv Detail & Related papers (2020-05-07T17:22:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.