Products-10K: A Large-scale Product Recognition Dataset
- URL: http://arxiv.org/abs/2008.10545v1
- Date: Mon, 24 Aug 2020 16:33:37 GMT
- Title: Products-10K: A Large-scale Product Recognition Dataset
- Authors: Yalong Bai, Yuxiang Chen, Wei Yu, Linfang Wang, and Wei Zhang
- Abstract summary: In this paper, we construct a human-labeled product image dataset named "Products-10K"
The dataset contains 10,000 fine-grained SKU-level products frequently bought by online customers in JD.com.
Based on our new database, we also introduced several useful tips and tricks for fine-grained product recognition.
- Score: 18.506656670737407
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the rapid development of electronic commerce, the way of shopping has
experienced a revolutionary evolution. To fully meet customers' massive and
diverse online shopping needs with quick response, the retailing AI system
needs to automatically recognize products from images and videos at the
stock-keeping unit (SKU) level with high accuracy. However, product recognition
is still a challenging task, since many of SKU-level products are fine-grained
and visually similar by a rough glimpse. Although there are already some
products benchmarks available, these datasets are either too small (limited
number of products) or noisy-labeled (lack of human labeling). In this paper,
we construct a human-labeled product image dataset named "Products-10K", which
contains 10,000 fine-grained SKU-level products frequently bought by online
customers in JD.com. Based on our new database, we also introduced several
useful tips and tricks for fine-grained product recognition. The products-10K
dataset is available via https://products-10k.github.io/.
Related papers
- Multimodal Prompt Learning for Product Title Generation with Extremely
Limited Labels [66.54691023795097]
We propose a prompt-based approach, i.e., the Multimodal Prompt Learning framework, to generate titles for novel products with limited labels.
We build a set of multimodal prompts from different modalities to preserve the corresponding characteristics and writing styles of novel products.
With the full labelled data for training, our method achieves state-of-the-art results.
arXiv Detail & Related papers (2023-07-05T00:40:40Z) - Product Information Extraction using ChatGPT [69.12244027050454]
This paper explores the potential of ChatGPT for extracting attribute/value pairs from product descriptions.
Our results show that ChatGPT achieves a performance similar to a pre-trained language model but requires much smaller amounts of training data and computation for fine-tuning.
arXiv Detail & Related papers (2023-06-23T09:30:01Z) - Automated Extraction of Fine-Grained Standardized Product Information
from Unstructured Multilingual Web Data [66.21317300595483]
We show how recent advances in machine learning, combined with a recently published multilingual data set, enable robust product attribute extraction.
Our models can reliably predict product attributes across online shops, languages, or both.
arXiv Detail & Related papers (2023-02-23T16:26:11Z) - ItemSage: Learning Product Embeddings for Shopping Recommendations at
Pinterest [60.841761065439414]
At Pinterest, we build a single set of product embeddings called ItemSage to provide relevant recommendations in all shopping use cases.
This approach has led to significant improvements in engagement and conversion metrics, while reducing both infrastructure and maintenance cost.
arXiv Detail & Related papers (2022-05-24T02:28:58Z) - Unitail: Detecting, Reading, and Matching in Retail Scene [37.1516435926562]
We introduce the United Retail dataset, a benchmark of basic visual tasks on products.
With 1.8M quadrilateral-shaped instances, the Unitail offers a detection dataset to align product appearance better.
It also provides a gallery-style OCR dataset containing 1454 product categories, 30k text regions, and 21k transcriptions.
arXiv Detail & Related papers (2022-04-01T09:06:48Z) - eProduct: A Million-Scale Visual Search Benchmark to Address Product
Recognition Challenges [8.204924070199866]
eProduct is a benchmark dataset for training and evaluation on various visual search solutions in a real-world setting.
We present eProduct as a training set and an evaluation set, where the training set contains 1.3M+ listing images with titles and hierarchical category labels, for model development.
We will present eProduct's construction steps, provide analysis about its diversity and cover the performance of baseline models trained on it.
arXiv Detail & Related papers (2021-07-13T05:28:34Z) - Deep Learning-based Online Alternative Product Recommendations at Scale [0.2278231643598956]
We use both textual product information (e.g. product titles and descriptions) and customer behavior data to recommend alternative products.
Our results show that the coverage of alternative products is significantly improved in offline evaluations as well as recall and precision.
arXiv Detail & Related papers (2021-04-15T16:27:45Z) - AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of
Types [127.99385433323859]
We describe AutoKnow, our automatic (self-driving) system that addresses organizing information about products.
The system includes a suite of novel techniques for taxonomy construction, product property identification, knowledge extraction, anomaly detection, and synonym discovery.
AutoKnow has been operational in collecting product knowledge for over 11K product types.
arXiv Detail & Related papers (2020-06-24T04:35:17Z) - RP2K: A Large-Scale Retail Product Dataset for Fine-Grained Image
Classification [19.82453283089643]
RP2K is a new large-scale retail product dataset for fine-grained image classification.
Unlike previous datasets, we collect more than 500,000 images of retail products on shelves belonging to 2000 different products.
arXiv Detail & Related papers (2020-06-22T21:39:56Z) - Automatic Validation of Textual Attribute Values in E-commerce Catalog
by Learning with Limited Labeled Data [61.789797281676606]
We propose a novel meta-learning latent variable approach, called MetaBridge.
It can learn transferable knowledge from a subset of categories with limited labeled data.
It can capture the uncertainty of never-seen categories with unlabeled data.
arXiv Detail & Related papers (2020-06-15T21:31:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.