LogoDet-3K: A Large-Scale Image Dataset for Logo Detection
- URL: http://arxiv.org/abs/2008.05359v1
- Date: Wed, 12 Aug 2020 14:57:53 GMT
- Title: LogoDet-3K: A Large-Scale Image Dataset for Logo Detection
- Authors: Jing Wang, Weiqing Min, Sujuan Hou, Shengnan Ma, Yuanjie Zheng,
Shuqiang Jiang
- Abstract summary: We introduce LogoDet-3K, the largest logo detection dataset with full annotation.
It has 3,000 logo categories, about 200,000 manually annotated logo objects and 158,652 images.
We propose a strong baseline method Logo-Yolo, which incorporates Focal loss and CIoU loss into the state-of-the-art YOLOv3 framework for large-scale logo detection.
- Score: 61.296935298332606
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Logo detection has been gaining considerable attention because of its wide
range of applications in the multimedia field, such as copyright infringement
detection, brand visibility monitoring, and product brand management on social
media. In this paper, we introduce LogoDet-3K, the largest logo detection
dataset with full annotation, which has 3,000 logo categories, about 200,000
manually annotated logo objects and 158,652 images. LogoDet-3K creates a more
challenging benchmark for logo detection, for its higher comprehensive coverage
and wider variety in both logo categories and annotated objects compared with
existing datasets. We describe the collection and annotation process of our
dataset, analyze its scale and diversity in comparison to other datasets for
logo detection. We further propose a strong baseline method Logo-Yolo, which
incorporates Focal loss and CIoU loss into the state-of-the-art YOLOv3
framework for large-scale logo detection. Logo-Yolo can solve the problems of
multi-scale objects, logo sample imbalance and inconsistent bounding-box
regression. It obtains about 4% improvement on the average performance compared
with YOLOv3, and greater improvements compared with reported several deep
detection models on LogoDet-3K. The evaluations on other three existing
datasets further verify the effectiveness of our method, and demonstrate better
generalization ability of LogoDet-3K on logo detection and retrieval tasks. The
LogoDet-3K dataset is used to promote large-scale logo-related research and it
can be found at https://github.com/Wangjing1551/LogoDet-3K-Dataset.
Related papers
- SLANT: Spurious Logo ANalysis Toolkit [61.59021920232986]
We develop SLANT: A Spurious Logo ANalysis Toolkit.
It contains a semi-automatic mechanism for mining such "spurious" logos.
We uncover various seemingly harmless logos that VL models correlate with negative human adjectives.
An attacker could place a spurious logo on harmful content, causing the model to misclassify it as harmless.
arXiv Detail & Related papers (2024-06-03T15:41:31Z) - Image-Text Pre-Training for Logo Recognition [0.27195102129094995]
We propose two novel contributions to improve the matching model's performance.
A standard paradigm of fine-tuning ImageNet pre-trained models fails to discover the text sensitivity necessary to solve the matching problem effectively.
We show that the same vision backbone pre-trained on image-text data, when fine-tuned on OpenLogoDet3K47, achieves $98.6%$ recall@1.
arXiv Detail & Related papers (2023-09-18T23:18:02Z) - Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred
Thousand-Scale One-Shot Logo Identification [2.243832625209014]
We study the problem of identifying logos of business brands in natural scenes in an open-set one-shot setting.
We propose a novel multi-view textual-visual encoding framework that encodes text appearing in the logos.
We evaluate our proposed framework for cropped logo verification, cropped logo identification, and end-to-end logo identification in natural scene tasks.
arXiv Detail & Related papers (2022-11-23T12:59:41Z) - Deep Learning for Logo Detection: A Survey [59.278443852492465]
This paper reviews the advance in applying deep learning techniques to logo detection.
We perform an in-depth analysis of the existing logo detection strategies and the strengths and weaknesses of each learning strategy.
We summarize the applications of logo detection in various fields, from intelligent transportation and brand monitoring to copyright and trademark compliance.
arXiv Detail & Related papers (2022-10-10T02:07:41Z) - Discriminative Semantic Feature Pyramid Network with Guided Anchoring
for Logo Detection [52.36825190893928]
We propose a novel approach, named Discriminative Semantic Feature Pyramid Network with Guided Anchoring (DSFP-GA)
Our approach mainly consists of Discriminative Semantic Feature Pyramid (DSFP) and Guided Anchoring (GA)
arXiv Detail & Related papers (2021-08-31T11:59:00Z) - FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via
Multi-Scale Feature Decoupling Network [55.49022825759331]
A large-scale food logo dataset is urgently needed for developing advanced food logo detection algorithms.
FoodLogoDet-1500 is a new large-scale publicly available food logo dataset with 1,500 categories, about 100,000 images and about 150,000 manually annotated food logo objects.
We propose a novel food logo detection method Multi-scale Feature Decoupling Network (MFDNet) to solve the problem of distinguishing multiple food logo categories.
arXiv Detail & Related papers (2021-08-10T12:47:04Z) - An Effective and Robust Detector for Logo Detection [58.448716977297565]
Some attackers fool the well-trained logo detection model for infringement.
A novel logo detector based on the mechanism of looking and thinking twice is proposed in this paper.
We extend detectoRS algorithm to a cascade schema with an equalization loss function, multi-scale transformations, and adversarial data augmentation.
arXiv Detail & Related papers (2021-08-01T10:17:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.