Makeup216: Logo Recognition with Adversarial Attention Representations
- URL: http://arxiv.org/abs/2112.06533v1
- Date: Mon, 13 Dec 2021 10:08:56 GMT
- Title: Makeup216: Logo Recognition with Adversarial Attention Representations
- Authors: Junjun Hu, Yanhao Zhu, Bo Zhao, Jiexin Zheng, Chenxu Zhao, Xiangyu
Zhu, Kangle Wu, Darun Tang
- Abstract summary: Makeup216 is the largest and most complex logo dataset in the field of makeup, captured from the real world.
It comprises of 216 logos and 157 brands, including 10,019 images and 37,018 annotated logo objects.
Our proposed framework achieved competitive results on Makeup216 and another large-scale open logo dataset.
- Score: 16.78131635640705
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: One of the challenges of logo recognition lies in the diversity of forms,
such as symbols, texts or a combination of both; further, logos tend to be
extremely concise in design while similar in appearance, suggesting the
difficulty of learning discriminative representations. To investigate the
variety and representation of logo, we introduced Makeup216, the largest and
most complex logo dataset in the field of makeup, captured from the real world.
It comprises of 216 logos and 157 brands, including 10,019 images and 37,018
annotated logo objects. In addition, we found that the marginal background
around the pure logo can provide a important context information and proposed
an adversarial attention representation framework (AAR) to attend on the logo
subject and auxiliary marginal background separately, which can be combined for
better representation. Our proposed framework achieved competitive results on
Makeup216 and another large-scale open logo dataset, which could provide fresh
thinking for logo recognition. The dataset of Makeup216 and the code of the
proposed framework will be released soon.
Related papers
- LogoSticker: Inserting Logos into Diffusion Models for Customized Generation [73.59571559978278]
We introduce the task of logo insertion into text-to-image models.
Our goal is to insert logo identities into diffusion models and enable their seamless synthesis in varied contexts.
We present a novel two-phase pipeline LogoSticker to tackle this task.
arXiv Detail & Related papers (2024-07-18T17:54:49Z) - SLANT: Spurious Logo ANalysis Toolkit [61.59021920232986]
We develop SLANT: A Spurious Logo ANalysis Toolkit.
It contains a semi-automatic mechanism for mining such "spurious" logos.
We uncover various seemingly harmless logos that VL models correlate with negative human adjectives.
An attacker could place a spurious logo on harmful content, causing the model to misclassify it as harmless.
arXiv Detail & Related papers (2024-06-03T15:41:31Z) - FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings [26.395196542803543]
We propose an approach to prompt MLLMs to generate appropriate text for product images, which can help visual models achieve better logo embeddings.
Our experiments on real-world datasets prove that FashionLOGO is capable of generating generic and robust logo embeddings.
arXiv Detail & Related papers (2023-08-17T14:30:26Z) - LogoNet: a fine-grained network for instance-level logo sketch retrieval [7.858317554338922]
We construct an instance-level logo sketch dataset containing 2k logo instances and exceeding 9k sketches.
Next, we develop a fine-grained triple-branch CNN architecture based on hybrid attention mechanism termed LogoNet for accurate logo sketch retrieval.
arXiv Detail & Related papers (2023-04-05T04:03:02Z) - Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred
Thousand-Scale One-Shot Logo Identification [2.243832625209014]
We study the problem of identifying logos of business brands in natural scenes in an open-set one-shot setting.
We propose a novel multi-view textual-visual encoding framework that encodes text appearing in the logos.
We evaluate our proposed framework for cropped logo verification, cropped logo identification, and end-to-end logo identification in natural scene tasks.
arXiv Detail & Related papers (2022-11-23T12:59:41Z) - Deep Learning for Logo Detection: A Survey [59.278443852492465]
This paper reviews the advance in applying deep learning techniques to logo detection.
We perform an in-depth analysis of the existing logo detection strategies and the strengths and weaknesses of each learning strategy.
We summarize the applications of logo detection in various fields, from intelligent transportation and brand monitoring to copyright and trademark compliance.
arXiv Detail & Related papers (2022-10-10T02:07:41Z) - Discriminative Semantic Feature Pyramid Network with Guided Anchoring
for Logo Detection [52.36825190893928]
We propose a novel approach, named Discriminative Semantic Feature Pyramid Network with Guided Anchoring (DSFP-GA)
Our approach mainly consists of Discriminative Semantic Feature Pyramid (DSFP) and Guided Anchoring (GA)
arXiv Detail & Related papers (2021-08-31T11:59:00Z) - FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via
Multi-Scale Feature Decoupling Network [55.49022825759331]
A large-scale food logo dataset is urgently needed for developing advanced food logo detection algorithms.
FoodLogoDet-1500 is a new large-scale publicly available food logo dataset with 1,500 categories, about 100,000 images and about 150,000 manually annotated food logo objects.
We propose a novel food logo detection method Multi-scale Feature Decoupling Network (MFDNet) to solve the problem of distinguishing multiple food logo categories.
arXiv Detail & Related papers (2021-08-10T12:47:04Z) - LogoDet-3K: A Large-Scale Image Dataset for Logo Detection [61.296935298332606]
We introduce LogoDet-3K, the largest logo detection dataset with full annotation.
It has 3,000 logo categories, about 200,000 manually annotated logo objects and 158,652 images.
We propose a strong baseline method Logo-Yolo, which incorporates Focal loss and CIoU loss into the state-of-the-art YOLOv3 framework for large-scale logo detection.
arXiv Detail & Related papers (2020-08-12T14:57:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.