LogoNet: a fine-grained network for instance-level logo sketch retrieval
- URL: http://arxiv.org/abs/2304.02214v1
- Date: Wed, 5 Apr 2023 04:03:02 GMT
- Title: LogoNet: a fine-grained network for instance-level logo sketch retrieval
- Authors: Binbin Feng, Jun Li, Jianhua Xu
- Abstract summary: We construct an instance-level logo sketch dataset containing 2k logo instances and exceeding 9k sketches.
Next, we develop a fine-grained triple-branch CNN architecture based on hybrid attention mechanism termed LogoNet for accurate logo sketch retrieval.
- Score: 7.858317554338922
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Sketch-based image retrieval, which aims to use sketches as queries to
retrieve images containing the same query instance, receives increasing
attention in recent years. Although dramatic progress has been made in sketch
retrieval, few efforts are devoted to logo sketch retrieval which is still
hindered by the following challenges: Firstly, logo sketch retrieval is more
difficult than typical sketch retrieval problem, since a logo sketch usually
contains much less visual contents with only irregular strokes and lines.
Secondly, instance-specific sketches demonstrate dramatic appearance variances,
making them less identifiable when querying the same logo instance. Thirdly,
there exist several sketch retrieval benchmarking datasets nowadays, whereas an
instance-level logo sketch dataset is still publicly unavailable. To address
the above-mentioned limitations, we make twofold contributions in this study
for instance-level logo sketch retrieval. To begin with, we construct an
instance-level logo sketch dataset containing 2k logo instances and exceeding
9k sketches. To our knowledge, this is the first publicly available
instance-level logo sketch dataset. Next, we develop a fine-grained
triple-branch CNN architecture based on hybrid attention mechanism termed
LogoNet for accurate logo sketch retrieval. More specifically, we embed the
hybrid attention mechanism into the triple-branch architecture for capturing
the key query-specific information from the limited visual cues in the logo
sketches. Experimental evaluations both on our assembled dataset and public
benchmark datasets demonstrate the effectiveness of our proposed network.
Related papers
- LogoSticker: Inserting Logos into Diffusion Models for Customized Generation [73.59571559978278]
We introduce the task of logo insertion into text-to-image models.
Our goal is to insert logo identities into diffusion models and enable their seamless synthesis in varied contexts.
We present a novel two-phase pipeline LogoSticker to tackle this task.
arXiv Detail & Related papers (2024-07-18T17:54:49Z) - Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings [99.9788496281408]
We study how sketches can be used as a weak label to detect salient objects present in an image.
To accomplish this, we introduce a photo-to-sketch generation model that aims to generate sequential sketch coordinates corresponding to a given visual photo.
Tests prove our hypothesis and delineate how our sketch-based saliency detection model gives a competitive performance compared to the state-of-the-art.
arXiv Detail & Related papers (2023-03-20T23:46:46Z) - Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred
Thousand-Scale One-Shot Logo Identification [2.243832625209014]
We study the problem of identifying logos of business brands in natural scenes in an open-set one-shot setting.
We propose a novel multi-view textual-visual encoding framework that encodes text appearing in the logos.
We evaluate our proposed framework for cropped logo verification, cropped logo identification, and end-to-end logo identification in natural scene tasks.
arXiv Detail & Related papers (2022-11-23T12:59:41Z) - FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in
Context [112.07988211268612]
We advance sketch research to scenes with the first dataset of freehand scene sketches, FS-COCO.
Our dataset comprises 10,000 freehand scene vector sketches with per point space-time information by 100 non-expert individuals.
We study for the first time the problem of the fine-grained image retrieval from freehand scene sketches and sketch captions.
arXiv Detail & Related papers (2022-03-04T03:00:51Z) - Multi-granularity Association Learning Framework for on-the-fly
Fine-Grained Sketch-based Image Retrieval [7.797006835701767]
Fine-grained sketch-based image retrieval (FG-SBIR) addresses the problem of retrieving a particular photo in a given query sketch.
In this study, we aim to retrieve the target photo with the least number of strokes possible (incomplete sketch)
We propose a multi-granularity association learning framework that further optimize the embedding space of all incomplete sketches.
arXiv Detail & Related papers (2022-01-13T14:38:50Z) - LogoDet-3K: A Large-Scale Image Dataset for Logo Detection [61.296935298332606]
We introduce LogoDet-3K, the largest logo detection dataset with full annotation.
It has 3,000 logo categories, about 200,000 manually annotated logo objects and 158,652 images.
We propose a strong baseline method Logo-Yolo, which incorporates Focal loss and CIoU loss into the state-of-the-art YOLOv3 framework for large-scale logo detection.
arXiv Detail & Related papers (2020-08-12T14:57:53Z) - Cross-Modal Hierarchical Modelling for Fine-Grained Sketch Based Image
Retrieval [147.24102408745247]
We study a further trait of sketches that has been overlooked to date, that is, they are hierarchical in terms of the levels of detail.
In this paper, we design a novel network that is capable of cultivating sketch-specific hierarchies and exploiting them to match sketch with photo at corresponding hierarchical levels.
arXiv Detail & Related papers (2020-07-29T20:50:25Z) - Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image
Retrieval [203.2520862597357]
Fine-grained sketch-based image retrieval (FG-SBIR) addresses the problem of retrieving a particular photo instance given a user's query sketch.
We reformulate the conventional FG-SBIR framework to tackle these challenges.
We propose an on-the-fly design that starts retrieving as soon as the user starts drawing.
arXiv Detail & Related papers (2020-02-24T15:36:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.