Dilated-Scale-Aware Attention ConvNet For Multi-Class Object Counting
- URL: http://arxiv.org/abs/2012.08149v1
- Date: Tue, 15 Dec 2020 08:38:28 GMT
- Title: Dilated-Scale-Aware Attention ConvNet For Multi-Class Object Counting
- Authors: Wei Xu, Dingkang Liang, Yixiao Zheng, Zhanyu Ma
- Abstract summary: Multi-class object counting expands the scope of application of object counting task.
The multi-target detection task can achieve multi-class object counting in some scenarios.
We propose a simple yet efficient counting network based on point-level annotations.
- Score: 18.733301622920102
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Object counting aims to estimate the number of objects in images. The leading
counting approaches focus on the single category counting task and achieve
impressive performance. Note that there are multiple categories of objects in
real scenes. Multi-class object counting expands the scope of application of
object counting task. The multi-target detection task can achieve multi-class
object counting in some scenarios. However, it requires the dataset annotated
with bounding boxes. Compared with the point annotations in mainstream object
counting issues, the coordinate box-level annotations are more difficult to
obtain. In this paper, we propose a simple yet efficient counting network based
on point-level annotations. Specifically, we first change the traditional
output channel from one to the number of categories to achieve multiclass
counting. Since all categories of objects use the same feature extractor in our
proposed framework, their features will interfere mutually in the shared
feature space. We further design a multi-mask structure to suppress harmful
interaction among objects. Extensive experiments on the challenging benchmarks
illustrate that the proposed method achieves state-of-the-art counting
performance.
Related papers
- OmniCount: Multi-label Object Counting with Semantic-Geometric Priors [42.38571663534819]
This paper introduces a more practical approach enabling simultaneous counting of multiple object categories using an open-vocabulary framework.
Our solution, OmniCount, stands out by using semantic and geometric insights (priors) from pre-trained models to count multiple categories of objects as specified by users.
Our comprehensive evaluation in OmniCount-191, alongside other leading benchmarks, demonstrates OmniCount's exceptional performance, significantly outpacing existing solutions.
arXiv Detail & Related papers (2024-03-08T16:38:11Z) - AFreeCA: Annotation-Free Counting for All [17.581015609730017]
We introduce an unsupervised sorting methodology to learn object-related features that are subsequently refined and anchored for counting purposes.
We also present a density classifier-guided method for dividing an image into patches containing objects that can be reliably counted.
arXiv Detail & Related papers (2024-03-07T23:18:34Z) - Zero-Shot Object Counting with Language-Vision Models [50.1159882903028]
Class-agnostic object counting aims to count object instances of an arbitrary class at test time.
Current methods require human-annotated exemplars as inputs which are often unavailable for novel categories.
We propose zero-shot object counting (ZSC), a new setting where only the class name is available during test time.
arXiv Detail & Related papers (2023-09-22T14:48:42Z) - Learning from Pseudo-labeled Segmentation for Multi-Class Object
Counting [35.652092907690694]
Class-agnostic counting (CAC) has numerous potential applications across various domains.
The goal is to count objects of an arbitrary category during testing, based on only a few annotated exemplars.
We show that the segmentation model trained on these pseudo-labeled masks can effectively localize objects of interest for an arbitrary multi-class image.
arXiv Detail & Related papers (2023-07-15T01:33:19Z) - Universal Instance Perception as Object Discovery and Retrieval [90.96031157557806]
UNI reformulates diverse instance perception tasks into a unified object discovery and retrieval paradigm.
It can flexibly perceive different types of objects by simply changing the input prompts.
UNI shows superior performance on 20 challenging benchmarks from 10 instance-level tasks.
arXiv Detail & Related papers (2023-03-12T14:28:24Z) - Contrastive Object Detection Using Knowledge Graph Embeddings [72.17159795485915]
We compare the error statistics of the class embeddings learned from a one-hot approach with semantically structured embeddings from natural language processing or knowledge graphs.
We propose a knowledge-embedded design for keypoint-based and transformer-based object detection architectures.
arXiv Detail & Related papers (2021-12-21T17:10:21Z) - Object Counting: You Only Need to Look at One [28.49828300257672]
This paper aims to tackle the challenging task of one-shot object counting.
Given an image containing novel, previously unseen category objects, the goal of the task is to count all instances in the desired category with only one supporting bounding box example.
We propose a counting model by which you only need to Look At One instance (LaoNet)
arXiv Detail & Related papers (2021-12-11T14:22:05Z) - Weakly-Supervised Semantic Segmentation via Sub-category Exploration [73.03956876752868]
We propose a simple yet effective approach to enforce the network to pay attention to other parts of an object.
Specifically, we perform clustering on image features to generate pseudo sub-categories labels within each annotated parent class.
We conduct extensive analysis to validate the proposed method and show that our approach performs favorably against the state-of-the-art approaches.
arXiv Detail & Related papers (2020-08-03T20:48:31Z) - A Few-Shot Sequential Approach for Object Counting [63.82757025821265]
We introduce a class attention mechanism that sequentially attends to objects in the image and extracts their relevant features.
The proposed technique is trained on point-level annotations and uses a novel loss function that disentangles class-dependent and class-agnostic aspects of the model.
We present our results on a variety of object-counting/detection datasets, including FSOD and MS COCO.
arXiv Detail & Related papers (2020-07-03T18:23:39Z) - Rethinking Object Detection in Retail Stores [55.359582952686175]
We propose a new task, simultaneously object localization and counting, abbreviated as Locount.
Locount requires algorithms to localize groups of objects of interest with the number of instances.
We collect a large-scale object localization and counting dataset with rich annotations in retail stores.
arXiv Detail & Related papers (2020-03-18T14:01:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.