Learning To Count Everything
- URL: http://arxiv.org/abs/2104.08391v1
- Date: Fri, 16 Apr 2021 22:45:58 GMT
- Title: Learning To Count Everything
- Authors: Viresh Ranjan, Udbhav Sharma, Thu Nguyen, Minh Hoai
- Abstract summary: We are interested in counting everything, that is to count objects from any category given only a few annotated instances from that category.
We present a novel method that takes a query image together with a few exemplar objects from the query image and predicts a density map for the presence of all objects of interest in the query image.
We also present a novel adaptation strategy to adapt our network to any novel visual category at test time, using only a few exemplar objects from the novel category.
- Score: 16.1159048148031
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Existing works on visual counting primarily focus on one specific category at
a time, such as people, animals, and cells. In this paper, we are interested in
counting everything, that is to count objects from any category given only a
few annotated instances from that category. To this end, we pose counting as a
few-shot regression task. To tackle this task, we present a novel method that
takes a query image together with a few exemplar objects from the query image
and predicts a density map for the presence of all objects of interest in the
query image. We also present a novel adaptation strategy to adapt our network
to any novel visual category at test time, using only a few exemplar objects
from the novel category. We also introduce a dataset of 147 object categories
containing over 6000 images that are suitable for the few-shot counting task.
The images are annotated with two types of annotation, dots and bounding boxes,
and they can be used for developing few-shot counting models. Experiments on
this dataset shows that our method outperforms several state-of-the-art object
detectors and few-shot counting approaches. Our code and dataset can be found
at https://github.com/cvlab-stonybrook/LearningToCountEverything.
Related papers
- Zero-Shot Object Counting with Language-Vision Models [50.1159882903028]
Class-agnostic object counting aims to count object instances of an arbitrary class at test time.
Current methods require human-annotated exemplars as inputs which are often unavailable for novel categories.
We propose zero-shot object counting (ZSC), a new setting where only the class name is available during test time.
arXiv Detail & Related papers (2023-09-22T14:48:42Z) - Learning from Pseudo-labeled Segmentation for Multi-Class Object
Counting [35.652092907690694]
Class-agnostic counting (CAC) has numerous potential applications across various domains.
The goal is to count objects of an arbitrary category during testing, based on only a few annotated exemplars.
We show that the segmentation model trained on these pseudo-labeled masks can effectively localize objects of interest for an arbitrary multi-class image.
arXiv Detail & Related papers (2023-07-15T01:33:19Z) - Few-shot Object Counting and Detection [25.61294147822642]
We tackle a new task of few-shot object counting and detection. Given a few exemplar bounding boxes of a target object class, we seek to count and detect all objects of the target class.
This task shares the same supervision as the few-shot object counting but additionally outputs the object bounding boxes along with the total object count.
We introduce a novel two-stage training strategy and a novel uncertainty-aware few-shot object detector: Counting-DETR.
arXiv Detail & Related papers (2022-07-22T10:09:18Z) - Exploiting Unlabeled Data with Vision and Language Models for Object
Detection [64.94365501586118]
Building robust and generic object detection frameworks requires scaling to larger label spaces and bigger training datasets.
We propose a novel method that leverages the rich semantics available in recent vision and language models to localize and classify objects in unlabeled images.
We demonstrate the value of the generated pseudo labels in two specific tasks, open-vocabulary detection and semi-supervised object detection.
arXiv Detail & Related papers (2022-07-18T21:47:15Z) - Object Counting: You Only Need to Look at One [28.49828300257672]
This paper aims to tackle the challenging task of one-shot object counting.
Given an image containing novel, previously unseen category objects, the goal of the task is to count all instances in the desired category with only one supporting bounding box example.
We propose a counting model by which you only need to Look At One instance (LaoNet)
arXiv Detail & Related papers (2021-12-11T14:22:05Z) - Single Image Object Counting and Localizing using Active-Learning [4.56877715768796]
We present a new method for counting and localizing repeating objects in single-image scenarios.
Our method trains a CNN over a small set of labels carefully collected from the input image in few active-learning iterations.
Compared with existing user-assisted counting methods, our active-learning iterations achieve state-of-the-art performance in terms of counting and localizing accuracy, number of user mouse clicks, and running-time.
arXiv Detail & Related papers (2021-11-16T11:29:21Z) - Rectifying the Shortcut Learning of Background: Shared Object
Concentration for Few-Shot Image Recognition [101.59989523028264]
Few-Shot image classification aims to utilize pretrained knowledge learned from a large-scale dataset to tackle a series of downstream classification tasks.
We propose COSOC, a novel Few-Shot Learning framework, to automatically figure out foreground objects at both pretraining and evaluation stage.
arXiv Detail & Related papers (2021-07-16T07:46:41Z) - Dilated-Scale-Aware Attention ConvNet For Multi-Class Object Counting [18.733301622920102]
Multi-class object counting expands the scope of application of object counting task.
The multi-target detection task can achieve multi-class object counting in some scenarios.
We propose a simple yet efficient counting network based on point-level annotations.
arXiv Detail & Related papers (2020-12-15T08:38:28Z) - A Few-Shot Sequential Approach for Object Counting [63.82757025821265]
We introduce a class attention mechanism that sequentially attends to objects in the image and extracts their relevant features.
The proposed technique is trained on point-level annotations and uses a novel loss function that disentangles class-dependent and class-agnostic aspects of the model.
We present our results on a variety of object-counting/detection datasets, including FSOD and MS COCO.
arXiv Detail & Related papers (2020-07-03T18:23:39Z) - Rethinking Object Detection in Retail Stores [55.359582952686175]
We propose a new task, simultaneously object localization and counting, abbreviated as Locount.
Locount requires algorithms to localize groups of objects of interest with the number of instances.
We collect a large-scale object localization and counting dataset with rich annotations in retail stores.
arXiv Detail & Related papers (2020-03-18T14:01:54Z) - Captioning Images with Novel Objects via Online Vocabulary Expansion [62.525165808406626]
We introduce a low cost method for generating descriptions from images containing novel objects.
We propose a method that can explain images with novel objects without retraining using the word embeddings of the objects estimated from only a small number of image features.
arXiv Detail & Related papers (2020-03-06T16:34:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.