Iterative Correlation-based Feature Refinement for Few-shot Counting
- URL: http://arxiv.org/abs/2201.08959v1
- Date: Sat, 22 Jan 2022 03:27:11 GMT
- Title: Iterative Correlation-based Feature Refinement for Few-shot Counting
- Authors: Zhiyuan You, Kai Yang, Wenhan Luo, Xin Lu, Lei Cui, Xinyi Le
- Abstract summary: Few-shot counting aims to count objects of any class in an image given only a few exemplars of the same class.
Existing correlation-based few-shot counting approaches suffer from the coarseness and low semantic level of the correlation.
We propose an iterative framework to progressively refine the exemplar-related features based on the correlation between the image and exemplars.
- Score: 35.27237393354539
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Few-shot counting aims to count objects of any class in an image given only a
few exemplars of the same class. Existing correlation-based few-shot counting
approaches suffer from the coarseness and low semantic level of the
correlation. To solve these problems, we propose an iterative framework to
progressively refine the exemplar-related features based on the correlation
between the image and exemplars. Then the density map is predicted from the
final refined feature map. The iterative framework includes a Correlation
Distillation module and a Feature Refinement module. During the iterations, the
exemplar-related features are gradually refined, while the exemplar-unrelated
features are suppressed, benefiting few-shot counting where the
exemplar-related features are more important. Our approach surpasses all
baselines significantly on few-shot counting benchmark FSC-147. Surprisingly,
though designed for general class-agnostic counting, our approach still
achieves state-of-the-art performance on car counting benchmarks CARPK and
PUCPR+, and crowd counting benchmarks UCSD and Mall. We also achieve
competitive performance on crowd counting benchmark ShanghaiTech. The code will
be released soon.
Related papers
- Mutually-Aware Feature Learning for Few-Shot Object Counting [20.623402944601775]
Few-shot object counting has garnered significant attention for its practicality as it aims to count target objects in a query image based on given exemplars without the need for additional training.
We propose a novel framework, Mutually-Aware FEAture learning(MAFEA), which encodes query and exemplar features mutually aware of each other from the outset.
Our model reaches a new state-of-the-art performance on the two challenging benchmarks, FSCD-LVIS and FSC-147, with a remarkably reduced degree of the target confusion problem.
arXiv Detail & Related papers (2024-08-19T06:46:24Z) - Enhancing Zero-shot Counting via Language-guided Exemplar Learning [17.479926342093677]
Class-Agnostic Counting (CAC) problem has garnered increasing attention owing to its intriguing generality and superior efficiency.
This paper proposes a novel ExpressCount to enhance zero-shot object counting by delving deeply into language-guided exemplar learning.
The ExpressCount is comprised of an innovative Language-oriented Exemplar Perceptron and a downstream visual Zero-shot Counting pipeline.
arXiv Detail & Related papers (2024-02-08T04:07:38Z) - FECANet: Boosting Few-Shot Semantic Segmentation with Feature-Enhanced
Context-Aware Network [48.912196729711624]
Few-shot semantic segmentation is the task of learning to locate each pixel of a novel class in a query image with only a few annotated support images.
We propose a Feature-Enhanced Context-Aware Network (FECANet) to suppress the matching noise caused by inter-class local similarity.
In addition, we propose a novel correlation reconstruction module that encodes extra correspondence relations between foreground and background and multi-scale context semantic features.
arXiv Detail & Related papers (2023-01-19T16:31:13Z) - Counting Like Human: Anthropoid Crowd Counting on Modeling the
Similarity of Objects [92.80955339180119]
mainstream crowd counting methods regress density map and integrate it to obtain counting results.
Inspired by this, we propose a rational and anthropoid crowd counting framework.
arXiv Detail & Related papers (2022-12-02T07:00:53Z) - CAD: Co-Adapting Discriminative Features for Improved Few-Shot
Classification [11.894289991529496]
Few-shot classification is a challenging problem that aims to learn a model that can adapt to unseen classes given a few labeled samples.
Recent approaches pre-train a feature extractor, and then fine-tune for episodic meta-learning.
We propose a strategy to cross-attend and re-weight discriminative features for few-shot classification.
arXiv Detail & Related papers (2022-03-25T06:14:51Z) - Spatio-temporal Relation Modeling for Few-shot Action Recognition [100.3999454780478]
We propose a few-shot action recognition framework, STRM, which enhances class-specific featureriminability while simultaneously learning higher-order temporal representations.
Our approach achieves an absolute gain of 3.5% in classification accuracy, as compared to the best existing method in the literature.
arXiv Detail & Related papers (2021-12-09T18:59:14Z) - A Few-Shot Sequential Approach for Object Counting [63.82757025821265]
We introduce a class attention mechanism that sequentially attends to objects in the image and extracts their relevant features.
The proposed technique is trained on point-level annotations and uses a novel loss function that disentangles class-dependent and class-agnostic aspects of the model.
We present our results on a variety of object-counting/detection datasets, including FSOD and MS COCO.
arXiv Detail & Related papers (2020-07-03T18:23:39Z) - Self-Supervised Tuning for Few-Shot Segmentation [82.32143982269892]
Few-shot segmentation aims at assigning a category label to each image pixel with few annotated samples.
Existing meta-learning method tends to fail in generating category-specifically discriminative descriptor when the visual features extracted from support images are marginalized in embedding space.
This paper presents an adaptive framework tuning, in which the distribution of latent features across different episodes is dynamically adjusted based on a self-segmentation scheme.
arXiv Detail & Related papers (2020-04-12T03:53:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.