Virtual Classification: Modulating Domain-Specific Knowledge for
Multidomain Crowd Counting
- URL: http://arxiv.org/abs/2402.03758v1
- Date: Tue, 6 Feb 2024 06:49:04 GMT
- Title: Virtual Classification: Modulating Domain-Specific Knowledge for
Multidomain Crowd Counting
- Authors: Mingyue Guo, Binghui Chen, Zhaoyi Yan, Yaowei Wang, Qixiang Ye
- Abstract summary: Multidomain crowd counting aims to learn a general model for multiple diverse datasets.
Deep networks prefer modeling distributions of the dominant domains instead of all domains, which is known as domain bias.
We propose a Modulating Domain-specific Knowledge Network (MDKNet) to handle the domain bias issue in multidomain crowd counting.
- Score: 67.38137379297717
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multidomain crowd counting aims to learn a general model for multiple diverse
datasets. However, deep networks prefer modeling distributions of the dominant
domains instead of all domains, which is known as domain bias. In this study,
we propose a simple-yet-effective Modulating Domain-specific Knowledge Network
(MDKNet) to handle the domain bias issue in multidomain crowd counting. MDKNet
is achieved by employing the idea of `modulating', enabling deep network
balancing and modeling different distributions of diverse datasets with little
bias. Specifically, we propose an Instance-specific Batch Normalization (IsBN)
module, which serves as a base modulator to refine the information flow to be
adaptive to domain distributions. To precisely modulating the domain-specific
information, the Domain-guided Virtual Classifier (DVC) is then introduced to
learn a domain-separable latent space. This space is employed as an input
guidance for the IsBN modulator, such that the mixture distributions of
multiple datasets can be well treated. Extensive experiments performed on
popular benchmarks, including Shanghai-tech A/B, QNRF and NWPU, validate the
superiority of MDKNet in tackling multidomain crowd counting and the
effectiveness for multidomain learning. Code is available at
\url{https://github.com/csguomy/MDKNet}.
Related papers
- MLoRA: Multi-Domain Low-Rank Adaptive Network for CTR Prediction [18.524017579108044]
We propose a Multi-domain Low-Rank Adaptive network (MLoRA) for CTR prediction, where we introduce a specialized LoRA module for each domain.
Experimental results demonstrate our MLoRA approach achieves a significant improvement compared with state-of-the-art baselines.
The code of our MLoRA is publicly available.
arXiv Detail & Related papers (2024-08-14T05:53:02Z) - Domain-General Crowd Counting in Unseen Scenarios [25.171343652312974]
Domain shift across crowd data severely hinders crowd counting models to generalize to unseen scenarios.
We introduce a dynamic sub-domain division scheme which divides the source domain into multiple sub-domains.
In order to disentangle domain-invariant information from domain-specific information in image features, we design the domain-invariant and -specific crowd memory modules.
arXiv Detail & Related papers (2022-12-05T19:52:28Z) - AdaptDHM: Adaptive Distribution Hierarchical Model for Multi-Domain CTR
Prediction [4.299153274884263]
We propose an elegant and flexible multi-distribution modeling paradigm, named Adaptive Distribution Hierarchical Model (AdaptDHM)
Our model achieves impressive prediction accuracy and its time cost during the training stage is more than 50% less than that of other models.
arXiv Detail & Related papers (2022-11-22T09:10:37Z) - Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval [55.122020263319634]
Video moment retrieval (VMR) aims to localize the target moment from an untrimmed video according to a given language query.
In this paper, we focus on a novel task: cross-domain VMR, where fully-annotated datasets are available in one domain but the domain of interest only contains unannotated datasets.
We propose a novel Multi-Modal Cross-Domain Alignment network to transfer the annotation knowledge from the source domain to the target domain.
arXiv Detail & Related papers (2022-09-23T12:58:20Z) - Variational Attention: Propagating Domain-Specific Knowledge for
Multi-Domain Learning in Crowd Counting [75.80116276369694]
In crowd counting, due to the problem of laborious labelling, it is perceived intractability of collecting a new large-scale dataset.
We resort to the multi-domain joint learning and propose a simple but effective Domain-specific Knowledge Propagating Network (DKPNet)
It is mainly achieved by proposing the novel Variational Attention(VA) technique for explicitly modeling the attention distributions for different domains.
arXiv Detail & Related papers (2021-08-18T08:06:37Z) - Domain Consistency Regularization for Unsupervised Multi-source Domain
Adaptive Classification [57.92800886719651]
Deep learning-based multi-source unsupervised domain adaptation (MUDA) has been actively studied in recent years.
domain shift in MUDA exists not only between the source and target domains but also among multiple source domains.
We propose an end-to-end trainable network that exploits domain Consistency Regularization for unsupervised Multi-source domain Adaptive classification.
arXiv Detail & Related papers (2021-06-16T07:29:27Z) - Mutual Learning Network for Multi-Source Domain Adaptation [73.25974539191553]
We propose a novel multi-source domain adaptation method, Mutual Learning Network for Multiple Source Domain Adaptation (ML-MSDA)
Under the framework of mutual learning, the proposed method pairs the target domain with each single source domain to train a conditional adversarial domain adaptation network as a branch network.
The proposed method outperforms the comparison methods and achieves the state-of-the-art performance.
arXiv Detail & Related papers (2020-03-29T04:31:43Z) - MADAN: Multi-source Adversarial Domain Aggregation Network for Domain
Adaptation [58.38749495295393]
Domain adaptation aims to learn a transferable model to bridge the domain shift between one labeled source domain and another sparsely labeled or unlabeled target domain.
Recent multi-source domain adaptation (MDA) methods do not consider the pixel-level alignment between sources and target.
We propose a novel MDA framework to address these challenges.
arXiv Detail & Related papers (2020-02-19T21:22:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.