AdaptDHM: Adaptive Distribution Hierarchical Model for Multi-Domain CTR
Prediction
- URL: http://arxiv.org/abs/2211.12105v1
- Date: Tue, 22 Nov 2022 09:10:37 GMT
- Title: AdaptDHM: Adaptive Distribution Hierarchical Model for Multi-Domain CTR
Prediction
- Authors: Jinyun Li, Huiwen Zheng, Yuanlin Liu, Minfang Lu, Lixia Wu, Haoyuan Hu
- Abstract summary: We propose an elegant and flexible multi-distribution modeling paradigm, named Adaptive Distribution Hierarchical Model (AdaptDHM)
Our model achieves impressive prediction accuracy and its time cost during the training stage is more than 50% less than that of other models.
- Score: 4.299153274884263
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Large-scale commercial platforms usually involve numerous business domains
for diverse business strategies and expect their recommendation systems to
provide click-through rate (CTR) predictions for multiple domains
simultaneously. Existing promising and widely-used multi-domain models discover
domain relationships by explicitly constructing domain-specific networks, but
the computation and memory boost significantly with the increase of domains. To
reduce computational complexity, manually grouping domains with particular
business strategies is common in industrial applications. However, this
pre-defined data partitioning way heavily relies on prior knowledge, and it may
neglect the underlying data distribution of each domain, hence limiting the
model's representation capability. Regarding the above issues, we propose an
elegant and flexible multi-distribution modeling paradigm, named Adaptive
Distribution Hierarchical Model (AdaptDHM), which is an end-to-end optimization
hierarchical structure consisting of a clustering process and classification
process. Specifically, we design a distribution adaptation module with a
customized dynamic routing mechanism. Instead of introducing prior knowledge
for pre-defined data allocation, this routing algorithm adaptively provides a
distribution coefficient for each sample to determine which cluster it belongs
to. Each cluster corresponds to a particular distribution so that the model can
sufficiently capture the commonalities and distinctions between these distinct
clusters. Extensive experiments on both public and large-scale Alibaba
industrial datasets verify the effectiveness and efficiency of AdaptDHM: Our
model achieves impressive prediction accuracy and its time cost during the
training stage is more than 50% less than that of other models.
Related papers
- MLoRA: Multi-Domain Low-Rank Adaptive Network for CTR Prediction [18.524017579108044]
We propose a Multi-domain Low-Rank Adaptive network (MLoRA) for CTR prediction, where we introduce a specialized LoRA module for each domain.
Experimental results demonstrate our MLoRA approach achieves a significant improvement compared with state-of-the-art baselines.
The code of our MLoRA is publicly available.
arXiv Detail & Related papers (2024-08-14T05:53:02Z) - Virtual Classification: Modulating Domain-Specific Knowledge for
Multidomain Crowd Counting [67.38137379297717]
Multidomain crowd counting aims to learn a general model for multiple diverse datasets.
Deep networks prefer modeling distributions of the dominant domains instead of all domains, which is known as domain bias.
We propose a Modulating Domain-specific Knowledge Network (MDKNet) to handle the domain bias issue in multidomain crowd counting.
arXiv Detail & Related papers (2024-02-06T06:49:04Z) - Collaborating Foundation Models for Domain Generalized Semantic Segmentation [23.359941294938142]
Domain Generalized Semantic (DGSS) deals with training a model on a labeled source domain.
We take an approach to DGSS and propose to use an assembly of CoLlaborative FOUndation models for Domain Generalized Semantic (CLOUDS)
arXiv Detail & Related papers (2023-12-15T13:43:24Z) - From Big to Small: Adaptive Learning to Partial-Set Domains [94.92635970450578]
Domain adaptation targets at knowledge acquisition and dissemination from a labeled source domain to an unlabeled target domain under distribution shift.
Recent advances show that deep pre-trained models of large scale endow rich knowledge to tackle diverse downstream tasks of small scale.
This paper introduces Partial Domain Adaptation (PDA), a learning paradigm that relaxes the identical class space assumption to that the source class space subsumes the target class space.
arXiv Detail & Related papers (2022-03-14T07:02:45Z) - TAL: Two-stream Adaptive Learning for Generalizable Person
Re-identification [115.31432027711202]
We argue that both domain-specific and domain-invariant features are crucial for improving the generalization ability of re-id models.
We name two-stream adaptive learning (TAL) to simultaneously model these two kinds of information.
Our framework can be applied to both single-source and multi-source domain generalization tasks.
arXiv Detail & Related papers (2021-11-29T01:27:42Z) - Graphical Modeling for Multi-Source Domain Adaptation [56.05348879528149]
Multi-Source Domain Adaptation (MSDA) focuses on transferring the knowledge from multiple source domains to the target domain.
We propose two types of graphical models,i.e. Conditional Random Field for MSDA (CRF-MSDA) and Markov Random Field for MSDA (MRF-MSDA)
We evaluate these two models on four standard benchmark data sets of MSDA with distinct domain shift and data complexity.
arXiv Detail & Related papers (2021-04-27T09:04:22Z) - Instance Level Affinity-Based Transfer for Unsupervised Domain
Adaptation [74.71931918541748]
We propose an instance affinity based criterion for source to target transfer during adaptation, called ILA-DA.
We first propose a reliable and efficient method to extract similar and dissimilar samples across source and target, and utilize a multi-sample contrastive loss to drive the domain alignment process.
We verify the effectiveness of ILA-DA by observing consistent improvements in accuracy over popular domain adaptation approaches on a variety of benchmark datasets.
arXiv Detail & Related papers (2021-04-03T01:33:14Z) - Learning to Combine: Knowledge Aggregation for Multi-Source Domain
Adaptation [56.694330303488435]
We propose a Learning to Combine for Multi-Source Domain Adaptation (LtC-MSDA) framework.
In the nutshell, a knowledge graph is constructed on the prototypes of various domains to realize the information propagation among semantically adjacent representations.
Our approach outperforms existing methods with a remarkable margin.
arXiv Detail & Related papers (2020-07-17T07:52:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.