AutoDis: Automatic Discretization for Embedding Numerical Features in
CTR Prediction
- URL: http://arxiv.org/abs/2012.08986v1
- Date: Wed, 16 Dec 2020 14:31:31 GMT
- Title: AutoDis: Automatic Discretization for Embedding Numerical Features in
CTR Prediction
- Authors: Huifeng Guo, Bo Chen, Ruiming Tang, Zhenguo Li, Xiuqiang He
- Abstract summary: Learning sophisticated feature interactions is crucial for Click-Through Rate (CTR) prediction in recommender systems.
Various deep CTR models follow an Embedding & Feature Interaction paradigm.
We propose AutoDis, a framework that discretizes features in numerical fields automatically and is optimized with CTR models in an end-to-end manner.
- Score: 45.69943728028556
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Learning sophisticated feature interactions is crucial for Click-Through Rate
(CTR) prediction in recommender systems. Various deep CTR models follow an
Embedding & Feature Interaction paradigm. The majority focus on designing
network architectures in Feature Interaction module to better model feature
interactions while the Embedding module, serving as a bottleneck between data
and Feature Interaction module, has been overlooked. The common methods for
numerical feature embedding are Normalization and Discretization. The former
shares a single embedding for intra-field features and the latter transforms
the features into categorical form through various discretization approaches.
However, the first approach surfers from low capacity and the second one limits
performance as well because the discretization rule cannot be optimized with
the ultimate goal of CTR model. To fill the gap of representing numerical
features, in this paper, we propose AutoDis, a framework that discretizes
features in numerical fields automatically and is optimized with CTR models in
an end-to-end manner. Specifically, we introduce a set of meta-embeddings for
each numerical field to model the relationship among the intra-field features
and propose an automatic differentiable discretization and aggregation approach
to capture the correlations between the numerical features and meta-embeddings.
Comprehensive experiments on two public and one industrial datasets are
conducted to validate the effectiveness of AutoDis over the SOTA methods.
Related papers
- AdaptDHM: Adaptive Distribution Hierarchical Model for Multi-Domain CTR
Prediction [4.299153274884263]
We propose an elegant and flexible multi-distribution modeling paradigm, named Adaptive Distribution Hierarchical Model (AdaptDHM)
Our model achieves impressive prediction accuracy and its time cost during the training stage is more than 50% less than that of other models.
arXiv Detail & Related papers (2022-11-22T09:10:37Z) - Meta-Wrapper: Differentiable Wrapping Operator for User Interest
Selection in CTR Prediction [97.99938802797377]
Click-through rate (CTR) prediction, whose goal is to predict the probability of the user to click on an item, has become increasingly significant in recommender systems.
Recent deep learning models with the ability to automatically extract the user interest from his/her behaviors have achieved great success.
We propose a novel approach under the framework of the wrapper method, which is named Meta-Wrapper.
arXiv Detail & Related papers (2022-06-28T03:28:15Z) - Masked Transformer for Neighhourhood-aware Click-Through Rate Prediction [74.52904110197004]
We propose Neighbor-Interaction based CTR prediction, which put this task into a Heterogeneous Information Network (HIN) setting.
In order to enhance the representation of the local neighbourhood, we consider four types of topological interaction among the nodes.
We conduct comprehensive experiments on two real world datasets and the experimental results show that our proposed method outperforms state-of-the-art CTR models significantly.
arXiv Detail & Related papers (2022-01-25T12:44:23Z) - Dynamic Parameterized Network for CTR Prediction [6.749659219776502]
We proposed a novel plug-in operation, Dynamic ized Operation (DPO), to learn both explicit and implicit interaction instance-wisely.
We showed that the introduction of DPO into DNN modules and Attention modules can respectively benefit two main tasks in click-through rate (CTR) prediction.
Our Dynamic ized Networks significantly outperforms state-of-the-art methods in the offline experiments on the public dataset and real-world production dataset.
arXiv Detail & Related papers (2021-11-09T08:15:03Z) - Memorize, Factorize, or be Na\"ive: Learning Optimal Feature Interaction
Methods for CTR Prediction [29.343267933348372]
We propose a framework called OptInter which finds the most suitable modelling method for each feature interaction.
Our experiments show that OptInter improves the best performed state-of-the-art baseline deep CTR models by up to 2.21%.
arXiv Detail & Related papers (2021-08-03T03:03:34Z) - Multi-path Neural Networks for On-device Multi-domain Visual
Classification [55.281139434736254]
This paper proposes a novel approach to automatically learn a multi-path network for multi-domain visual classification on mobile devices.
The proposed multi-path network is learned from neural architecture search by applying one reinforcement learning controller for each domain to select the best path in the super-network created from a MobileNetV3-like search space.
The determined multi-path model selectively shares parameters across domains in shared nodes while keeping domain-specific parameters within non-shared nodes in individual domain paths.
arXiv Detail & Related papers (2020-10-10T05:13:49Z) - Multi-Partition Embedding Interaction with Block Term Format for
Knowledge Graph Completion [3.718476964451589]
Knowledge graph embedding methods perform the task by representing entities and relations as embedding vectors.
Previous work has usually treated each embedding as a whole and has modeled the interactions between these whole embeddings.
We propose the multi- partition embedding interaction (MEI) model with block term format to address this problem.
arXiv Detail & Related papers (2020-06-29T20:37:11Z) - Towards Automated Neural Interaction Discovery for Click-Through Rate
Prediction [64.03526633651218]
Click-Through Rate (CTR) prediction is one of the most important machine learning tasks in recommender systems.
We propose an automated interaction architecture discovering framework for CTR prediction named AutoCTR.
arXiv Detail & Related papers (2020-06-29T04:33:01Z) - AutoFIS: Automatic Feature Interaction Selection in Factorization Models
for Click-Through Rate Prediction [75.16836697734995]
We propose a two-stage algorithm called Automatic Feature Interaction Selection (AutoFIS)
AutoFIS can automatically identify important feature interactions for factorization models with computational cost just equivalent to training the target model to convergence.
AutoFIS has been deployed onto the training platform of Huawei App Store recommendation service.
arXiv Detail & Related papers (2020-03-25T06:53:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.