CAN: Feature Co-Action for Click-Through Rate Prediction
- URL: http://arxiv.org/abs/2011.05625v3
- Date: Tue, 7 Dec 2021 06:16:07 GMT
- Title: CAN: Feature Co-Action for Click-Through Rate Prediction
- Authors: Weijie Bian, Kailun Wu, Lejian Ren, Qi Pi, Yujing Zhang, Can Xiao,
Xiang-Rong Sheng, Yong-Nan Zhu, Zhangming Chan, Na Mou, Xinchen Luo, Shiming
Xiang, Guorui Zhou, Xiaoqiang Zhu, Hongbo Deng
- Abstract summary: We propose a Co-Action Network (CAN) to approximate the explicit pairwise feature interactions.
CAN outperforms state-of-the-art CTR models and the cartesian product method.
CAN has been deployed in the display advertisement system in Alibaba, obtaining 12% improvement on CTR.
- Score: 42.251405364218805
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Feature interaction has been recognized as an important problem in machine
learning, which is also very essential for click-through rate (CTR) prediction
tasks. In recent years, Deep Neural Networks (DNNs) can automatically learn
implicit nonlinear interactions from original sparse features, and therefore
have been widely used in industrial CTR prediction tasks. However, the implicit
feature interactions learned in DNNs cannot fully retain the complete
representation capacity of the original and empirical feature interactions
(e.g., cartesian product) without loss. For example, a simple attempt to learn
the combination of feature A and feature B <A, B> as the explicit cartesian
product representation of new features can outperform previous implicit feature
interaction models including factorization machine (FM)-based models and their
variations. In this paper, we propose a Co-Action Network (CAN) to approximate
the explicit pairwise feature interactions without introducing too many
additional parameters. More specifically, giving feature A and its associated
feature B, their feature interaction is modeled by learning two sets of
parameters: 1) the embedding of feature A, and 2) a Multi-Layer Perceptron
(MLP) to represent feature B. The approximated feature interaction can be
obtained by passing the embedding of feature A through the MLP network of
feature B. We refer to such pairwise feature interaction as feature co-action,
and such a Co-Action Network unit can provide a very powerful capacity to
fitting complex feature interactions. Experimental results on public and
industrial datasets show that CAN outperforms state-of-the-art CTR models and
the cartesian product method. Moreover, CAN has been deployed in the display
advertisement system in Alibaba, obtaining 12\% improvement on CTR and 8\% on
Revenue Per Mille (RPM), which is a great improvement to the business.
Related papers
- Tuning Pre-trained Model via Moment Probing [62.445281364055795]
We propose a novel Moment Probing (MP) method to explore the potential of LP.
MP performs a linear classification head based on the mean of final features.
Our MP significantly outperforms LP and is competitive with counterparts at less training cost.
arXiv Detail & Related papers (2023-07-21T04:15:02Z) - Disentangled Representation Learning for Text-Video Retrieval [51.861423831566626]
Cross-modality interaction is a critical component in Text-Video Retrieval (TVR)
We study the interaction paradigm in depth, where we find that its computation can be split into two terms.
We propose a disentangled framework to capture a sequential and hierarchical representation.
arXiv Detail & Related papers (2022-03-14T13:55:33Z) - Masked Transformer for Neighhourhood-aware Click-Through Rate Prediction [74.52904110197004]
We propose Neighbor-Interaction based CTR prediction, which put this task into a Heterogeneous Information Network (HIN) setting.
In order to enhance the representation of the local neighbourhood, we consider four types of topological interaction among the nodes.
We conduct comprehensive experiments on two real world datasets and the experimental results show that our proposed method outperforms state-of-the-art CTR models significantly.
arXiv Detail & Related papers (2022-01-25T12:44:23Z) - Memorize, Factorize, or be Na\"ive: Learning Optimal Feature Interaction
Methods for CTR Prediction [29.343267933348372]
We propose a framework called OptInter which finds the most suitable modelling method for each feature interaction.
Our experiments show that OptInter improves the best performed state-of-the-art baseline deep CTR models by up to 2.21%.
arXiv Detail & Related papers (2021-08-03T03:03:34Z) - GraphFM: Graph Factorization Machines for Feature Interaction Modeling [27.307086868266012]
We propose a novel approach, Graph Factorization Machine (GraphFM), by naturally representing features in the graph structure.
In particular, we design a mechanism to select the beneficial feature interactions and formulate them as edges between features.
The proposed model integrates the interaction function of FM into the feature aggregation strategy of Graph Neural Network (GNN)
arXiv Detail & Related papers (2021-05-25T12:10:54Z) - AdnFM: An Attentive DenseNet based Factorization Machine for CTR
Prediction [11.958336595818267]
We propose a novel model called Attentive DenseNet based Factorization Machines (AdnFM)
AdnFM can extract more comprehensive deep features by using all the hidden layers from a feed-forward neural network as implicit high-order features.
Experiments on two real-world datasets show that the proposed model can effectively improve the performance of Click-Through-Rate prediction.
arXiv Detail & Related papers (2020-12-20T01:00:39Z) - AutoDis: Automatic Discretization for Embedding Numerical Features in
CTR Prediction [45.69943728028556]
Learning sophisticated feature interactions is crucial for Click-Through Rate (CTR) prediction in recommender systems.
Various deep CTR models follow an Embedding & Feature Interaction paradigm.
We propose AutoDis, a framework that discretizes features in numerical fields automatically and is optimized with CTR models in an end-to-end manner.
arXiv Detail & Related papers (2020-12-16T14:31:31Z) - FIVES: Feature Interaction Via Edge Search for Large-Scale Tabular Data [106.76845921324704]
We propose a novel method named Feature Interaction Via Edge Search (FIVES)
FIVES formulates the task of interactive feature generation as searching for edges on the defined feature graph.
In this paper, we present our theoretical evidence that motivates us to search for useful interactive features with increasing order.
arXiv Detail & Related papers (2020-07-29T03:33:18Z) - Feature Interaction based Neural Network for Click-Through Rate
Prediction [5.095988654970358]
We propose a Feature Interaction based Neural Network (FINN) which is able to model feature interaction via a 3-dimention relation tensor.
We show that our deep FINN model outperforms other state-of-the-art deep models such as PNN and DeepFM.
It also indicates that our models can effectively learn the feature interactions, and achieve better performances in real-world datasets.
arXiv Detail & Related papers (2020-06-07T03:53:24Z) - AutoFIS: Automatic Feature Interaction Selection in Factorization Models
for Click-Through Rate Prediction [75.16836697734995]
We propose a two-stage algorithm called Automatic Feature Interaction Selection (AutoFIS)
AutoFIS can automatically identify important feature interactions for factorization models with computational cost just equivalent to training the target model to convergence.
AutoFIS has been deployed onto the training platform of Huawei App Store recommendation service.
arXiv Detail & Related papers (2020-03-25T06:53:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.