XCrossNet: Feature Structure-Oriented Learning for Click-Through Rate
Prediction
- URL: http://arxiv.org/abs/2104.10907v1
- Date: Thu, 22 Apr 2021 07:37:36 GMT
- Title: XCrossNet: Feature Structure-Oriented Learning for Click-Through Rate
Prediction
- Authors: Runlong Yu, Yuyang Ye, Qi Liu, Zihan Wang, Chunfeng Yang, Yucheng Hu,
Enhong Chen
- Abstract summary: We propose a novel Extreme Cross Network, abbreviated XCrossNet, which aims at learning dense and sparse feature interactions in an explicit manner.
XCrossNet as a feature structure-oriented model leads to a more expressive representation and a more precise CTR prediction.
Experimental studies on Criteo Kaggle dataset show significant improvement of XCrossNet over state-of-the-art models on both effectiveness and efficiency.
- Score: 46.72935114485706
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Click-Through Rate (CTR) prediction is a core task in nowadays commercial
recommender systems. Feature crossing, as the mainline of research on CTR
prediction, has shown a promising way to enhance predictive performance.
Even though various models are able to learn feature interactions without
manual feature engineering, they rarely attempt to individually learn
representations for different feature structures.
In particular, they mainly focus on the modeling of cross sparse features but
neglect to specifically represent cross dense features.
Motivated by this, we propose a novel Extreme Cross Network, abbreviated
XCrossNet, which aims at learning dense and sparse feature interactions in an
explicit manner.
XCrossNet as a feature structure-oriented model leads to a more expressive
representation and a more precise CTR prediction, which is not only explicit
and interpretable, but also time-efficient and easy to implement.
Experimental studies on Criteo Kaggle dataset show significant improvement of
XCrossNet over state-of-the-art models on both effectiveness and efficiency.
Related papers
- Improving Network Interpretability via Explanation Consistency Evaluation [56.14036428778861]
We propose a framework that acquires more explainable activation heatmaps and simultaneously increase the model performance.
Specifically, our framework introduces a new metric, i.e., explanation consistency, to reweight the training samples adaptively in model learning.
Our framework then promotes the model learning by paying closer attention to those training samples with a high difference in explanations.
arXiv Detail & Related papers (2024-08-08T17:20:08Z) - A Click-Through Rate Prediction Method Based on Cross-Importance of Multi-Order Features [4.820576346277399]
This paper proposes a new model, FiiNet (Multiple Order Feature Interaction Importance Neural Networks)
The model first uses the selective kernel network (SKNet) to explicitly construct multi-order feature crosses.
It dynamically learns the importance of feature interaction combinations in a fine grained manner.
arXiv Detail & Related papers (2024-05-14T16:05:57Z) - Unveiling Backbone Effects in CLIP: Exploring Representational Synergies
and Variances [49.631908848868505]
Contrastive Language-Image Pretraining (CLIP) stands out as a prominent method for image representation learning.
We investigate the differences in CLIP performance among various neural architectures.
We propose a simple, yet effective approach to combine predictions from multiple backbones, leading to a notable performance boost of up to 6.34%.
arXiv Detail & Related papers (2023-12-22T03:01:41Z) - MAP: A Model-agnostic Pretraining Framework for Click-through Rate
Prediction [39.48740397029264]
We propose a Model-agnostic pretraining (MAP) framework that applies feature corruption and recovery on multi-field categorical data.
We derive two practical algorithms: masked feature prediction (RFD) and replaced feature detection (RFD)
arXiv Detail & Related papers (2023-08-03T12:55:55Z) - Rethinking Range View Representation for LiDAR Segmentation [66.73116059734788]
"Many-to-one" mapping, semantic incoherence, and shape deformation are possible impediments against effective learning from range view projections.
We present RangeFormer, a full-cycle framework comprising novel designs across network architecture, data augmentation, and post-processing.
We show that, for the first time, a range view method is able to surpass the point, voxel, and multi-view fusion counterparts in the competing LiDAR semantic and panoptic segmentation benchmarks.
arXiv Detail & Related papers (2023-03-09T16:13:27Z) - DCAP: Deep Cross Attentional Product Network for User Response
Prediction [20.17934000984361]
We propose a novel architecture Deep Cross Attentional Product Network (DCAP)
DCAP keeps cross network's benefits in modeling high-order feature interactions explicitly at the vector-wise level.
Our proposed model can be easily implemented and train in parallel.
arXiv Detail & Related papers (2021-05-18T16:27:20Z) - Explicit Semantic Cross Feature Learning via Pre-trained Graph Neural
Networks for CTR Prediction [14.270296688394762]
Cross features play an important role in click-through rate (CTR) prediction.
Most of the existing methods adopt a DNN-based model to capture the cross features in an implicit manner.
We propose Pre-trained Cross Feature learning Graph Neural Networks (PCF-GNN), a GNN based pre-trained model aiming at generating cross features in an explicit fashion.
arXiv Detail & Related papers (2021-05-17T11:56:04Z) - Towards Automated Neural Interaction Discovery for Click-Through Rate
Prediction [64.03526633651218]
Click-Through Rate (CTR) prediction is one of the most important machine learning tasks in recommender systems.
We propose an automated interaction architecture discovering framework for CTR prediction named AutoCTR.
arXiv Detail & Related papers (2020-06-29T04:33:01Z) - Feature Interaction based Neural Network for Click-Through Rate
Prediction [5.095988654970358]
We propose a Feature Interaction based Neural Network (FINN) which is able to model feature interaction via a 3-dimention relation tensor.
We show that our deep FINN model outperforms other state-of-the-art deep models such as PNN and DeepFM.
It also indicates that our models can effectively learn the feature interactions, and achieve better performances in real-world datasets.
arXiv Detail & Related papers (2020-06-07T03:53:24Z) - Cross-GCN: Enhancing Graph Convolutional Network with $k$-Order Feature
Interactions [153.6357310444093]
Graph Convolutional Network (GCN) is an emerging technique that performs learning and reasoning on graph data.
We argue that existing designs of GCN forgo modeling cross features, making GCN less effective for tasks or data where cross features are important.
We design a new operator named Cross-feature Graph Convolution, which explicitly models the arbitrary-order cross features with complexity linear to feature dimension and order size.
arXiv Detail & Related papers (2020-03-05T13:05:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.