Automated Algorithm Selection: from Feature-Based to Feature-Free
Approaches
- URL: http://arxiv.org/abs/2203.13392v1
- Date: Thu, 24 Mar 2022 23:59:50 GMT
- Title: Automated Algorithm Selection: from Feature-Based to Feature-Free
Approaches
- Authors: Mohamad Alissa, Kevin Sim and Emma Hart
- Abstract summary: We propose a novel technique for algorithm-selection, applicable to optimisation in which there is implicit sequential information encapsulated in the data.
We train two types of recurrent neural networks to predict a packing in online bin-packing, selecting from four well-known domains.
- Score: 0.5801044612920815
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We propose a novel technique for algorithm-selection, applicable to
optimisation domains in which there is implicit sequential information
encapsulated in the data, e.g., in online bin-packing. Specifically we train
two types of recurrent neural networks to predict a packing heuristic in online
bin-packing, selecting from four well-known heuristics. As input, the RNN
methods only use the sequence of item-sizes. This contrasts to typical
approaches to algorithm-selection which require a model to be trained using
domain-specific instance features that need to be first derived from the input
data. The RNN approaches are shown to be capable of achieving within 5% of the
oracle performance on between 80.88% to 97.63% of the instances, depending on
the dataset. They are also shown to outperform classical machine learning
models trained using derived features. Finally, we hypothesise that the
proposed methods perform well when the instances exhibit some implicit
structure that results in discriminatory performance with respect to a set of
heuristics. We test this hypothesis by generating fourteen new datasets with
increasing levels of structure, and show that there is a critical threshold of
structure required before algorithm-selection delivers benefit.
Related papers
- Towards Free Data Selection with General-Purpose Models [71.92151210413374]
A desirable data selection algorithm can efficiently choose the most informative samples to maximize the utility of limited annotation budgets.
Current approaches, represented by active learning methods, typically follow a cumbersome pipeline that iterates the time-consuming model training and batch data selection repeatedly.
FreeSel bypasses the heavy batch selection process, achieving a significant improvement in efficiency and being 530x faster than existing active learning methods.
arXiv Detail & Related papers (2023-09-29T15:50:14Z) - Automatic learning algorithm selection for classification via
convolutional neural networks [0.0]
The goal of this study is to learn the inherent structure of the data without identifying meta-features.
Experiments with simulated datasets show that the proposed approach achieves nearly perfect performance in identifying linear and nonlinear patterns.
arXiv Detail & Related papers (2023-05-16T01:57:01Z) - Revisit the Algorithm Selection Problem for TSP with Spatial Information
Enhanced Graph Neural Networks [4.084365114504618]
This paper revisits the algorithm selection problem for Euclidean Traveling Salesman Problem (TSP)
We propose a novel Graph Neural Network (GNN), called GINES.
GINES takes the coordinates of cities and distances between cities as input.
It is better than the traditional handcrafted feature-based approach on one dataset.
arXiv Detail & Related papers (2023-02-08T13:14:20Z) - Meta-Wrapper: Differentiable Wrapping Operator for User Interest
Selection in CTR Prediction [97.99938802797377]
Click-through rate (CTR) prediction, whose goal is to predict the probability of the user to click on an item, has become increasingly significant in recommender systems.
Recent deep learning models with the ability to automatically extract the user interest from his/her behaviors have achieved great success.
We propose a novel approach under the framework of the wrapper method, which is named Meta-Wrapper.
arXiv Detail & Related papers (2022-06-28T03:28:15Z) - HyperImpute: Generalized Iterative Imputation with Automatic Model
Selection [77.86861638371926]
We propose a generalized iterative imputation framework for adaptively and automatically configuring column-wise models.
We provide a concrete implementation with out-of-the-box learners, simulators, and interfaces.
arXiv Detail & Related papers (2022-06-15T19:10:35Z) - Dynamic Instance-Wise Classification in Correlated Feature Spaces [15.351282873821935]
In a typical machine learning setting, the predictions on all test instances are based on a common subset of features discovered during model training.
A new method is proposed that sequentially selects the best feature to evaluate for each test instance individually, and stops the selection process to make a prediction once it determines that no further improvement can be achieved with respect to classification accuracy.
The effectiveness, generalizability, and scalability of the proposed method is illustrated on a variety of real-world datasets from diverse application domains.
arXiv Detail & Related papers (2021-06-08T20:20:36Z) - Beta-CoRM: A Bayesian Approach for $n$-gram Profiles Analysis [0.0]
The flexibility of the proposed modelling allows to consider a straightforward approach to feature selection in the generative model.
A slice sampling algorithm is derived for a fast inferential procedure, which is applied to synthetic and real data scenarios.
arXiv Detail & Related papers (2020-11-23T17:12:34Z) - Embedded methods for feature selection in neural networks [0.0]
Black box models like neural networks negatively affect the interpretability, generalizability, and the training time of these models.
I propose two integrated approaches for feature selection that can be incorporated directly into the parameter learning.
I benchmarked both the methods against Permutation Feature Importance (PFI) - a general-purpose feature ranking method and a random baseline.
arXiv Detail & Related papers (2020-10-12T16:33:46Z) - Pre-Trained Models for Heterogeneous Information Networks [57.78194356302626]
We propose a self-supervised pre-training and fine-tuning framework, PF-HIN, to capture the features of a heterogeneous information network.
PF-HIN consistently and significantly outperforms state-of-the-art alternatives on each of these tasks, on four datasets.
arXiv Detail & Related papers (2020-07-07T03:36:28Z) - FedPD: A Federated Learning Framework with Optimal Rates and Adaptivity
to Non-IID Data [59.50904660420082]
Federated Learning (FL) has become a popular paradigm for learning from distributed data.
To effectively utilize data at different devices without moving them to the cloud, algorithms such as the Federated Averaging (FedAvg) have adopted a "computation then aggregation" (CTA) model.
arXiv Detail & Related papers (2020-05-22T23:07:42Z) - Model Fusion via Optimal Transport [64.13185244219353]
We present a layer-wise model fusion algorithm for neural networks.
We show that this can successfully yield "one-shot" knowledge transfer between neural networks trained on heterogeneous non-i.i.d. data.
arXiv Detail & Related papers (2019-10-12T22:07:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.