MEL: Efficient Multi-Task Evolutionary Learning for High-Dimensional Feature Selection
- URL: http://arxiv.org/abs/2402.08982v1
- Date: Wed, 14 Feb 2024 06:51:49 GMT
- Title: MEL: Efficient Multi-Task Evolutionary Learning for High-Dimensional Feature Selection
- Authors: Xubin Wang, Haojiong Shangguan, Fengyi Huang, Shangrui Wu and Weijia Jia
- Abstract summary: We propose a novel approach called PSO-based Multi-task Evolutionary Learning (MEL)
By incorporating information sharing between different feature selection tasks, MEL achieves enhanced learning ability and efficiency.
We evaluate the effectiveness of MEL through extensive experiments on 22 high-dimensional datasets.
- Score: 11.934379476825551
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Feature selection is a crucial step in data mining to enhance model
performance by reducing data dimensionality. However, the increasing
dimensionality of collected data exacerbates the challenge known as the "curse
of dimensionality", where computation grows exponentially with the number of
dimensions. To tackle this issue, evolutionary computation (EC) approaches
have gained popularity due to their simplicity and applicability.
Unfortunately, the diverse designs of EC methods result in varying abilities to
handle different data, and the information they acquire is often underutilized
and not shared effectively. In this paper, we propose a novel approach called PSO-based
Multi-task Evolutionary Learning (MEL) that leverages multi-task learning to
address these challenges. By incorporating information sharing between
different feature selection tasks, MEL achieves enhanced learning ability and
efficiency. We evaluate the effectiveness of MEL through extensive experiments
on 22 high-dimensional datasets. Comparing against 24 EC approaches, our method
exhibits strong competitiveness. Additionally, we have open-sourced our code on
GitHub at https://github.com/wangxb96/MEL.
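To make the idea concrete, here is a minimal, illustrative sketch of binary PSO feature selection with an extra velocity term that pulls particles toward another task's global best. The k-NN wrapper fitness, the coefficients, and the `c3` sharing rule are all placeholder assumptions, not the released MEL code; consult the repository above for the actual method.

```python
# Minimal sketch of binary PSO feature selection with cross-task sharing.
# Illustrative assumptions only -- NOT the released MEL implementation.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(0)

def fitness(mask, X, y):
    """Wrapper fitness: CV accuracy of a k-NN on the selected features,
    lightly penalized by the fraction of features kept."""
    if not mask.any():
        return 0.0
    acc = cross_val_score(KNeighborsClassifier(3), X[:, mask], y, cv=3).mean()
    return acc - 0.01 * mask.mean()

def multitask_pso(tasks, n_particles=15, iters=20, w=0.7, c1=1.5, c2=1.5, c3=0.5):
    """Each task keeps its own swarm; the c3 term attracts particles toward
    the best mask of a randomly chosen *other* task (the assumed sharing step)."""
    d = tasks[0][0].shape[1]
    pos = [rng.random((n_particles, d)) < 0.5 for _ in tasks]   # binary feature masks
    vel = [rng.normal(0.0, 0.1, (n_particles, d)) for _ in tasks]
    pbest = [p.copy() for p in pos]
    pfit = [np.array([fitness(m, X, y) for m in p]) for p, (X, y) in zip(pos, tasks)]
    gbest = [p[f.argmax()].copy() for p, f in zip(pbest, pfit)]
    for _ in range(iters):
        for t, (X, y) in enumerate(tasks):
            peer = gbest[rng.choice([i for i in range(len(tasks)) if i != t])]
            r1, r2, r3 = rng.random((3, n_particles, d))
            vel[t] = (w * vel[t]
                      + c1 * r1 * (pbest[t].astype(float) - pos[t])
                      + c2 * r2 * (gbest[t].astype(float) - pos[t])
                      + c3 * r3 * (peer.astype(float) - pos[t]))
            # Sigmoid transfer function turns velocities into bit-flip probabilities.
            pos[t] = rng.random((n_particles, d)) < 1.0 / (1.0 + np.exp(-vel[t]))
            fit = np.array([fitness(m, X, y) for m in pos[t]])
            better = fit > pfit[t]
            pbest[t][better], pfit[t][better] = pos[t][better], fit[better]
            gbest[t] = pbest[t][pfit[t].argmax()].copy()
    return gbest

tasks = [make_classification(120, 50, n_informative=5, random_state=s) for s in (1, 2)]
print([int(m.sum()) for m in multitask_pso(tasks)])  # features kept per task
```

The `c3` term is the only cross-task coupling in this sketch; MEL's actual information-sharing mechanism may differ.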
Related papers
- SGW-based Multi-Task Learning in Vision Tasks [8.459976488960269]
As the scale of datasets expands and the complexity of tasks increases, knowledge sharing becomes increasingly challenging.
We propose an information bottleneck knowledge extraction module (KEM).
This module aims to reduce inter-task interference by constraining the flow of information, thereby reducing computational complexity.
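As a rough illustration of "constraining the flow of information", the sketch below implements a generic variational information-bottleneck head in the style of Alemi et al.'s deep VIB; the class name, dimensions, and KL weight are hypothetical, and the paper's KEM is not reproduced here.

```python
# Generic variational information-bottleneck layer, shown only to illustrate
# constraining information flow -- an assumption, not the paper's KEM module.
import torch
import torch.nn as nn
import torch.nn.functional as F

class BottleneckHead(nn.Module):  # hypothetical name
    def __init__(self, in_dim, z_dim, n_classes):
        super().__init__()
        self.mu = nn.Linear(in_dim, z_dim)
        self.logvar = nn.Linear(in_dim, z_dim)
        self.cls = nn.Linear(z_dim, n_classes)

    def forward(self, h):
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)  # reparameterize
        # KL(q(z|h) || N(0, I)) caps how much information the bottleneck
        # carries, which is one way to limit inter-task interference.
        kl = 0.5 * (mu.pow(2) + logvar.exp() - logvar - 1).sum(dim=1).mean()
        return self.cls(z), kl

head = BottleneckHead(in_dim=128, z_dim=32, n_classes=10)
h = torch.randn(8, 128)                       # stand-in backbone features
logits, kl = head(h)
loss = F.cross_entropy(logits, torch.randint(0, 10, (8,))) + 1e-3 * kl
loss.backward()
```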
arXiv Detail & Related papers (2024-10-03T13:56:50Z)
- MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct [148.39859547619156]
We propose MMEvol, a novel multimodal instruction data evolution framework.
MMEvol iteratively improves data quality through a refined combination of fine-grained perception, cognitive reasoning, and interaction evolution.
Our approach reaches state-of-the-art (SOTA) performance on nine tasks while using significantly less data than competing models.
arXiv Detail & Related papers (2024-09-09T17:44:00Z)
- Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing [19.139077084857487]
We introduce MODA -- a Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing approach.
We develop a novel model-based multi-task offline RL algorithm.
Experiments conducted in a real-world multi-task urban setting validate the effectiveness of MODA.
arXiv Detail & Related papers (2024-06-20T07:24:24Z)
- Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning [50.73666458313015]
Large Language Models (LLMs) have demonstrated significant potential in performing multiple tasks in multimedia applications.
MoE has emerged as a promising solution, its sparse architecture enabling effective task decoupling.
Intuition-MoR1E achieves superior efficiency and a 2.15% overall accuracy improvement across 14 public datasets.
arXiv Detail & Related papers (2024-04-13T12:14:58Z)
- Exploring the Mystery of Influential Data for Mathematical Reasoning [127.61978092016228]
We propose a Quality-aware Diverse Selection (QaDS) strategy for mathematical reasoning.
A comparison with other selection strategies validates the superiority of QaDS.
With OpenMathMix, we achieve a state-of-the-art 48.8% accuracy on MATH with a 7B base model.
arXiv Detail & Related papers (2024-04-01T12:01:06Z)
- Reinforcement Learning Based Multi-modal Feature Fusion Network for Novel Class Discovery [47.28191501836041]
In this paper, we employ a Reinforcement Learning framework to simulate the cognitive processes of humans.
We also deploy a Member-to-Leader Multi-Agent framework to extract and fuse features from multi-modal information.
We demonstrate the performance of our approach in both the 3D and 2D domains by employing the OS-MN40, OS-MN40-Miss, and CIFAR-10 datasets.
arXiv Detail & Related papers (2023-08-26T07:55:32Z)
- A Dimensional Structure based Knowledge Distillation Method for Cross-Modal Learning [15.544134849816528]
We discover the correlation between feature discriminability and dimensional structure (DS) by analyzing and observing features extracted from simple and hard tasks.
We propose a novel cross-modal knowledge distillation (CMKD) method for better supervised cross-modal learning (CML) performance.
The proposed method enforces output features to be channel-wise independent and intermediate ones to be uniformly distributed, thereby learning semantically irrelevant features from the hard task to boost its accuracy.
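The summary names two constraints but not their exact losses. One plausible reading, sketched below under stated assumptions, uses an off-diagonal covariance penalty for channel-wise independence and a hypersphere uniformity term (as in Wang & Isola, 2020) for the uniform-distribution constraint; neither is confirmed to match the paper's CMKD objective.

```python
# Sketch of the two constraints named in the summary, under assumed forms:
# channel decorrelation for independence, Gaussian-potential uniformity for
# spread. NOT confirmed to be the paper's actual CMKD losses.
import torch
import torch.nn.functional as F

def decorrelation_loss(feats):
    """Penalize off-diagonal entries of the channel covariance matrix,
    pushing output feature channels toward independence."""
    f = feats - feats.mean(dim=0)
    cov = (f.T @ f) / (feats.shape[0] - 1)
    off_diag = cov - torch.diag(torch.diagonal(cov))
    return off_diag.pow(2).sum() / feats.shape[1]

def uniformity_loss(feats, t=2.0):
    """Encourage L2-normalized features to spread uniformly on the unit
    hypersphere (log mean Gaussian potential over pairwise distances)."""
    z = F.normalize(feats, dim=1)
    return torch.pdist(z, p=2).pow(2).mul(-t).exp().mean().log()

x = torch.randn(64, 256, requires_grad=True)   # stand-in features
loss = decorrelation_loss(x) + uniformity_loss(x)
loss.backward()
```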
arXiv Detail & Related papers (2023-06-28T07:29:26Z)
- Automatic Data Augmentation via Invariance-Constrained Learning [94.27081585149836]
Underlying data structures are often exploited to improve the solution of learning tasks.
Data augmentation induces these symmetries during training by applying multiple transformations to the input data.
This work tackles these issues by automatically adapting the data augmentation while solving the learning task.
arXiv Detail & Related papers (2022-09-29T18:11:01Z)
- An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems [4.675744559395732]
Multitask learning assumes that models capable of learning from multiple tasks can achieve better quality and efficiency via knowledge transfer.
State-of-the-art ML models rely on high customization for each task and leverage size and data scale rather than scaling the number of tasks.
We propose an evolutionary method that can generate a large-scale multitask model and supports the dynamic and continuous addition of new tasks.
arXiv Detail & Related papers (2022-05-25T13:10:47Z)
- Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation [51.21190751266442]
Domain adaptation (DA) tries to tackle scenarios in which the test data do not fully follow the distribution of the training data.
By learning from large-scale unlabeled samples, self-supervised learning has now become a new trend in deep learning.
We propose a novel Self-Supervised Graph Neural Network (SSG) to enable more effective inter-task information exchange and knowledge sharing.
arXiv Detail & Related papers (2022-04-08T03:37:56Z)