Continuous Cartesian Genetic Programming based representation for
Multi-Objective Neural Architecture Search
- URL: http://arxiv.org/abs/2306.02648v1
- Date: Mon, 5 Jun 2023 07:32:47 GMT
- Title: Continuous Cartesian Genetic Programming based representation for
Multi-Objective Neural Architecture Search
- Authors: Cosijopii Garcia-Garcia and Alicia Morales-Reyes and Hugo Jair
Escalante
- Abstract summary: We propose a novel approach for designing less complex yet highly effective convolutional neural networks (CNNs)
Our approach combines real-based and block-chained CNNs representations based on cartesian genetic programming (CGP) for neural architecture search (NAS)
Two variants are introduced that differ in the granularity of the search space they consider.
- Score: 12.545742558041583
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We propose a novel approach for the challenge of designing less complex yet
highly effective convolutional neural networks (CNNs) through the use of
cartesian genetic programming (CGP) for neural architecture search (NAS). Our
approach combines real-based and block-chained CNNs representations based on
CGP for optimization in the continuous domain using multi-objective
evolutionary algorithms (MOEAs). Two variants are introduced that differ in the
granularity of the search space they consider. The proposed CGP-NASV1 and
CGP-NASV2 algorithms were evaluated using the non-dominated sorting genetic
algorithm II (NSGA-II) on the CIFAR-10 and CIFAR-100 datasets. The empirical
analysis was extended to assess the crossover operator from differential
evolution (DE), the multi-objective evolutionary algorithm based on
decomposition (MOEA/D) and S metric selection evolutionary multi-objective
algorithm (SMS-EMOA) using the same representation. Experimental results
demonstrate that our approach is competitive with state-of-the-art proposals in
terms of classification performance and model complexity.
Related papers
- Enhancing CNN Classification with Lamarckian Memetic Algorithms and Local Search [0.0]
We propose a novel approach integrating a two-stage training technique with population-based optimization algorithms incorporating local search capabilities.
Our experiments demonstrate that the proposed method outperforms state-of-the-art gradient-based techniques.
arXiv Detail & Related papers (2024-10-26T17:31:15Z) - Coevolution of Neural Architectures and Features for Stock Market
Forecasting: A Multi-objective Decision Perspective [0.0]
This paper proposes a new approach to identify a set of nondominated neural network models for further selection by the decision maker.
A new coevolution approach is proposed to simultaneously select the features and topology of neural networks.
The results on the NASDAQ index in pre and peri COVID time windows convincingly demonstrate that the proposed coevolution approach can evolve a set of nondominated neural forecasting models with better generalization capabilities.
arXiv Detail & Related papers (2023-11-23T15:12:30Z) - HKNAS: Classification of Hyperspectral Imagery Based on Hyper Kernel
Neural Architecture Search [104.45426861115972]
We propose to directly generate structural parameters by utilizing the specifically designed hyper kernels.
We obtain three kinds of networks to separately conduct pixel-level or image-level classifications with 1-D or 3-D convolutions.
A series of experiments on six public datasets demonstrate that the proposed methods achieve state-of-the-art results.
arXiv Detail & Related papers (2023-04-23T17:27:40Z) - Highlights of Semantics in Multi-objective Genetic Programming [0.0]
This research expands upon the current understanding of semantics in Genetic programming (GP) by proposing a new approach: Semantic-based Distance as an additional criteriOn (SDO)
Our work included an expansive analysis of the GP in terms of performance and diversity metrics, using two additional semantic-based approaches, namely Semantic Similarity-based Crossover ( SCC) and Semantic-based Crowding Distance (SCD)
Using highly-unbalanced binary classification datasets, we demonstrated that the newly proposed approach of SDO consistently generated more non-dominated solutions, with better diversity and improved hypervolume results.
arXiv Detail & Related papers (2022-06-10T11:49:45Z) - Transfer Learning Based Multi-Objective Evolutionary Algorithm for
Community Detection of Dynamic Complex Networks [1.693830041971135]
We propose a Feature Transfer Based Multi-Objective Optimization Algorithm (TMOGA) based on transfer learning and traditional multi-objective evolutionary algorithm framework.
We show that our algorithm can achieve better clustering effects compared with the state-of-the-art dynamic network community detection algorithms in diverse test problems.
arXiv Detail & Related papers (2021-09-30T17:16:51Z) - Pareto-wise Ranking Classifier for Multi-objective Evolutionary Neural
Architecture Search [15.454709248397208]
This study focuses on how to find feasible deep models under diverse design objectives.
We propose a classification-wise Pareto evolution approach for one-shot NAS, where an online classifier is trained to predict the dominance relationship between the candidate and constructed reference architectures.
We find a number of neural architectures with different model sizes ranging from 2M to 6M under diverse objectives and constraints.
arXiv Detail & Related papers (2021-09-14T13:28:07Z) - Redefining Neural Architecture Search of Heterogeneous Multi-Network
Models by Characterizing Variation Operators and Model Components [71.03032589756434]
We investigate the effect of different variation operators in a complex domain, that of multi-network heterogeneous neural models.
We characterize both the variation operators, according to their effect on the complexity and performance of the model; and the models, relying on diverse metrics which estimate the quality of the different parts composing it.
arXiv Detail & Related papers (2021-06-16T17:12:26Z) - Fractal Structure and Generalization Properties of Stochastic
Optimization Algorithms [71.62575565990502]
We prove that the generalization error of an optimization algorithm can be bounded on the complexity' of the fractal structure that underlies its generalization measure.
We further specialize our results to specific problems (e.g., linear/logistic regression, one hidden/layered neural networks) and algorithms.
arXiv Detail & Related papers (2021-06-09T08:05:36Z) - Progressive Spatio-Temporal Graph Convolutional Network for
Skeleton-Based Human Action Recognition [97.14064057840089]
We propose a method to automatically find a compact and problem-specific network for graph convolutional networks in a progressive manner.
Experimental results on two datasets for skeleton-based human action recognition indicate that the proposed method has competitive or even better classification performance.
arXiv Detail & Related papers (2020-11-11T09:57:49Z) - Iterative Algorithm Induced Deep-Unfolding Neural Networks: Precoding
Design for Multiuser MIMO Systems [59.804810122136345]
We propose a framework for deep-unfolding, where a general form of iterative algorithm induced deep-unfolding neural network (IAIDNN) is developed.
An efficient IAIDNN based on the structure of the classic weighted minimum mean-square error (WMMSE) iterative algorithm is developed.
We show that the proposed IAIDNN efficiently achieves the performance of the iterative WMMSE algorithm with reduced computational complexity.
arXiv Detail & Related papers (2020-06-15T02:57:57Z) - Binarizing MobileNet via Evolution-based Searching [66.94247681870125]
We propose a use of evolutionary search to facilitate the construction and training scheme when binarizing MobileNet.
Inspired by one-shot architecture search frameworks, we manipulate the idea of group convolution to design efficient 1-Bit Convolutional Neural Networks (CNNs)
Our objective is to come up with a tiny yet efficient binary neural architecture by exploring the best candidates of the group convolution.
arXiv Detail & Related papers (2020-05-13T13:25:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.