Fugu-MT 論文翻訳(概要): ZARTS: On Zero-order Optimization for Neural Architecture Search

論文の概要: ZARTS: On Zero-order Optimization for Neural Architecture Search

arxiv url: http://arxiv.org/abs/2110.04743v1
Date: Sun, 10 Oct 2021 09:35:15 GMT
ステータス: 翻訳完了
システム内更新日: 2021-10-12 18:08:12.890585
Title: ZARTS: On Zero-order Optimization for Neural Architecture Search
Title（参考訳）: ZARTS: ニューラルネットワーク探索のためのゼロオーダー最適化について
Authors: Xiaoxing Wang, Wenxuan Guo, Junchi Yan, Jianlin Su, Xiaokang Yang
Abstract要約: 微分可能なアーキテクチャサーチ (DARTS) は、NASの高効率性のため、一般的なワンショットパラダイムである。この作業はゼロオーダーの最適化に変わり、上記の近似を強制せずに探索するための新しいNASスキームであるZARTSを提案する。特に、12ベンチマークの結果は、DARTSの性能が低下するZARTSの顕著な堅牢性を検証する。
参考スコア（独自算出の注目度）: 94.41017048659664
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Differentiable architecture search (DARTS) has been a popular one-shot paradigm for NAS due to its high efficiency. It introduces trainable architecture parameters to represent the importance of candidate operations and proposes first/second-order approximation to estimate their gradients, making it possible to solve NAS by gradient descent algorithm. However, our in-depth empirical results show that the approximation will often distort the loss landscape, leading to the biased objective to optimize and in turn inaccurate gradient estimation for architecture parameters. This work turns to zero-order optimization and proposes a novel NAS scheme, called ZARTS, to search without enforcing the above approximation. Specifically, three representative zero-order optimization methods are introduced: RS, MGS, and GLD, among which MGS performs best by balancing the accuracy and speed. Moreover, we explore the connections between RS/MGS and gradient descent algorithm and show that our ZARTS can be seen as a robust gradient-free counterpart to DARTS. Extensive experiments on multiple datasets and search spaces show the remarkable performance of our method. In particular, results on 12 benchmarks verify the outstanding robustness of ZARTS, where the performance of DARTS collapses due to its known instability issue. Also, we search on the search space of DARTS to compare with peer methods, and our discovered architecture achieves 97.54% accuracy on CIFAR-10 and 75.7% top-1 accuracy on ImageNet, which are state-of-the-art performance.
Abstract（参考訳）: 微分可能なアーキテクチャサーチ (DARTS) は、NASの高効率性のため、一般的なワンショットパラダイムである。候補演算の重要性を表現するためにトレーニング可能なアーキテクチャパラメータを導入し、勾配を推定する一階・二階近似を提案し、勾配降下アルゴリズムによりNASを解くことができる。しかし、我々の詳細な実験結果から、近似はしばしば損失景観を歪ませ、アーキテクチャパラメータの最適化と不正確な勾配推定のバイアスが生じることが示されている。この作業はゼロオーダーの最適化に変わり、上記の近似を強制せずに探索するための新しいNASスキームであるZARTSを提案する。具体的には, RS, MGS, GLDの3つの代表的ゼロオーダー最適化手法を導入し, MGSの精度と速度のバランスをとる。さらに、RS/MGSと勾配降下アルゴリズムの関連性について検討し、我々のZARTSがDARTSとは無関係であることを示す。複数のデータセットと検索空間に対する大規模な実験により,本手法の顕著な性能が示された。特に、12のベンチマークの結果は、既知の不安定性の問題によりdartのパフォーマンスが崩壊するzartsの優れた堅牢性を検証する。また、dartの検索空間を検索してピアメソッドと比較し、発見されたアーキテクチャは、cifar-10で97.54%、imagenetで75.7%のtop-1精度を達成しています。

論文の概要: ZARTS: On Zero-order Optimization for Neural Architecture Search

関連論文リスト