Fugu-MT 論文翻訳(概要): LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding

論文の概要: LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding

arxiv url: http://arxiv.org/abs/2509.05657v2
Date: Tue, 09 Sep 2025 03:23:02 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-10 12:33:22.825284
Title: LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding
Title（参考訳）: LM-Searcher:統一数値エンコーディングによるLLMを用いたクロスドメインニューラルネットワーク探索
Authors: Yuxuan Hu, Jihao Liu, Ke Wang, Jinliang Zhen, Weikang Shi, Manyuan Zhang, Qi Dou, Rui Liu, Aojun Zhou, Hongsheng Li,
Abstract要約: LM-Searcherは、クロスドメインニューラルネットワーク最適化のための新しいフレームワークである。我々のアプローチの中心は、ニューラルネットワークのための普遍的な数値文字列表現であるNCodeである。我々のデータセットは、幅広いアーキテクチャとパフォーマンスのペアを含み、堅牢で伝達可能な学習を促進する。
参考スコア（独自算出の注目度）: 55.5535016040221
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent progress in Large Language Models (LLMs) has opened new avenues for solving complex optimization problems, including Neural Architecture Search (NAS). However, existing LLM-driven NAS approaches rely heavily on prompt engineering and domain-specific tuning, limiting their practicality and scalability across diverse tasks. In this work, we propose LM-Searcher, a novel framework that leverages LLMs for cross-domain neural architecture optimization without the need for extensive domain-specific adaptation. Central to our approach is NCode, a universal numerical string representation for neural architectures, which enables cross-domain architecture encoding and search. We also reformulate the NAS problem as a ranking task, training LLMs to select high-performing architectures from candidate pools using instruction-tuning samples derived from a novel pruning-based subspace sampling strategy. Our curated dataset, encompassing a wide range of architecture-performance pairs, encourages robust and transferable learning. Comprehensive experiments demonstrate that LM-Searcher achieves competitive performance in both in-domain (e.g., CNNs for image classification) and out-of-domain (e.g., LoRA configurations for segmentation and generation) tasks, establishing a new paradigm for flexible and generalizable LLM-based architecture search. The datasets and models will be released at https://github.com/Ashone3/LM-Searcher.
Abstract（参考訳）: 大規模言語モデル(LLM)の最近の進歩は、ニューラルネットワーク探索(NAS)を含む複雑な最適化問題を解決するための新たな道を開いた。しかし、既存のLLM駆動のNASアプローチは、様々なタスクにおける実用性とスケーラビリティを制限し、迅速なエンジニアリングとドメイン固有のチューニングに大きく依存している。本研究では,LLMをドメイン固有適応の必要なく,ドメイン間ニューラルネットワークの最適化に活用する新しいフレームワークであるLM-Searcherを提案する。我々のアプローチの中心は、ニューラルネットワークアーキテクチャのための普遍的な数値文字列表現であるNCodeで、ドメイン間アーキテクチャのエンコーディングと検索を可能にします。また、NAS問題をランキングタスクとして再構成し、新しいプルーニングに基づくサブスペースサンプリング戦略から導出した命令チューニングサンプルを用いて、候補プールから高い性能のアーキテクチャを選択するためのLCMを訓練する。我々のキュレートされたデータセットは、幅広いアーキテクチャとパフォーマンスのペアを含み、堅牢で伝達可能な学習を促進する。総合的な実験により、LM-Searcherは、領域内(画像分類用CNNなど)と領域外(セグメンテーションおよび生成用LoRA構成など)の両方で競合性能を達成し、柔軟性と一般化可能なLLMアーキテクチャ探索のための新しいパラダイムを確立した。データセットとモデルはhttps://github.com/Ashone3/LM-Searcher.comでリリースされる。

論文の概要: LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding

関連論文リスト