Fugu-MT 論文翻訳(概要): Gradient-Based Program Synthesis with Neurally Interpreted Languages

論文の概要: Gradient-Based Program Synthesis with Neurally Interpreted Languages

arxiv url: http://arxiv.org/abs/2604.18907v1
Date: Mon, 20 Apr 2026 23:14:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-22 22:41:49.536473
Title: Gradient-Based Program Synthesis with Neurally Interpreted Languages
Title（参考訳）: ニューラルネットワークを用いたグラディエント型プログラム合成
Authors: Matthew V. Macfarlane, Clément Bonnet, Herke van Hoof, Levi H. S. Lelis,
Abstract要約: 本稿では,ニューラルネットワーク・インタプリタという遅延適応ネットワークアーキテクチャのインスタンスを開発する。独自のシンボリックなプログラミング言語をエンドツーエンドで学習する。その結果,離散言語の構成性とニューラルネットワークの勾配に基づくエンドツーエンド学習を組み合わせたモデルへの新たな道筋が確立された。
参考スコア（独自算出の注目度）: 19.734602525162106
License: http://creativecommons.org/licenses/by/4.0/
Abstract: A central challenge in program induction has long been the trade-off between symbolic and neural approaches. Symbolic methods offer compositional generalisation and data efficiency, yet their scalability is constrained by formalisms such as domain-specific languages (DSLs), which are labour-intensive to create and may not transfer to new domains. In contrast, neural networks flexibly learn from data but tend to generalise poorly in compositional and out-of-distribution settings. We bridge this divide with an instance of a Latent Adaptation Network architecture named Neural Language Interpreter (NLI), which learns its own discrete, symbolic-like programming language end-to-end. NLI autonomously discovers a vocabulary of primitive operations and uses a novel differentiable neural executor to interpret variable-length sequences of these primitives. This allows NLI to represent programs that are not bound to a constant number of computation steps, enabling it to solve more complex problems than those seen during training. To make these discrete, compositional program structures amenable to gradient-based optimisation, we employ the Gumbel-Softmax relaxation, enabling the entire model to be trained end-to-end. Crucially, this same differentiability enables powerful test-time adaptation. At inference, NLI's program inductor provides an initial program guess. This guess is then refined via gradient descent through the neural executor, enabling efficient search for the neural program that best explains the given data. We demonstrate that NLI outperforms in-context learning, test-time training, and continuous latent program networks on tasks that require combinatorial generalisation and rapid adaptation to unseen tasks. Our results establish a new path toward models that combine the compositionality of discrete languages with the gradient-based search and end-to-end learning of neural networks.
Abstract（参考訳）: プログラム誘導における中心的な課題は、長い間、象徴的アプローチとニューラルアプローチのトレードオフであった。シンボリック・メソッドは構成の一般化とデータ効率を提供するが、そのスケーラビリティはドメイン固有言語(DSL)のような形式主義によって制約される。対照的に、ニューラルネットワークはデータから柔軟に学習するが、構成やアウト・オブ・ディストリビューションの設定では一般化が不十分である。我々はこの分割を、ニューラルネットワークインタープリタ(NLI)と呼ばれる遅延適応ネットワークアーキテクチャの例で橋渡しする。 NLIは、プリミティブ操作の語彙を自律的に発見し、これらのプリミティブの可変長シーケンスを解釈するために、新しい微分可能なニューラルエグゼキュータを使用する。これにより、NLIは一定数の計算ステップに縛られないプログラムを表現でき、トレーニング中に見られるよりも複雑な問題を解くことができる。これらの離散的な構成的プログラム構造を勾配に基づく最適化に適合させるため、Gumbel-Softmax緩和を用いて、モデル全体をエンドツーエンドにトレーニングすることができる。重要なことに、この同じ差別化性は、強力なテスト時間適応を可能にする。推論において、NLIのプログラムインダクタは初期プログラム推定を提供する。この推測は、神経エグゼキュータを通して勾配降下によって洗練され、与えられたデータを最もよく説明する神経プログラムの効率的な探索を可能にします。 NLIは、組み合わせの一般化と予期せぬタスクへの迅速な適応を必要とするタスクにおいて、コンテキスト内学習、テストタイムトレーニング、継続的な潜伏プログラムネットワークよりも優れていることを示す。この結果から、離散言語の構成性と勾配に基づく探索とニューラルネットワークのエンドツーエンド学習を組み合わせたモデルへの新たな道が確立された。

論文の概要: Gradient-Based Program Synthesis with Neurally Interpreted Languages

関連論文リスト