Fugu-MT 論文翻訳(概要): Discovering Decoupled Functional Modules in Large Language Models

論文の概要: Discovering Decoupled Functional Modules in Large Language Models

arxiv url: http://arxiv.org/abs/2603.17823v1
Date: Wed, 18 Mar 2026 15:13:02 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-19 18:32:57.778741
Title: Discovering Decoupled Functional Modules in Large Language Models
Title（参考訳）: 大規模言語モデルにおける非結合機能モジュールの発見
Authors: Yanke Yu, Jin Li, Ying Sun, Ping Li, Zhefeng Wang, Yi Zheng,
Abstract要約: 大規模言語モデルは、どのように異なる関数をモジュールにまとめるかは、まだ明らかにされていない。本研究では,LLM全体の大きなニューロン群を同時にモジュールに分解する,教師なしLLMクロスレイヤー分子発見(ULCMOD)フレームワークを提案する。本稿では,新しい目的関数と効率的なイテレーティブ・デカップリング(IterD)アルゴリズムを提案する。
参考スコア（独自算出の注目度）: 21.534639190745978
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Understanding the internal functional organization of Large Language Models (LLMs) is crucial for improving their trustworthiness and performance. However, how LLMs organize different functions into modules remains highly unexplored. To bridge this gap, we formulate a functional module discovery problem and propose an Unsupervised LLM Cross-layer MOdule Discovery (ULCMOD) framework that simultaneously disentangles the large set of neurons in the entire LLM into modules while discovering the topics of input samples related to these modules. Our framework introduces a novel objective function and an efficient Iterative Decoupling (IterD) algorithm. Extensive experiments show that our method discovers high-quality, disentangled modules that capture more meaningful semantic information and achieve superior performance in various downstream tasks. Moreover, our qualitative analysis reveals that the discovered modules show semantic coherence, correspond to interpretable specializations, and a clear spatial and hierarchical organization within the LLM. Our work provides a novel tool for interpreting the functional modules of LLMs, filling a critical blank in LLM's interpretability research.
Abstract（参考訳）: 言語モデル(LLM)の内部機能組織を理解することは、信頼性とパフォーマンスを改善する上で非常に重要です。しかし、LLMがいかに異なる関数をモジュールに整理するかは、まだ明らかになっていない。このギャップを埋めるために,機能モジュール発見問題を定式化し,LLM全体の大きなニューロン集合をモジュールに同時にアンタングルし,これらのモジュールに関連する入力サンプルのトピックを発見できるunsupervised LLM Cross-layer Module Discovery (ULCMOD) フレームワークを提案する。本稿では,新しい目的関数と効率的なイテレーティブ・デカップリング(IterD)アルゴリズムを提案する。実験により,より意味のあるセマンティックな情報を収集し,様々な下流タスクにおいて優れた性能を実現する,高品質で不整合なモジュールが発見された。さらに,これらのモジュールは意味的コヒーレンスを示し,解釈可能な特殊化に対応し,LLM内の空間的・階層的構造が明確であることを明らかにする。我々の研究はLLMの機能的モジュールを解釈するための新しいツールを提供し、LLMの解釈可能性研究において重要な空白を埋める。

論文の概要: Discovering Decoupled Functional Modules in Large Language Models

関連論文リスト