Fugu-MT 論文翻訳(概要): Slot Filling as a Reasoning Task for SpeechLLMs

論文の概要: Slot Filling as a Reasoning Task for SpeechLLMs

arxiv url: http://arxiv.org/abs/2510.19326v1
Date: Wed, 22 Oct 2025 07:39:56 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 03:08:15.320889
Title: Slot Filling as a Reasoning Task for SpeechLLMs
Title（参考訳）: 音声LLMの推論課題としてのスロットフィリング
Authors: Kadri Hacioglu, Manjunath K E, Andreas Stolcke,
Abstract要約: 本稿では,音声大言語モデル(speechLLM)への推論の統合を提案する。 LLMの最近の発展に触発されて、私たちはスロット充足タスクを複数の推論ステップに分解するためにチェーン・オブ・シント・フレームワークを使用します。
参考スコア（独自算出の注目度）: 10.898666440393896
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We propose integration of reasoning into speech large language models (speechLLMs) for the end-to-end slot-filling task. Inspired by the recent development of reasoning LLMs, we use a chain-of-thought framework to decompose the slot-filling task into multiple reasoning steps, create a reasoning dataset and apply the supervised fine-tuning strategy to a speechLLM. We distinguish between regular and reasoning speechLLMs and experiment with different types and sizes of LLMs as their text foundation models. We demonstrate performance improvements by introducing reasoning (intermediate) steps. However, we show that a reasoning textual LLM developed mainly for math, logic and coding domains might be inferior as a foundation model for a reasoning speechLLM. We further show that hybrid speechLLMs, built on a hybrid text foundation LLM and fine-tuned to preserve both direct and reasoning modes of operation, have better performance than those fine-tuned employing only one mode of operation.
Abstract（参考訳）: 本稿では,音声大言語モデル(speechLLM)への推論の統合を提案する。 LLMの最近の発展に触発されて、スロット充足タスクを複数の推論ステップに分解し、推論データセットを作成し、教師付き微調整戦略を音声LLMに適用する。我々は、正規言語と推論言語LLMを区別し、LLMの異なるタイプとサイズをテキスト基盤モデルとして実験する。推論(中間)ステップを導入することで,パフォーマンスの向上を実証する。しかし,主に数学,論理学,コーディング領域のために開発された推論テキスト LLM は,推論音声LLM の基礎モデルとして劣る可能性が示唆された。さらに、ハイブリッドテキスト基盤LLM上に構築され、操作の直接モードと推論モードの両方を保持するように微調整されたハイブリッド音声LLMは、1つの操作モードのみを使用する微調整よりも優れた性能を示す。

論文の概要: Slot Filling as a Reasoning Task for SpeechLLMs

関連論文リスト