Fugu-MT 論文翻訳(概要): ESAinsTOD: A Unified End-to-End Schema-Aware Instruction-Tuning Framework for Task-Oriented Dialog Modeling

論文の概要: ESAinsTOD: A Unified End-to-End Schema-Aware Instruction-Tuning Framework for Task-Oriented Dialog Modeling

arxiv url: http://arxiv.org/abs/2603.09691v1
Date: Tue, 10 Mar 2026 13:59:02 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-11 15:25:24.355642
Title: ESAinsTOD: A Unified End-to-End Schema-Aware Instruction-Tuning Framework for Task-Oriented Dialog Modeling
Title（参考訳）: ESAinsTOD:タスク指向ダイアログモデリングのための統合エンドツーエンドスキーマ認識インストラクションチューニングフレームワーク
Authors: Dechuan Teng, Chunlin Lu, Libo Qin, Wanxiang Che,
Abstract要約: タスク指向ダイアログモデリングのためのエンド・ツー・エンド・エンド・エンド・インストラクション・チューニング・フレームワークであるESAinsTODを提案する。 LLM(Large Language Models)を微調整するだけでなく、さまざまな対話タスクフローやスキーマへの柔軟な適応を可能にします。 ESAinsTODは、エンドツーエンドのタスク指向ダイアログモデリングベンチマークにおいて、最先端モデルよりも優れていることを示す。
参考スコア（独自算出の注目度）: 44.73279314406119
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Existing end-to-end modeling methods for modular task-oriented dialog systems are typically tailored to specific datasets, making it challenging to adapt to new dialog scenarios. In this work, we propose ESAinsTOD, a unified End-to-end Schema-Aware Instruction-tuning framework for general Task-Oriented Dialog modeling. This framework introduces a structured methodology to go beyond simply fine-tuning Large Language Models (LLMs), enabling flexible adaptation to various dialogue task flows and schemas. Specifically, we leverage full-parameter fine-tuning of LLMs and introduce two alignment mechanisms to make the resulting system both instruction-aware and schema-aware: (i) instruction alignment, which ensures that the system faithfully follows task instructions to complete various task flows from heterogeneous TOD datasets; and (ii) schema alignment, which encourages the system to make predictions adhering to the specified schema. In addition, we employ session-level end-to-end modeling, which allows the system to access the results of previously executed task flows within the dialogue history, to bridge the gap between the instruction-tuning paradigm and the real-world application of TOD systems. Empirical results show that while a fine-tuned LLM serves as a strong baseline, our structured approach provides significant additional benefits. In particular, our findings indicate that: (i) ESAinsTOD outperforms state-of-the-art models by a significant margin on end-to-end task-oriented dialog modeling benchmarks: CamRest676, In-Car and MultiWOZ; (ii) more importantly, it exhibits superior generalization capabilities across various low-resource settings, with the proposed alignment mechanisms significantly enhancing zero-shot performance; and (iii) our instruction-tuning paradigm substantially improves the model's robustness against data noise and cascading errors.
Abstract（参考訳）: モジュラータスク指向のダイアログシステムのための既存のエンドツーエンドモデリング手法は、通常、特定のデータセットに合わせて設計されているため、新しいダイアログシナリオに適応することは困難である。本研究では,汎用タスク指向ダイアログモデリングのためのエンド・ツー・エンド・エンドのスキーマ・アウェア・インストラクション・チューニング・フレームワークであるESAinsTODを提案する。このフレームワークは、単に微調整された大規模言語モデル(LLM)を超えて、様々な対話タスクフローやスキーマへの柔軟な適応を可能にする構造化された方法論を導入する。具体的には、LLMのフルパラメータ微調整を活用し、2つのアライメント機構を導入し、命令認識とスキーマ認識の両方を実現させる。一不均質なTODデータセットから様々なタスクフローを完了させるためのタスク命令を忠実に従うことを保証する命令アライメント。 (ii)スキーマアライメントは、特定のスキーマに付着した予測をシステムに促す。さらに、セッションレベルのエンドツーエンドモデリングを用いて、対話履歴内で以前に実行されたタスクフローの結果にアクセスし、命令チューニングパラダイムとTODシステムの現実的応用とのギャップを埋める。実験の結果, 微調整LDMは強力なベースラインとして機能するが, 構造的アプローチは大きなメリットをもたらすことがわかった。特に,本研究の成果は以下のとおりである。 (i)ESAinsTODは、CamRest676、In-Car、MultiWOZといったエンドツーエンドのタスク指向ダイアログモデリングベンチマークにおいて、最先端モデルよりも優れた性能を発揮する。 (II)より重要なのは,提案したアライメント機構がゼロショット性能を大幅に向上させ,様々な低リソース設定にまたがる優れた一般化能力を示すことである。 3)命令チューニングのパラダイムは,データノイズやカスケードエラーに対するモデルの堅牢性を大幅に向上させる。

論文の概要: ESAinsTOD: A Unified End-to-End Schema-Aware Instruction-Tuning Framework for Task-Oriented Dialog Modeling

関連論文リスト