Fugu-MT 論文翻訳(概要): DACIP-RC: Domain Adaptive Continual Instruction Pre-Training via Reading Comprehension on Business Conversations

論文の概要: DACIP-RC: Domain Adaptive Continual Instruction Pre-Training via Reading Comprehension on Business Conversations

arxiv url: http://arxiv.org/abs/2510.08152v1
Date: Thu, 09 Oct 2025 12:35:20 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-10 17:54:15.069337
Title: DACIP-RC: Domain Adaptive Continual Instruction Pre-Training via Reading Comprehension on Business Conversations
Title（参考訳）: DACIP-RC:ビジネス会話の読解によるドメイン適応型継続的指導事前学習
Authors: Elena Khasanova, Harsh Saini, Md Tahmid Rahman Laskar, Xue-Yong Fu, Cheng Chen, Shashi Bhushan TN,
Abstract要約: 本稿では,DACIP-RC(Domain Adaptive Continual Instruction Pre-Training)を提案する。次世代の予測に依存する従来の事前学習アプローチとは異なり、DACIP-RCは会話書き起こしの読解を通じて多様なタスク命令と応答を生成する。実験により,DACIP-RCは広範囲のビジネス会話タスクにおいて,ゼロショットの一般化を著しく改善することが示された。
参考スコア（独自算出の注目度）: 12.671996818071817
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The rapid advancements in Large Language Models (LLMs) have enabled their adoption in real-world industrial scenarios for various natural language processing tasks. However, the high inference cost of large-scale LLMs makes their deployment impractical, necessitating the use of smaller models. Despite their efficiency, smaller LLMs lack robust zero-shot instruction-following capabilities across diverse domains, limiting their adaptability to dynamic user requirements. Traditional fine-tuning approaches exacerbate this issue by inducing catastrophic forgetting, reducing the model's generalization ability for unseen tasks. In this paper, we propose Domain Adaptive Continual Instruction Pre-Training via Reading Comprehension (DACIP-RC), a continual pre-training technique that enhances smaller LLMs' domain adaptability for business conversational tasks. Unlike conventional pre-training approaches that rely on next-token prediction, DACIP-RC generates diverse task instructions and responses via reading comprehension on conversation transcripts, enabling better instruction generalization. Our empirical evaluations demonstrate that DACIP-RC significantly improves zero-shot generalization across a wide range of business conversational tasks, including meeting summarization, action item generation, and call purpose identification. To the best of our knowledge, this is the first work to apply instruction pre-training on business conversational data, providing insights into how industries can leverage proprietary datasets for domain adaptation.
Abstract（参考訳）: LLM(Large Language Models)の急速な進歩により、様々な自然言語処理タスクの現実的な産業シナリオへの導入が可能になった。しかし、大規模LLMの高推論コストは、その展開を非現実的にし、より小さなモデルを使う必要がある。効率性にも拘わらず、小型のLLMは多様なドメインにまたがる堅牢なゼロショット命令追従機能を持たず、動的なユーザ要求への適応性を制限している。従来の微調整アプローチは、破滅的な忘れ込みを誘発し、目に見えないタスクに対するモデルの一般化能力を低下させることで、この問題を悪化させる。本稿では,ビジネス会話タスクにおけるLLMのドメイン適応性を向上する継続事前学習技術であるDACIP-RC(Domain Adaptive Continual Instruction Pre-Training)を提案する。次世代の予測に依存する従来の事前学習アプローチとは異なり、DACIP-RCは会話書き起こしの読解を通じて多様なタスク命令と応答を生成し、より良い命令一般化を可能にする。 DACIP-RCは、会議要約、アクションアイテム生成、呼び出し目的識別など、幅広いビジネス会話タスクにおいて、ゼロショットの一般化を著しく改善することを示す。私たちの知る限りでは、これはビジネスの会話データに事前トレーニングを適用するための最初の取り組みであり、業界がドメイン適応のために独自のデータセットをどのように活用できるかについての洞察を提供する。

論文の概要: DACIP-RC: Domain Adaptive Continual Instruction Pre-Training via Reading Comprehension on Business Conversations

関連論文リスト