Fugu-MT 論文翻訳(概要): Tutoring Large Language Models to be Domain-adaptive, Precise, and Safe

論文の概要: Tutoring Large Language Models to be Domain-adaptive, Precise, and Safe

arxiv url: http://arxiv.org/abs/2602.13860v1
Date: Sat, 14 Feb 2026 19:45:13 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-17 14:17:28.520199
Title: Tutoring Large Language Models to be Domain-adaptive, Precise, and Safe
Title（参考訳）: 大規模言語モデルをドメイン適応、正確、安全にチューニングする
Authors: Somnath Banerjee,
Abstract要約: 「責任インテリジェンス」は、大規模言語モデルの膨大な生成力を現実の展開の厳密な要求と整合させるために設計されたフレームワークである。本研究は, 技術的精度を確保するための領域適応, 敵の脆弱性を軽減するための倫理的厳格性, グローバルな傾きを促進するための文化的・マルチリンガルなアライメントの3つの相互接続スレッドをナビゲートする。
参考スコア（独自算出の注目度）: 1.455557560509524
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The overarching research direction of this work is the development of a ''Responsible Intelligence'' framework designed to reconcile the immense generative power of Large Language Models (LLMs) with the stringent requirements of real-world deployment. As these models become a transformative force in artificial intelligence, there is an urgent need to move beyond general-purpose architectures toward systems that are contextually aware, inherently safer, and deeply respectful of global cultural nuances. This research navigates three interconnected threads: domain adaptation to ensure technical precision, ethical rigor to mitigate adversarial vulnerabilities, and cultural/multilingual alignment to promote global inclusivity. The methodological trajectory moves from classical supervised adaptation for task-specific demands to decoding-time alignment for safety, finally leveraging human feedback and preference modeling to achieve sociolinguistic acuity.
Abstract（参考訳）: この研究の包括的な研究方向性は、大規模言語モデル(LLM)の膨大な生成力を現実の展開の厳密な要求と整合させるために設計された'責任知性'フレームワークの開発である。これらのモデルが人工知能の変革の力になるにつれて、文脈的に認識され、本質的に安全であり、グローバルな文化的ニュアンスを深く尊重するシステムへと、汎用アーキテクチャを超えて移行する必要がある。本研究は, 技術的精度を確保するための領域適応, 敵の脆弱性を軽減するための倫理的厳格性, グローバルな傾きを促進するための文化的・マルチリンガルなアライメントの3つの相互接続スレッドをナビゲートする。この手法は,古典的教師付き適応から安全のための復号時間アライメントへと移行し,最終的に人間のフィードバックと嗜好モデルを活用し,社会言語学的な目的を達成する。

論文の概要: Tutoring Large Language Models to be Domain-adaptive, Precise, and Safe

関連論文リスト