Fugu-MT 論文翻訳(概要): Contextual Multi-Objective Optimization: Rethinking Objectives in Frontier AI Systems

論文の概要: Contextual Multi-Objective Optimization: Rethinking Objectives in Frontier AI Systems

arxiv url: http://arxiv.org/abs/2605.03900v1
Date: Tue, 05 May 2026 15:55:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-06 19:35:44.015897
Title: Contextual Multi-Objective Optimization: Rethinking Objectives in Frontier AI Systems
Title（参考訳）: コンテキスト多目的最適化:フロンティアAIシステムにおける目的の再考
Authors: Jie Zhou, Qin Chen, Liang He,
Abstract要約: AIの失敗の多くは、単にスケールや能力の失敗ではなく、客観的な選択の失敗である、と私たちは主張する。この問題を,文体多目的最適化として定式化する。私たちのフレームワークは、候補アクション、客観的見積、アクティブ制約、利害関係者、不確実性、コンフリクト解決手順に対するコンテキスト依存の選択ルールとして、AIの振る舞いをモデル化します。
参考スコア（独自算出の注目度）: 20.864964136537772
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Frontier AI systems perform best in settings with clear, stable, and verifiable objectives, such as code generation, mathematical reasoning, games, and unit-test-driven tasks. They remain less reliable in open-ended settings, including scientific assistance, long-horizon agents, high-stakes advice, personalization, and tool use, where the relevant objective is ambiguous, context-dependent, delayed, or only partially observable. We argue that many such failures are not merely failures of scale or capability, but failures of objective selection: the system optimizes a locally visible signal while missing which objectives should govern the interaction. We formulate this problem as \emph{contextual multi-objective optimization}. In this setting, systems must consider multiple, context-dependent objectives, such as helpfulness, truthfulness, safety, privacy, calibration, non-manipulation, user preference, reversibility, and stakeholder impact, while determining which objectives are active, which are soft preferences, and which must function as hard or quasi-hard constraints. These examples are not intended as an exhaustive taxonomy: different domains and deployment settings may activate different objective dimensions and different conflict-resolution procedures. Our framework models AI behavior as a context-dependent choice rule over candidate actions, objective estimates, active constraints, stakeholders, uncertainty, and conflict-resolution procedures. We outline an implementation pathway based on decomposed objective representations, context-to-objective routing, hierarchical constraints, deliberative policy reasoning, controlled personalization, tool-use control, diagnostic evaluation, auditing, and post-deployment revision.
Abstract（参考訳）: フロンティアAIシステムは、コード生成、数学的推論、ゲーム、単体テスト駆動タスクなど、明確で安定した、検証可能な目的を持った設定でベストに機能する。科学的援助、長期のエージェント、高度なアドバイス、パーソナライゼーション、ツールの使用など、オープンエンドの環境では信頼性が低いままであり、関連する目的は曖昧で、文脈に依存し、遅れており、部分的に観察可能である。このような障害の多くは、単にスケールや能力の失敗ではなく、客観的選択の失敗である、と我々は主張する。この問題を 'emph{contextual multi-objective optimization} として定式化する。この設定では、システムは、有用性、真実性、安全性、プライバシ、キャリブレーション、非操作性、ユーザの嗜好、可逆性、ステークホルダーの影響など、複数のコンテキスト依存の目的を考慮し、どの目的がアクティブか、ソフトな選好であり、ハードまたは準ハードな制約として機能するかを判断する必要がある。異なるドメインとデプロイメント設定は、異なる客観的次元と異なるコンフリクト解決手順を活性化する可能性がある。私たちのフレームワークは、候補アクション、客観的見積、アクティブ制約、利害関係者、不確実性、コンフリクト解決手順に対するコンテキスト依存の選択ルールとして、AIの振る舞いをモデル化します。本稿では, 分割された客観的表現, コンテキストからオブジェクトへのルーティング, 階層的制約, 熟考的ポリシー推論, 制御されたパーソナライゼーション, ツール使用制御, 診断評価, 監査, デプロイ後のリビジョンに基づく実装経路について概説する。

論文の概要: Contextual Multi-Objective Optimization: Rethinking Objectives in Frontier AI Systems

関連論文リスト