Fugu-MT 論文翻訳(概要): Positive Alignment: Artificial Intelligence for Human Flourishing

論文の概要: Positive Alignment: Artificial Intelligence for Human Flourishing

arxiv url: http://arxiv.org/abs/2605.10310v1
Date: Mon, 11 May 2026 10:11:08 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:50.718821
Title: Positive Alignment: Artificial Intelligence for Human Flourishing
Title（参考訳）: ポジティブアライメント:人間の浮き彫りのための人工知能
Authors: Ruben Laukkonen, Seb Krier, Chloé Bakalar, Shamil Chandaria, Morten Kringelbach, Adam Elwood, Daniel Ford, Fernando Rosas, Maty Bohacek, Matija Franklin, Nenad Tomašev, Stephanie Chan, Verena Rieser, Roma Patel, Michael Levin, Arun Rao,
Abstract要約: 既存のアライメント研究は、安全と害の防止に関する懸念に支配されている。ポジティブアライメント(Positive Alignment)とは、人間と生態の繁栄を積極的に支援するAIシステムの開発である。
参考スコア（独自算出の注目度）: 36.70635562721606
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Existing alignment research is dominated by concerns about safety and preventing harm: safeguards, controllability, and compliance. This paradigm of alignment parallels early psychology's focus on mental illness: necessary but incomplete. What we call Positive Alignment is the development of AI systems that (i) actively support human and ecological flourishing in a pluralistic, polycentric, context-sensitive, and user-authored way while (ii) remaining safe and cooperative. It is a distinct and necessary agenda within AI alignment research. We argue that several existing failures of alignment (e.g., engagement hacking, loss of human autonomy, failures in truth-seeking, low epistemic humility, error correction, lack of diverse viewpoints, and being primarily reactive rather than proactive) may be better addressed through positive alignment, including cultivating virtues and maximizing human flourishing. We highlight a range of challenges, open questions, and technical directions (e.g., data filtering and upsampling, pre- and post-training, evaluations, collaborative value collection) for different phases of the LLM and agents lifecycle. We end with design principles for promoting disagreement and decentralization through contextual grounding, community customization, continual adaptation, and polycentric governance; that is, many legitimate centers of oversight rather than one institutional or moral chokepoint.
Abstract（参考訳）: 既存のアライメント研究は、安全と害の防止に関する懸念、すなわち保護、管理可能性、コンプライアンスに支配されている。このアライメントのパラダイムは、初期の心理学が精神疾患(必要だが不完全)に焦点を合わせるのと平行している。ポジティブアライメント(Positive Alignment)とは、AIシステムの開発である。 (i)多元的・多元的・文脈に敏感でユーザ権限のある方法での人的・生態的繁栄を積極的に支援する (二)安全で協力的なままである。これはAIアライメント研究において、明確にかつ必要な議題である。既存のアライメントの失敗(例えば、エンゲージメントハッキング、人間の自律性の喪失、真理探究における失敗、低い認識の謙虚さ、エラー修正、多様な視点の欠如、そして主に積極的な視点の欠如)は、人間の繁栄を最大化することを含むポジティブなアライメントによってよりうまく対処できる、と我々は主張する。 LLMとエージェントライフサイクルの異なるフェーズに対して、さまざまな課題、オープンな質問、技術的な方向性(例えば、データフィルタリングとアップサンプリング、事前および後トレーニング、評価、協調価値収集など)を強調します。我々は、文脈的基盤化、コミュニティのカスタマイズ、継続的な適応、多中心的なガバナンスを通じて、不一致と分散化を促進する設計原則、すなわち、1つの制度的または道徳的なチョークポイントよりも多くの正当な監視の中心を設計する。

論文の概要: Positive Alignment: Artificial Intelligence for Human Flourishing

関連論文リスト