Fugu-MT Paper Translation (Abstract): 'AI Alignment' Encompasses Competing Technical Priorities

Paper abstract: 'AI Alignment' Encompasses Competing Technical Priorities

arxiv url: http://arxiv.org/abs/2606.14315v1
Date: Fri, 12 Jun 2026 09:56:01 GMT
Status: Translation complete
Last updated in system: 2026-06-15 16:00:42.861518
Title: 'AI Alignment' Encompasses Competing Technical Priorities
Title (reference translation): 'AI Alignment' Encompasses Competing Technical Priorities
Authors: Tushita Jha, Rory Svarc, Mateusz Bagiński
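Abstract summary: The literature contains many distinct concepts that fall under the heading of 'AI alignment'. The authors claim that realistic interventions may promote 'AI alignment' under one conception while being actively counterproductive from the perspective of others.
Reference score (independently calculated attention score): 0.0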
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The ML literature contains many distinct concepts falling under the heading of 'AI alignment'. After noting three concepts of AI alignment in the context of their corresponding research programs, we claim that realistic interventions may promote 'AI alignment' under one conception while being actively counterproductive from the perspective of others. We suggest that tensions between alignment ideals emerge due to differences in background threat-models, alongside differences in normative orientations. In light of our analysis, researchers aiming to further the goal of 'AI alignment' should do five things. First, they should not conflate distinctions of policy and distinctions of scientific scope; second, methodological disagreements should be acknowledged explicitly; third, researchers should distinguish between 'AI alignment' as a high-level ideal and specific 'alignment proxies' used in empirical research; fourth, they should use more granular concepts to identify both the source and nature of possible AI harms/benefits; fifth, they should explicitly acknowledge the diversity of 'alignment' concepts in both empirical work and in communication with non-technical audiences.
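Abstract (reference translation): The ML literature contains many distinct concepts that fall under the heading of 'AI alignment'. After describing three concepts of AI alignment in the context of their research programs, the authors claim that realistic interventions may promote 'AI alignment' under one conception while being actively counterproductive from the perspective of others. They suggest that tensions between alignment ideals arise from differences in background threat models and in normative orientations. In light of this analysis, researchers aiming to further the goal of 'AI alignment' should do five things: first, avoid conflating distinctions of policy with distinctions of scientific scope; second, acknowledge methodological disagreements explicitly; third, distinguish 'AI alignment' as a high-level ideal from the specific 'alignment proxies' used in empirical research; fourth, use more granular concepts to identify both the source and the nature of possible AI harms and benefits; fifth, explicitly acknowledge the diversity of 'alignment' concepts in both empirical work and communication with non-technical audiences.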
