Fugu-MT 論文翻訳(概要): Recognition Without Authorization: LLMs and the Moral Order of Online Advice

論文の概要: Recognition Without Authorization: LLMs and the Moral Order of Online Advice

arxiv url: http://arxiv.org/abs/2604.22143v1
Date: Fri, 24 Apr 2026 01:19:39 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-27 15:36:26.302533
Title: Recognition Without Authorization: LLMs and the Moral Order of Online Advice
Title（参考訳）: 認可なしの認識: LLMsとオンラインアドバイスのモラルオーダー
Authors: Tom van Nuenen,
Abstract要約: この記事では、r/relationship_adviceの11,565の投稿に対して、4つのアシスタントスタイルのLLMとコミュニティが推奨するアドバイスを比較します。モデル全体では、LLMは人間のコメンテーターと同じダイナミクスの多くを識別するが、その認識を行動の指示的な承認に変換する可能性は著しく低い。この記事では、モデルのばらつきは、技術的なエラーから、標準化されたアシスタントの規範が道徳的世界と遭遇したときにフラットになるものを見る方法に書き換えることができる、と論じている。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models are increasingly used to mediate everyday interpersonal dilemmas, yet how their advisory defaults interact with the concentrated moral orders of specific communities remains poorly understood. This article compares four assistant-style LLMs with community-endorsed advice on 11,565 posts from r/relationship_advice, using the subreddit as a concentrated, vote-ratified moral formation whose prescriptive clarity makes divergence measurable. Across models, LLMs identify many of the same dynamics as human commenters, but are markedly less likely to convert that recognition into directive authorization for action. The gap is sharpest where community consensus is strongest: on high-consensus posts involving abuse or safety threats, models recommend exit at roughly half the human rate while maintaining elevated levels of hedging, validation, and therapeutic framing. The article describes this pattern as recognition without authorization: the capacity to register harm while withholding socially ratified permission for consequential action. This divergence is not incidental but structural: a portable advisory style that remains validating, risk-averse, and weakly directive across contexts. Safety alignment is one plausible contributor to this pattern, alongside training-data averaging and broader assistant design. The article argues that model divergence can be reframed from a technical error to a way of seeing what standardized assistant norms flatten when they encounter situated moral worlds.
Abstract（参考訳）: 大きな言語モデルは、日々の対人関係のジレンマを仲介するためにますます使われてきているが、彼らのアドバイザリのデフォルトが特定のコミュニティの集中した道徳的秩序とどのように相互作用するかは、いまだに理解されていない。本稿では,4つの補助的 LLM と,r/relationship_advice の 11,565 の投稿に対するアドバイスを比較検討する。モデル全体では、LLMは人間のコメンテーターと同じダイナミクスの多くを識別するが、その認識を行動の指示的な承認に変換する可能性は著しく低い。このギャップは、コミュニティのコンセンサスが最も強く、乱用や安全上の脅威を含む高合意の投稿において、モデルは、ヘッジ、バリデーション、治療フレーミングのレベルを高く保ちながら、人間の約半分のレートで退避することを推奨している。この記事は、このパターンを無許可の認識として記述している: 社会的に承認された社会的行為の許可を保ちながら害を登録する能力。この分散は偶発的ではなく構造的であり、検証、リスク回避、コンテキスト横断の弱い指示を継続するポータブルなアドバイザリスタイルである。安全性のアライメントは、トレーニングデータ平均化とより広範なアシスタント設計とともに、このパターンへのもっともらしい貢献の1つです。この記事では、モデルのばらつきは、技術的なエラーから、標準化されたアシスタントの規範が道徳的世界と遭遇したときにフラットになるものを見る方法に書き換えることができる、と論じている。

論文の概要: Recognition Without Authorization: LLMs and the Moral Order of Online Advice

関連論文リスト