Fugu-MT Paper Translation (Abstract): Measuring and Mitigating Persona Distortions from AI Writing Assistance

Paper overview: Measuring and Mitigating Persona Distortions from AI Writing Assistance

arxiv url: http://arxiv.org/abs/2604.22503v1
Date: Fri, 24 Apr 2026 12:31:11 GMT
Status: Translation complete
Last updated in system: 2026-04-27 15:36:26.453943
Title: Measuring and Mitigating Persona Distortions from AI Writing Assistance
Title (reference translation): Measuring and Mitigating Persona Distortions from AI Writing Assistance
Authors: Paul Röttger, Kobi Hackenburg, Hannah Rose Kirk, Christopher Summerfield,
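Abstract summary: Hundreds of millions of people use artificial intelligence (AI) for writing assistance. Here, we evaluate how AI writing assistance distorts writer personas: their perceived beliefs, personality, and identity.
Reference score (attention score computed by this site): 15.493624238165998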
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Hundreds of millions of people use artificial intelligence (AI) for writing assistance. Here, we evaluated how AI writing assistance distorts writer personas - their perceived beliefs, personality, and identity. In three large-scale experiments, writers (N=2,939) wrote political opinion paragraphs with and without AI assistance. Separate groups of readers (N=11,091) blindly evaluated these paragraphs across 29 socially salient dimensions of reader perception, spanning political opinion, writing quality, writer personality, emotions, and demographics. AI writing assistance produced persona distortions across all dimensions: with AI, writers seemed more opinionated, competent, and positive, and their perceived demographic profile shifted towards more privileged groups. Writers objected to many of the observed distortions, yet continued to prefer AI-assisted text even when made aware of them. We successfully mitigated objectionable persona distortions at the model level by training reward models on our experimental data (10,008 paragraphs, 2,903,596 ratings) to steer AI outputs towards faithful representation of writer stance. However, this came at a cost to user acceptance, suggesting an entanglement between desirable and undesirable properties of AI writing assistance that may be difficult to resolve. Together, our findings demonstrate that persona distortions from AI writing assistance are pervasive and persistent even under realistic conditions of human oversight, which carries implications for public discourse, trust, and democratic deliberation that scale with AI adoption.
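Abstract (reference translation): Hundreds of millions of people use artificial intelligence (AI) for writing assistance. Here, we evaluate how AI writing assistance distorts writer personas (their perceived beliefs, personality, and identity). In three large-scale experiments, writers (N=2,939) wrote political opinion paragraphs with and without AI assistance. Separate groups of readers (N=11,091) blindly rated these paragraphs on 29 socially salient dimensions of reader perception, spanning political opinion, writing quality, writer personality, emotions, and demographics. AI writing assistance produced persona distortions on every dimension: with AI, writers seemed more opinionated, competent, and positive, and their perceived demographic profile shifted toward more privileged groups. Writers objected to many of the observed distortions, yet still preferred the AI-assisted text even when made aware of them. By training reward models on our experimental data (10,008 paragraphs, 2,903,596 ratings) to steer AI outputs toward faithful representation of writer stance, we successfully mitigated objectionable persona distortions at the model level. However, this came at a cost to user acceptance, suggesting that desirable and undesirable properties of AI writing assistance are entangled in ways that may be difficult to resolve. Together, our findings show that persona distortions from AI writing assistance are pervasive and persist even under realistic conditions of human oversight, with implications for public discourse, trust, and democratic deliberation that scale with AI adoption.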

Related papers

Reactive Writers: How Co-Writing with AI Changes How We Engage with Ideas [19.820383662256308]
We show that engaging with AI suggestions becomes a central activity in the writing process. We call this shift Reactive Writing: an evaluation-first, suggestion-led writing practice.
Paper reference translation (metadata) (2026-03-11T03:40:48Z)
Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing [55.2480439325792]
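This study systematically evaluates 12 state-of-the-art AI text detectors using the AI-Polished-Text Evaluation dataset. Our findings show that detectors flag even minimally polished text as AI-generated, struggle to distinguish degrees of AI involvement, and exhibit biases against older and smaller models.
Paper reference translation (metadata) (2025-02-21T18:45:37Z)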
"It was 80% me, 20% AI": Seeking Authenticity in Co-Writing with Large Language Models [97.22914355737676]
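We examine whether and how writers want to preserve their authentic voice when co-writing with AI tools. This study elucidates conceptions of authenticity in human-AI co-creation. Readers' responses showed less concern about human-AI co-writing.
Paper reference translation (metadata) (2024-11-20T04:42:32Z)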
How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? [29.068596156140913]
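We examine how the level and type of AI assistance in the writing process affect people's perceptions of the writing. Our results suggest that disclosing AI assistance in the writing process, especially when the AI has helped generate new content, can lower average quality ratings.
Paper reference translation (metadata) (2024-10-06T16:45:33Z)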
Human Bias in the Face of AI: Examining Human Judgment Against Text Labeled as AI Generated [48.70176791365903]
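This study examines how bias shapes perceptions of AI-generated versus human-generated content. We investigated how human raters respond to labeled and unlabeled content.
Paper reference translation (metadata) (2024-09-29T04:31:45Z)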
The Great AI Witch Hunt: Reviewers Perception and (Mis)Conception of Generative AI in Research Writing [36.188062803005515]
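The use of generative AI (GenAI) in research writing is growing rapidly. It is unclear how peer reviewers perceive, or misperceive, AI-augmented manuscripts. Our findings suggest that AI-augmented writing improves readability, language diversity, and informativeness, but often lacks research details and reflective insights from the authors.
Paper reference translation (metadata) (2024-06-27T02:38:25Z)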
The Future of AI-Assisted Writing [0.0]
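We conduct a comparative user study of such tools through an information retrieval lens (pull and push). Our findings show that users welcome seamless AI assistance in their writing. Users also enjoyed collaborating with AI-assisted writing tools and did not feel a lack of ownership.
Paper reference translation (metadata) (2023-06-29T02:46:45Z)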
The Role of AI in Drug Discovery: Challenges, Opportunities, and Strategies [97.5153823429076]
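We review the benefits, challenges, and drawbacks of AI in this field. We also discuss data augmentation, the use of explainable AI, and the integration of AI with traditional experimental methods.
Paper reference translation (metadata) (2022-12-08T23:23:39Z)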
Can Machines Imitate Humans? Integrative Turing-like tests for Language and Vision Demonstrate a Narrowing Gap [56.611702960809644]
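We benchmark the ability of AI to imitate humans on three language tasks and three vision tasks. We then conducted 72,191 Turing-like tests with 1,916 humans and 10 AIs. Imitation ability showed minimal correlation with conventional AI performance metrics.
Paper reference translation (metadata) (2022-11-23T16:16:52Z)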

The related papers list is generated automatically from the titles and abstracts of papers on this site.

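This page shows information for the specified paper.
The operator of this site does not guarantee the quality of the site (including all information and translations) and accepts no responsibility for any outcome arising from its use (including all information and translations).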