Fugu-MT 論文翻訳(概要): Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures

論文の概要: Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures

arxiv url: http://arxiv.org/abs/2603.18729v1
Date: Thu, 19 Mar 2026 10:24:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-20 17:19:06.086484
Title: Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures
Title（参考訳）: 単一および多エージェントな生成AIアーキテクチャにおける言語ステレオタイプの解析
Authors: Martina Ullasci, Marco Rondina, Riccardo Coppola, Flavio Giobergia, Riccardo Bellanca, Gabriele Mancari Pasi, Luca Prato, Federico Spinoso, Silvia Tagliente,
Abstract要約: 本稿では,LLM出力における方言感受性ステレオタイプ生成の既存解析を再現する。我々は8つのプロンプトテンプレートを定義し、方言バイアスが現れる様々な方法を分析する。 LLM-as-judge 法を用いて結果のバイアスを評価する。
参考スコア（独自算出の注目度）: 4.112801542332816
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Many works in the literature show that LLM outputs exhibit discriminatory behaviour, triggering stereotype-based inferences based on the dialect in which the inputs are written. This bias has been shown to be particularly pronounced when the same inputs are provided to LLMs in Standard American English (SAE) and African-American English (AAE). In this paper, we replicate existing analyses of dialect-sensitive stereotype generation in LLM outputs and investigate the effects of mitigation strategies, including prompt engineering (role-based and Chain-Of-Thought prompting) and multi-agent architectures composed of generate-critique-revise models. We define eight prompt templates to analyse different ways in which dialect bias can manifest, such as suggested names, jobs, and adjectives for SAE or AAE speakers. We use an LLM-as-judge approach to evaluate the bias in the results. Our results show that stereotype-bearing differences emerge between SAE- and AAE-related outputs across all template categories, with the strongest effects observed in adjective and job attribution. Baseline disparities vary substantially by model, with the largest SAE-AAE differential observed in Claude Haiku and the smallest in Phi-4 Mini. Chain-Of-Thought prompting proved to be an effective mitigation strategy for Claude Haiku, whereas the use of a multi-agent architecture ensured consistent mitigation across all the models. These findings suggest that for intersectionality-informed software engineering, fairness evaluation should include model-specific validation of mitigation strategies, and workflow-level controls (e.g., agentic architectures involving critique models) in high-impact LLM deployments. The current results are exploratory in nature and limited in scope, but can lead to extensions and replications by increasing the dataset size and applying the procedure to different languages or dialects.
Abstract（参考訳）: 文献における多くの研究は、LLM出力が識別行動を示し、入力が書かれた方言に基づいてステレオタイプベースの推論を誘導していることを示している。このバイアスは、標準アメリカ英語(SAE)とアフリカ系アメリカ人英語(AAE)でLLMに同じ入力が提供されるときに特に顕著である。本稿では, LLM出力における方言感応ステレオタイプ生成の既存解析を再現し, 生成批判・修正モデルからなる素早い工学的手法(ロールベース, 連鎖-Of-Thoughtプロンプト)やマルチエージェントアーキテクチャなど, 緩和戦略の効果について検討する。 SAE話者やAE話者に提案される名前、職名、形容詞など、方言バイアスが現れる様々な方法を分析するために、8つのプロンプトテンプレートを定義した。 LLM-as-judge 法を用いて結果のバイアスを評価する。以上の結果から,SAE-とAE-関連アウトプットの立体的差異は,すべてのテンプレートカテゴリーにおいて出現し,形容詞的および職業的属性において最も強い影響が認められた。ベースラインの差異はモデルによって大きく異なり、クロード・ハイクでは最大のSAE-AAE差が観測され、Phi-4 Miniでは最小の差が観測された。 Chain-Of-ThoughtプロンプトはClaude Haikuにとって効果的な緩和戦略であることが証明された。これらの結果から, 交差性インフォームドソフトウェアエンジニアリングにおいて, 公平性評価には, 緩和戦略のモデル固有の検証, ワークフローレベルの制御(例えば, 批判モデルを含むエージェントアーキテクチャ)を, 高度にインパクトのあるLCMデプロイメントに含めるべきであることが示唆された。現在の結果は自然界で探索的であり、範囲は限られているが、データセットのサイズを拡大し、異なる言語や方言にプロシージャを適用することにより、拡張やレプリケーションにつながる可能性がある。

論文の概要: Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures

関連論文リスト