Fugu-MT 論文翻訳(概要): What Is The Political Content in LLMs' Pre- and Post-Training Data?

論文の概要: What Is The Political Content in LLMs' Pre- and Post-Training Data?

arxiv url: http://arxiv.org/abs/2509.22367v1
Date: Fri, 26 Sep 2025 14:00:51 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-29 20:57:54.481272
Title: What Is The Political Content in LLMs' Pre- and Post-Training Data?
Title（参考訳）: LLMの事前・後データにおける政治内容とは?
Authors: Tanise Ceron, Dmitry Nikolaev, Dominik Stammbach, Debora Nozza,
Abstract要約: 完全オープンソースモデルであるOLMO2の事前学習コーパスと後学習コーパスの解析を行った。これらのコーパスから、我々は大きなランダムサンプルを描き、政治的指向のために自動的に文書を注釈付けし、それらのソースドメインとコンテンツを分析する。次に、トレーニングデータの政治的コンテンツが、特定の政策問題に対するモデルのスタンスとどのように関連しているかを評価する。
参考スコア（独自算出の注目度）: 12.72257058961811
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) are known to generate politically biased text, yet how such biases arise remains unclear. A crucial step toward answering this question is the analysis of training data, whose political content remains largely underexplored in current LLM research. To address this gap, we present in this paper an analysis of the pre- and post-training corpora of OLMO2, the largest fully open-source model released together with its complete dataset. From these corpora, we draw large random samples, automatically annotate documents for political orientation, and analyze their source domains and content. We then assess how political content in the training data correlates with models' stance on specific policy issues. Our analysis shows that left-leaning documents predominate across datasets, with pre-training corpora containing significantly more politically engaged content than post-training data. We also find that left- and right-leaning documents frame similar topics through distinct values and sources of legitimacy. Finally, the predominant stance in the training data strongly correlates with models' political biases when evaluated on policy issues. These findings underscore the need to integrate political content analysis into future data curation pipelines as well as in-depth documentation of filtering strategies for transparency.
Abstract（参考訳）: 大規模言語モデル (LLM) は、政治的に偏見のあるテキストを生成することが知られているが、そのような偏見がどのように生じるかは定かではない。この問題に対処するための重要なステップは、現在のLLM研究において、政治的内容がほとんど探索されていないトレーニングデータの分析である。このギャップに対処するため,本論文では,OLMO2の事前学習コーパスと後学習コーパスの分析を行った。これらのコーパスから、我々は大きなランダムサンプルを描き、政治的指向のために自動的に文書を注釈付けし、それらのソースドメインとコンテンツを分析する。次に、トレーニングデータの政治的コンテンツが、特定の政策問題に対するモデルのスタンスとどのように関連しているかを評価する。我々の分析によると、左利きの文書はデータセット間で優位であり、事前学習のコーパスには、ポストトレーニングデータよりもはるかに政治的に関与したコンテンツが含まれている。また、左と右の文書は、異なる値と正当性に基づいて類似のトピックを定式化していることもわかりました。最後に、トレーニングデータにおける主要なスタンスは、政策問題を評価する際のモデルの政治的偏見と強く相関する。これらの調査結果は、今後のデータキュレーションパイプラインに政治コンテンツ分析を統合することの必要性と、透明性のためのフィルタリング戦略の詳細なドキュメントの必要性を浮き彫りにしている。

論文の概要: What Is The Political Content in LLMs' Pre- and Post-Training Data?

関連論文リスト