Fugu-MT 論文翻訳(概要): Ideology Prediction of German Political Texts

論文の概要: Ideology Prediction of German Political Texts

arxiv url: http://arxiv.org/abs/2605.14352v1
Date: Thu, 14 May 2026 04:25:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-15 21:45:34.615551
Title: Ideology Prediction of German Political Texts
Title（参考訳）: ドイツ政治文書のイデオロギー予測
Authors: Sinclair Schneider, Florian Steuber, Joao A. G. Schneider, Gabi Dreo Rodosek,
Abstract要約: 本稿では,テキストの政治的指向を連続的な左から右へのスペクトルに投影するトランスフォーマーモデルを提案する。この研究は、トランスフォーマーモデルが、世論調査のレベルにおいて、ドイツのニュースにおける政治的フレーミングを認識できることを示した。
参考スコア（独自算出の注目度）: 0.7349727826230863
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Elections represent a crucial milestone in a nation's ongoing development. To better understand the political rhetoric from various movements, ranging from left to right, we propose a transformer-based model capable of projecting the political orientation of a text on a continuous left-to-right spectrum, represented by a normalized scalar d between -1 and 1. This approach enables analysts to focus on specific segments of the political landscape, such as conservatives, while excluding liberal and far-right movements. Such a task can only be achieved with multiclass classifiers, provided that the desired orientation is incorporated within one of their predefined classes. To determine the most suitable foundation model among 13 candidate transformers for this task, we constructed four distinct corpora. One corpus comprised annotated plenary notes from the German Bundestag, while another was based on an official online decision-making tool, Wahl-O-Mat. The third corpus consisted of articles from 33 newspapers, each identified by its political orientation, and the fourth included 535,200 tweets from 597 members of the 20th and 21st German Bundestag. To mitigate overfitting, we used two distinct corpora for training and two for testing, respectively. For in-domain performance, DeBERTa-large achieved the highest F1 score F1=0.844 as well as for the X (Twitter) out-of-domain test ACC=0.864. Regarding the newspaper out-of-domain test, Gemma2-2B excelled (MAE = 0.172). This study demonstrates that transformer models can recognize political framing in German news at the level of public opinion polls. Our findings suggest that both the model architecture and the availability of domain-specific training data can be as influential as model size for estimating political bias. We discuss methodological limitations and outline directions for improving the robustness of bias measurement.
Abstract（参考訳）: 選挙は国家の継続的な発展における重要なマイルストーンである。左から右への様々な動きからの政治的レトリックをよりよく理解するために, テキストの政治的指向を連続的な左から右へのスペクトルに投影し, -1から1の間の正規化されたスカラーdで表現できるトランスフォーマーモデルを提案する。このアプローチは、リベラル派や極右派を除いたまま、保守派のような政治情勢の特定の部分に焦点を当てることを可能にする。このようなタスクは、所望の向きが事前に定義されたクラスの1つに組み込まれていることを前提として、マルチクラス分類器でのみ達成できる。この課題に対して,13の候補変換器のうち,最も適切な基礎モデルを決定するために,4つの異なるコーパスを構築した。 1つのコーパスはドイツ連邦議会の注釈付き文書で構成され、もう1つのコーパスは公式のオンライン意思決定ツールであるWahl-O-Matに基づいていた。第3のコーパスは33の新聞の記事で構成され、それぞれが政治的指向によって識別され、第4のツイートには第20および第21ドイツ連邦議会議員597人の535,200のツイートが含まれていた。オーバーフィッティングを緩和するために、トレーニングには2つの異なるコーパス、テストには2つのコーパスを使用しました。ドメイン内のパフォーマンスでは、DeBERTa-largeがF1=0.844、X (Twitter) out-of-domain test ACC=0.864を達成している。新聞のドメイン外試験では、Gemma2-2Bは優れている(MAE = 0.172)。この研究は、トランスフォーマーモデルが、世論調査のレベルにおいて、ドイツのニュースにおける政治的フレーミングを認識できることを示した。以上の結果から,モデルアーキテクチャとドメイン固有のトレーニングデータの可用性は,政治的バイアスを推定するためのモデルサイズと同じくらいの影響を及ぼす可能性が示唆された。本稿では,バイアス測定のロバスト性向上のための方法論的限界と方向性について論じる。

論文の概要: Ideology Prediction of German Political Texts

関連論文リスト