Fugu-MT 論文翻訳(概要): ArPoMeme: An Annotated Arabic Multimodal Dataset for Political Ideology and Polarization

論文の概要: ArPoMeme: An Annotated Arabic Multimodal Dataset for Political Ideology and Polarization

arxiv url: http://arxiv.org/abs/2605.20967v1
Date: Wed, 20 May 2026 09:53:01 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-21 19:19:56.610489
Title: ArPoMeme: An Annotated Arabic Multimodal Dataset for Political Ideology and Polarization
Title（参考訳）: ArPoMeme: 政治イデオロギーと分極のための注釈付きアラビア語マルチモーダルデータセット
Authors: Wajdi Zaghouani, Kais Attia, Md. Rafiul Biswas, Fadhl Eryani,
Abstract要約: 本稿では,約7300のアラビア政治ミームの大規模データセットであるArPoMemeについて述べる。このデータセットは、これらのミームを制作し広める公開Facebookページやグループの自己識別に分類することで、アラビアのミームエコシステムの多様性を捉えている。得られたデータセットは、視覚的内容、テキストメッセージ、イデオロギー的指向を結びつけ、政治的対立、動員、ユーモアのきめ細かい分析を可能にする。
参考スコア（独自算出の注目度）: 1.0115311686147797
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Memes have become a prominent medium of political communication in the Arab world, reflecting how humor, imagery, and text interact to express ideological and cultural positions. Despite the centrality of memes to online political discourse, there is a lack of systematically curated resources for analyzing their multimodal and ideological dimensions in Arabic. This paper presents ArPoMeme, a large-scale dataset of approximately 7,300 Arabic political memes categorized by ideological orientation, including Leftist, Islamist, Pan-Arabist, and Satirical perspectives. The dataset captures the diversity of Arabic meme ecosystems by grounding classification in the self-identification of public Facebook pages and groups that produce and disseminate these memes. To ensure both scale and accuracy, we designed a semi-automated data collection pipeline combining Playwright-based Facebook scraping with Google Drive synchronization, followed by text extraction using the Qwen2.5-VL-7B vision language model. The extracted text was manually verified and annotated for three polarization dimensions: Us vs. Them framing, Hostility toward out-groups, and Calls to action. Annotation was conducted through a custom Streamlit-based interface supporting distributed labeling, real-time tracking, and version control. The resulting dataset links visual content, textual messages, and ideological orientation, enabling fine-grained analysis of political antagonism, mobilization, and humor. Quantitative analysis of the annotated corpus reveals strong asymmetries in antagonistic framing across ideological groups, with Islamist and satirical memes exhibiting the highest levels of hostility and mobilization cues. The dataset and the annotation tool offers a reproducible and publicly available resource for studying Arabic political discourse, multimodal ideology detection, and polarization dynamics.
Abstract（参考訳）: ミームは、イデオロギーや文化的な位置を表現するためにユーモア、イメージ、テキストがどのように相互作用するかを反映して、アラブ世界の政治的コミュニケーションの顕著な媒体となっている。オンラインの政治談話にミームが集中しているにもかかわらず、アラビア語の多様・イデオロギー的な側面を分析するための体系的な資源が不足している。本稿では、左派、イスラム主義、パン・アラビスト、サーティリカルといったイデオロギー的指向によって分類された約7,300のアラビアの政治ミームの大規模データセットであるArPomeについて述べる。このデータセットは、これらのミームを制作し広める公開Facebookページやグループの自己識別に分類することで、アラビアのミームエコシステムの多様性を捉えている。スケールと精度の両方を確保するため,PlaywrightベースのFacebookスクレイピングとGoogle Drive同期を組み合わせた半自動データ収集パイプラインを設計し,続いてQwen2.5-VL-7B視覚言語モデルを用いたテキスト抽出を行った。抽出したテキストは, Us vs. Them framing, Hostility to Out-groups, Calls to actionという3つの偏極ディメンションに対して手作業で検証,注釈付けされた。アノテーションは、分散ラベリング、リアルタイムトラッキング、バージョン管理をサポートするStreamlitベースのカスタムインターフェースを通じて実行された。得られたデータセットは、視覚的内容、テキストメッセージ、イデオロギー的指向を結びつけ、政治的対立、動員、ユーモアのきめ細かい分析を可能にする。注釈付きコーパスの定量的分析では、イデオロギーグループ間の対立的フレーミングにおいて強い非対称性が示され、イスラム主義や風刺的なミームは最も高い敵意と動員の手がかりを示す。このデータセットとアノテーションツールは、アラビアの政治談話、マルチモーダルイデオロギー検出、偏光ダイナミクスを研究するための再現性があり、公開されているリソースを提供する。

論文の概要: ArPoMeme: An Annotated Arabic Multimodal Dataset for Political Ideology and Polarization

関連論文リスト