Fugu-MT Paper Translation (Abstract): BITS Pilani at SemEval-2026 Task 9: Structured Supervised Fine-Tuning with DPO Refinement for Polarization Detection

Paper overview: BITS Pilani at SemEval-2026 Task 9: Structured Supervised Fine-Tuning with DPO Refinement for Polarization Detection

arxiv url: http://arxiv.org/abs/2604.11121v1
Date: Mon, 13 Apr 2026 07:35:17 GMT
Status: Translation complete
System update date: 2026-04-14 20:13:16.402031
Title: BITS Pilani at SemEval-2026 Task 9: Structured Supervised Fine-Tuning with DPO Refinement for Polarization Detection
Authors: Atharva Gupta, Dhruv Kumar, Yash Sinha
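Abstract summary: We propose a two-stage approach for detecting political polarization in social media text. We fine-tune Qwen 2.5-7B-Instruct with LoRA using an interpretable slot-filling template. Experiments on the SemEval 2026 POLAR shared task dataset show that preference-based refinement improves accuracy and reduces false negatives without additional annotation.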
Reference score (independently computed attention score): 2.2588605422113606
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The POLAR SemEval-2026 Shared Task aims to detect online polarization and focuses on the classification and identification of multilingual, multicultural, and multi-event polarization. Accurate computational detection of online polarization is challenging due to nuanced rhetoric, implicit framing, and the high cost of human-in-the-loop annotation. Building on recent findings that contextual prompting enables large language models to function as strong polarization detectors, we present a two-stage approach for detecting political polarization in social media text that combines structured supervised fine-tuning with Direct Preference Optimization (DPO) refinement. We fine-tune Qwen 2.5-7B-Instruct with LoRA using an interpretable slot-filling template (target, claim type, manifestation checklist, and justification). We then apply DPO with automatically generated preference pairs to reduce costly false negatives. Experiments on the SemEval 2026 POLAR shared task dataset show that preference-based refinement both improves accuracy and decreases false negatives without extra annotation. On the English development set, DPO increases recall from 0.5085 to 0.7797 and improves macro-F1 by ~5 points.
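The abstract names four template slots and a DPO stage driven by automatically generated preference pairs, but it does not say how the pairs are built. The Python sketch below shows one plausible construction and is not the authors' implementation. Everything in it is an assumption made for illustration: the `SlotAnalysis` class, the `build_prompt` wording, the rendered slot layout, and the pairing rule (a gold polarized analysis is "chosen" and a false-negative model output is "rejected"). The `prompt`/`chosen`/`rejected` keys follow a common convention for preference datasets.

```python
from dataclasses import dataclass, field


@dataclass
class SlotAnalysis:
    """Hypothetical structured output; slot names are taken from the abstract."""
    target: str
    claim_type: str
    manifestations: list[str] = field(default_factory=list)
    justification: str = ""
    polarized: bool = False

    def render(self) -> str:
        # Serialize the slots into the text the model would be trained to emit.
        checklist = ", ".join(self.manifestations) if self.manifestations else "none"
        label = "polarized" if self.polarized else "not polarized"
        return (
            f"Target: {self.target}\n"
            f"Claim type: {self.claim_type}\n"
            f"Manifestations: {checklist}\n"
            f"Justification: {self.justification}\n"
            f"Label: {label}"
        )


def build_prompt(text: str) -> str:
    # Assumed prompt wording; the paper's actual prompt is not given in the abstract.
    return (
        "Analyze the following post for political polarization.\n"
        f"Post: {text}\n"
        "Analysis:"
    )


def make_preference_pair(text: str, gold: SlotAnalysis, predicted: SlotAnalysis):
    """Return a preference pair only when the prediction is a false negative."""
    if not gold.polarized or predicted.polarized:
        return None  # not a false negative, so no pair under this assumed rule
    return {
        "prompt": build_prompt(text),
        "chosen": gold.render(),
        "rejected": predicted.render(),
    }
```

Restricting pairs to false negatives is one way to aim the preference signal at recall, the metric the abstract reports improving. Other pairing rules would fit the abstract's description equally well.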

Related papers