Fugu-MT 論文翻訳(概要): PoliLegalLM: A Technical Report on a Large Language Model for Political and Legal Affairs

論文の概要: PoliLegalLM: A Technical Report on a Large Language Model for Political and Legal Affairs

arxiv url: http://arxiv.org/abs/2604.17543v1
Date: Sun, 19 Apr 2026 17:06:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.577759
Title: PoliLegalLM: A Technical Report on a Large Language Model for Political and Legal Affairs
Title（参考訳）: PoliLegalLM:政治・法務における大規模言語モデルに関する技術報告
Authors: Yuting Huang, Yinghao Hu, Qian Xiao, Wenlin Zhong, Yiquan Wu, Taishi Zhou, Moke Chen, Changlong Sun, Kun Kuang, Fei Wu,
Abstract要約: PoliLegalLMは、政治的および法的応用に適したドメイン固有の大規模言語モデルである。我々は、大規模で高品質な法定コーパスを構築し、構造化後学習パイプラインを設計する。我々は、LawBench、LexEval、および実世界のデータセットPoliLegalを含む3つの代表的なベンチマークでPoliLegalLMを評価した。
参考スコア（独自算出の注目度）: 42.80108620217788
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) have achieved remarkable success in general-domain tasks, yet their direct application to the legal domain remains challenging due to hallucinated legal citations, incomplete knowledge coverage, and weak structured reasoning. To address these issues, we propose PoliLegalLM, a domain-specific large language model tailored for political and legal applications. Our approach adopts a unified training framework that integrates continued pretraining, progressive supervised fine-tuning, and preference-based reinforcement learning to jointly enhance legal knowledge grounding, task alignment, and reasoning capability. We construct a large-scale, high-quality legal corpus and design a structured post-training pipeline, enabling the model to effectively learn domain-specific knowledge and adapt to diverse legal tasks. We evaluate PoliLegalLM on three representative benchmarks, including LawBench, LexEval, and a real-world dataset, PoliLegal. Experimental results demonstrate that PoliLegalLM achieves strong and consistent performance, outperforming competitive models of similar scale and remaining highly competitive with significantly larger models, while achieving the best results on real-world legal scenarios. These results highlight the effectiveness of our training paradigm and the practical value of domain-specific LLMs for real-world legal applications.
Abstract（参考訳）: 大規模言語モデル(LLM)は、一般分野のタスクにおいて顕著な成功を収めてきたが、法域への直接的な適用は、法的な引用、不完全な知識のカバレッジ、弱い構造的推論のため、依然として困難である。これらの問題に対処するため,政治・法的応用に適したドメイン固有の大規模言語モデルであるPoliLegalLMを提案する。本手法では, 継続的事前学習, プログレッシブ・教師付き微調整, 嗜好に基づく強化学習を統合し, 法的知識基盤, タスクアライメント, 推論能力の強化を図る。我々は、大規模で高品質な法定コーパスを構築し、構造化後訓練パイプラインを設計し、モデルがドメイン固有の知識を効果的に学習し、多様な法的タスクに適応できるようにする。我々は、LawBench、LexEval、および実世界のデータセットPoliLegalを含む3つの代表的なベンチマークでPoliLegalLMを評価した。実験結果から,PoliLegalLMはより強靭で一貫した性能を達成し,類似のスケールの競争モデルよりも優れ,より大規模なモデルと高い競争力を保ちながら,現実の法的シナリオにおいて最高の結果を得ることができた。これらの結果から,現実の法的応用におけるトレーニングパラダイムの有効性と,ドメイン固有LLMの実用的価値が明らかとなった。

論文の概要: PoliLegalLM: A Technical Report on a Large Language Model for Political and Legal Affairs

関連論文リスト