Fugu-MT 論文翻訳(概要): Exploring Lightweight Large Language Models for Court View Generation

論文の概要: Exploring Lightweight Large Language Models for Court View Generation

arxiv url: http://arxiv.org/abs/2605.16770v1
Date: Sat, 16 May 2026 02:44:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:47.014408
Title: Exploring Lightweight Large Language Models for Court View Generation
Title（参考訳）: コートビュー生成のための軽量大言語モデルの探索
Authors: Zhitian Hou, Tianyong Hao, Nanli Zeng, Zhixiong Chao, Kun Zeng,
Abstract要約: 刑事裁判所ビュージェネレーション(CVG)は、法律人工知能(Legal AI)における重要な課題である CVGにおける軽量(2B未満)大言語モデル(LLM)の機能とその電荷予測への影響について検討する。また、CVGタスク用の3つの公開データセットを含む評価フレームワークであるCVGEvalKitを開発し、料金を予測する。
参考スコア（独自算出の注目度）: 7.8505608893153065
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Criminal Court View Generation (CVG) is a critical task in Legal Artificial Intelligence (Legal AI), involving the generation of court view based on case facts. In this work, we systematically explore the capabilities of lightweight (smaller than 2B) large language models (LLMs) in CVG and their impact on charge prediction. Our study addresses four key questions: (1) how does different architecture of LLMs affect the CVG quality and charge prediction. (2) how does LLMs size contribute to the performance, (3) how do lightweight LLMs compare with Deep Neural Networks (DNNs) in these tasks, and (4) how does predicting charge by court view generation first compare with predicting it directly. Additionally, we also develop CVGEvalKit, an evaluation framework including three public available datasets for CVG tasks, as well as predicting their charges. Comprehensive experiments are conducted on this framework, where models are trained on a mixed training set and evaluated on each dataset's test set. Experimental results provide new insights into the trade-offs between model architecture, model size, and the influence between different tasks, highlighting the potential of lightweight LLMs in judicial AI applications. The source code is anonymously available at \url{https://github.com/ZhitianHou/CVGEvalKit}
Abstract（参考訳）: 刑事裁判所ビュー生成(英語: Criminal Court View Generation、CVG)は、法律人工知能(Legal AI)において重要な課題であり、事件事実に基づく裁判所ビューの生成を含む。本研究では,CVGにおける軽量(2B未満)大言語モデル(LLM)の機能とその電荷予測への影響を系統的に検討する。本研究は,LLMの異なるアーキテクチャがCVGの品質と電荷予測にどのように影響を与えるか,という4つの重要な疑問に対処する。 2) LLM のサイズはパフォーマンスにどのように貢献するか,(3) 軽量 LLM はこれらのタスクにおいてディープニューラルネットワーク (DNN) とどのように比較するか,(4) コートビュー生成による電荷の予測は直接予測と比較するか,などである。また、CVGタスク用の3つの公開データセットを含む評価フレームワークであるCVGEvalKitを開発し、料金を予測する。このフレームワークでは、モデルを混合トレーニングセットでトレーニングし、各データセットのテストセットで評価する。実験結果は、モデルアーキテクチャ間のトレードオフ、モデルサイズ、異なるタスク間の影響に関する新たな洞察を与え、司法AIアプリケーションにおける軽量LLMの可能性を強調します。ソースコードは匿名で \url{https://github.com/ZhitianHou/CVGEvalKit} で入手できる。

論文の概要: Exploring Lightweight Large Language Models for Court View Generation

関連論文リスト