Fugu-MT 論文翻訳(概要): To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification

論文の概要: To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification

arxiv url: http://arxiv.org/abs/2605.10211v1
Date: Mon, 11 May 2026 08:55:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:50.670081
Title: To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification
Title（参考訳）: 再実行すべきか, 再実行すべきでないか? 局所的なLCMアプローチによる検証プロセスのプリヴィリエジ分類
Authors: Maik Larooij, David Graus,
Abstract要約: 我々は、FOIA Exemption 5の熟考プロセス特権のための自動感度分類を行うための先行研究に基づいて構築する。我々は、小型でローカルなモデルで、コンシューマグレードのハードウェアにデプロイ可能な感度分類を行う。評定として予測される文には、意見の表現を示す動詞がもっと多く含まれていることが分かる。
参考スコア（独自算出の注目度）: 0.2979301924660567
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Government transparency laws, like the Freedom of Information (FOIA) acts in the United States and United Kingdom, and the Woo (Open Government Act) in the Netherlands, grant citizens the right to directly request documents from the government. As these documents might contain sensitive information, such as personal information or threats to national security, the laws allow governments to redact sensitive parts of the documents prior to release. We build on prior research to perform automatic sensitivity classification for the FOIA Exemption 5 deliberative process privilege using Large Language Models (LLMs). However, processing documents not yet cleared for review via third-party cloud APIs is often legally or politically untenable. Therefore, in this work, we perform sensitivity classification with a small, local model, deployable on consumer-grade hardware (Qwen3.5 9B). We compare eight variants of applying LLMs for sentence classification, using well-known prompting techniques, and find that a combination of Chain-of-Thought prompting and few-shot prompting with error-based examples outperforms classification models of earlier work in terms of recall and F2 score. This method also closely approaches the performance of a widely-used, cost-efficient commercial model (Gemini 2.5 Flash). In an additional analysis, we find that sentences that are predicted as deliberative contain more verbs that indicate the expression of opinions, and are more often phrased in in first-person. Above all, deliberativeness seems characterized by the presence of a combination of multiple indicators, in particular the combination of first-person words with a verb for expressing opinion.
Abstract（参考訳）: 情報の自由(FOIA)やオランダのウー(オープン政府法)のような政府の透明性に関する法律は、市民に政府から直接文書を要求する権利を与える。これらの文書には、個人情報や国家安全保障への脅威などの機密情報が含まれている可能性があるため、政府は公開前に機密情報を修正することができる。我々は,大規模言語モデル (LLM) を用いたFOIA Exemption 5の熟考プロセス特権のための,自動感度分類を行うための先行研究を構築した。しかし、サードパーティのクラウドAPIによるレビューのための文書の処理は、多くの場合、法的または政治的に不可能である。そこで本研究では,コンシューマグレードハードウェア(Qwen3.5 9B)上に展開可能な,小型で局所的なモデルによる感度分類を行う。文分類に LLM を適用した8つの変種を比較し、よく知られたプロンプト手法を用いて、誤りベースの例を例に、チェーン・オブ・ソート・プロンプト(Chain-of-Thought prompting)と少数ショット・プロンプト( few-shot prompting)の組み合わせが、リコールとF2スコアの点で、以前の作業の分類モデルより優れていたことを発見した。この手法はまた、広く使われているコスト効率の高い商用モデル(Gemini 2.5 Flash)の性能に近づいた。追加的な分析では、熟考として予測される文には、意見の表現を示す多くの動詞が含まれており、多くの場合、一人称で言い換えられる。いずれにせよ、熟考性は複数の指標の組み合わせ、特に一対一の単語と意見を表現する動詞の組み合わせの存在によって特徴づけられる。

論文の概要: To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification

関連論文リスト