Fugu-MT 論文翻訳(概要): LNE-Blocking: An Efficient Framework for Contamination Mitigation Evaluation on Large Language Models

論文の概要: LNE-Blocking: An Efficient Framework for Contamination Mitigation Evaluation on Large Language Models

arxiv url: http://arxiv.org/abs/2509.15218v1
Date: Thu, 18 Sep 2025 17:59:16 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-19 17:26:53.394931
Title: LNE-Blocking: An Efficient Framework for Contamination Mitigation Evaluation on Large Language Models
Title（参考訳）: LNE-Blocking:大規模言語モデルによる汚染緩和評価のための効率的なフレームワーク
Authors: Ruijie Hou, Yueyang Jiao, Hanxu Hu, Yingming Li, Wai Lam, Huajian Zhang, Hongyuan Lu,
Abstract要約: 我々は,潜在的に漏洩したデータセットを汚染する前にモデル性能を復元する新しいフレームワークである textbfLNE-Blocking を提案する。私たちのフレームワークは、モデルの性能を効率的に回復する最初のフレームワークです。
参考スコア（独自算出の注目度）: 42.94267844722955
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The problem of data contamination is now almost inevitable during the development of large language models (LLMs), with the training data commonly integrating those evaluation benchmarks even unintentionally. This problem subsequently makes it hard to benchmark LLMs fairly. Instead of constructing contamination-free datasets (quite hard), we propose a novel framework, \textbf{LNE-Blocking}, to restore model performance prior to contamination on potentially leaked datasets. Our framework consists of two components: contamination detection and disruption operation. For the prompt, the framework first uses the contamination detection method, \textbf{LNE}, to assess the extent of contamination in the model. Based on this, it adjusts the intensity of the disruption operation, \textbf{Blocking}, to elicit non-memorized responses from the model. Our framework is the first to efficiently restore the model's greedy decoding performance. This comes with a strong performance on multiple datasets with potential leakage risks, and it consistently achieves stable recovery results across different models and varying levels of data contamination. We release the code at https://github.com/RuijieH/LNE-Blocking to facilitate research.
Abstract（参考訳）: データ汚染の問題は、大規模言語モデル(LLM)の開発においてほぼ必然的に発生し、トレーニングデータは、これらの評価ベンチマークを通常、意図せずに統合する。この問題はその後、LSMを公平にベンチマークすることを困難にしている。汚染のないデータセットを構築する代わりに、潜在的に漏洩するデータセットに汚染される前にモデル性能を復元する新しいフレームワークである「textbf{LNE-Blocking}」を提案する。本フレームワークは, 汚染検出と破壊操作の2つのコンポーネントから構成される。プロンプトのために、このフレームワークはまず、モデルにおける汚染の程度を評価するために、汚染検出方法である \textbf{LNE} を使用する。これに基づいて、モデルから記憶されない応答を引き出すために、ディスラプション演算の強度である \textbf{Blocking} を調整する。私たちのフレームワークは、モデルの性能を効率的に回復する最初のフレームワークです。これは、潜在的なリークリスクのある複数のデータセットで強力なパフォーマンスを伴い、さまざまなモデルにわたる安定したリカバリ結果とさまざまなレベルのデータ汚染を一貫して達成する。我々は、研究を容易にするために、https://github.com/RuijieH/LNE-Blockingでコードをリリースします。

論文の概要: LNE-Blocking: An Efficient Framework for Contamination Mitigation Evaluation on Large Language Models

関連論文リスト