Fugu-MT 論文翻訳(概要): Have You Ever Seen Them? Entity-level Membership Inference through Interrogating Large Language Models

論文の概要: Have You Ever Seen Them? Entity-level Membership Inference through Interrogating Large Language Models

arxiv url: http://arxiv.org/abs/2606.23030v1
Date: Mon, 22 Jun 2026 08:40:43 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-26 21:22:44.119428
Title: Have You Ever Seen Them? Entity-level Membership Inference through Interrogating Large Language Models
Title（参考訳）: エンティティレベルのメンバシップ推論は,大規模言語モデルの解釈を通じて実現したか?
Authors: Yiran Zhu, Ziqi Yang,
Abstract要約: LLM(Large Language Models)は、プライバシの漏洩や著作権の遵守に関する懸念が高まっている。 LLMトレーニングに対象エンティティに関連する情報が使用されているかどうかを判定するエンティティレベルのメンバシップ推論を提案する。エンティティレベルのデータセットを構築し、最先端のサンプルレベルのラベルのみのメソッドをベースラインとしてエンティティレベルの設定に適用する。
参考スコア（独自算出の注目度）: 6.8669133150322255
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Language Models (LLMs) raise growing concerns about privacy leakage and copyright compliance. Membership inference is a key tool for assessing such risks, but existing studies mainly focus on whether specific samples or sample-based data units are used for training. We argue that LLMs exhibit a human-memory-like behavior: an LLM may not memorize a specific sample verbatim, yet it can accumulate and reveal knowledge about a real-world entity from scattered mentions. This analogy motivates us to examine whether an LLM can be interrogated like a human interviewee to reveal its exposure to entity-related information. Motivated by this question, we propose entity-level membership inference, which determines whether information related to a target entity is used in LLM training. We study this task in the practical label-only black-box setting, where only generated texts are observable. We formalize the task under clue, input, and model constraints, establish the necessary and sufficient conditions for its feasibility, and instantiate five interrogation strategies based on this formalization. The strategies use limited entity clues to construct prompts, elicit entity-related responses, and infer membership from semantic features among the generated texts. We construct entity-level datasets and adapt state-of-the-art sample-level label-only methods to the entity-level setting as baselines. Experiments on person entities show that our methods achieve AUC up to 0.97 and bring gains of 6.0%--17.5% in Balanced Accuracy over the best adapted baseline.
Abstract（参考訳）: LLM(Large Language Models)は、プライバシの漏洩や著作権の遵守に関する懸念が高まっている。会員推定はそのようなリスクを評価するための重要なツールであるが、既存の研究は主に、特定のサンプルまたはサンプルベースのデータユニットがトレーニングに使用されるかどうかに焦点を当てている。 LLMは特定のサンプルの動詞を記憶しないかもしれないが、実際の実体に関する知識を蓄積し、散在する言及から明らかにすることができる。このアナロジーは、LLMを人間インタビュアーのように尋問して、エンティティ関連情報への露出を明らかにすることができるかどうかを調査する動機となる。そこで本研究では,LLMトレーニングにおいて,対象エンティティに関連する情報が使用されているかどうかを判定するエンティティレベルのメンバシップ推論を提案する。我々は,このタスクを,生成したテキストのみを観測可能な,実用的なラベルのみのブラックボックス設定で研究する。我々は、手掛かり、入力、モデル制約の下でタスクを形式化し、その実現可能性に必要な十分な条件を確立し、この形式化に基づいて5つの尋問戦略をインスタンス化する。戦略では、限定されたエンティティヒントを使用してプロンプトを構築し、エンティティ関連の応答を抽出し、生成されたテキストのセマンティックな特徴からメンバシップを推測する。エンティティレベルのデータセットを構築し、最先端のサンプルレベルのラベルのみのメソッドをベースラインとしてエンティティレベルの設定に適用する。個人による実験では,AUCを最大0.97まで達成し,最適な適応ベースラインに対して6.0%--17.5%のバランス精度が得られた。

論文の概要: Have You Ever Seen Them? Entity-level Membership Inference through Interrogating Large Language Models

関連論文リスト