Fugu-MT Paper Translation (Abstract): Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition

Paper summary: Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition

arxiv url: http://arxiv.org/abs/2604.07884v1
Date: Thu, 09 Apr 2026 06:52:03 GMT
Status: Translation complete
Last updated in system: 2026-04-10 18:34:05.747784
Title: Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition
Title (reference translation): Reinforcement-guided synthetic data generation for privacy-sensitive identity recognition
Authors: Xuemei Jia, Jiawei Du, Hui Wei, Jun Chen, Joey Tianyi Zhou, Zheng Wang
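Abstract summary: High-fidelity generative models are increasingly needed in privacy-sensitive scenarios. Limited data leads to poor generative models, which in turn fail to mitigate data scarcity. The authors propose a reinforcement-guided synthetic data generation framework that adapts general-domain generative priors to identity recognition tasks.
Reference score (independently computed attention level): 60.52810518437911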
License: http://creativecommons.org/licenses/by/4.0/
Abstract: High-fidelity generative models are increasingly needed in privacy-sensitive scenarios, where access to data is severely restricted due to regulatory and copyright constraints. This scarcity hampers model development--ironically, in settings where generative models are most needed to compensate for the lack of data. This creates a self-reinforcing challenge: limited data leads to poor generative models, which in turn fail to mitigate data scarcity. To break this cycle, we propose a reinforcement-guided synthetic data generation framework that adapts general-domain generative priors to privacy-sensitive identity recognition tasks. We first perform a cold-start adaptation to align a pretrained generator with the target domain, establishing semantic relevance and initial fidelity. Building on this foundation, we introduce a multi-objective reward that jointly optimizes semantic consistency, coverage diversity, and expression richness, guiding the generator to produce both realistic and task-effective samples. During downstream training, a dynamic sample selection mechanism further prioritizes high-utility synthetic samples, enabling adaptive data scaling and improved domain alignment. Extensive experiments on benchmark datasets demonstrate that our framework significantly improves both generation fidelity and classification accuracy, while also exhibiting strong generalization to novel categories in small-data regimes.
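Abstract (reference translation): In privacy-sensitive scenarios, access to data is severely restricted by regulatory and copyright constraints. Ironically, this scarcity hampers model development in exactly the settings where generative models are most needed to compensate for the lack of data. Limited data leads to poor generative models, which in turn fail to mitigate the data scarcity. To break this cycle, we propose a reinforcement-guided synthetic data generation framework that adapts general-domain generative priors to privacy-sensitive identity recognition tasks. We first perform a cold-start adaptation that aligns a pretrained generator with the target domain, establishing semantic relevance and initial fidelity. Building on this foundation, a multi-objective reward jointly optimizes semantic consistency, coverage diversity, and expression richness, guiding the generator to produce samples that are both realistic and task-effective. During downstream training, a dynamic sample selection mechanism further prioritizes high-utility synthetic samples, enabling adaptive data scaling and improved domain alignment. Extensive experiments on benchmark datasets show that the framework significantly improves both generation fidelity and classification accuracy, while also generalizing strongly to novel categories in small-data regimes.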
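The abstract names three reward terms (semantic consistency, coverage diversity, expression richness) and a dynamic sample selection step, but gives no formulas. As a rough illustration only, the sketch below assumes the terms are combined by a weighted sum and that selection keeps the highest-reward samples. The names `RewardWeights`, `combined_reward`, and `select_top_k`, the weights, and the scores are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RewardWeights:
    # Hypothetical weights; the abstract does not say how the terms are balanced.
    semantic: float = 1.0
    diversity: float = 1.0
    richness: float = 1.0


def combined_reward(semantic, diversity, richness, w=RewardWeights()):
    """Assumed scalarization: a weighted sum of three per-sample scores."""
    return w.semantic * semantic + w.diversity * diversity + w.richness * richness


def select_top_k(samples, rewards, k):
    """Assumed selection rule: keep the k samples with the highest reward."""
    # Sort by descending reward; ties are broken by original position.
    ranked = sorted(zip(rewards, range(len(samples))), key=lambda p: (-p[0], p[1]))
    return [samples[i] for _, i in ranked[:k]]


# Made-up per-sample scores: (semantic, diversity, richness).
scores = [(0.9, 0.2, 0.5), (0.4, 0.8, 0.9), (0.1, 0.1, 0.1)]
rewards = [combined_reward(*s) for s in scores]
kept = select_top_k(["img_a", "img_b", "img_c"], rewards, k=2)
```

A weighted sum is only the simplest way to scalarize several objectives. The paper may use a different combination, and its selection mechanism is described as dynamic, which a fixed top-k cut does not capture.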
