Fugu-MT 論文翻訳(概要): Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models

論文の概要: Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models

arxiv url: http://arxiv.org/abs/2605.18132v1
Date: Mon, 18 May 2026 09:36:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:49.229502
Title: Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models
Title（参考訳）: この3Dアセットを誰が生成したか : 生成3Dモデルのためのソース属性の学習
Authors: Sihan Ma, Siyuan Liang, Dacheng Tao,
Abstract要約: 生成可能な3Dモデルは、ゲーム、ロボティクス、没入型生成にデプロイされる。どの生成モデルが生成したのかを特定できますか。この問題は、分散属性信号と現実的なデプロイメント制約という、2つの主要な課題に直面している。
参考スコア（独自算出の注目度）: 66.3899679538548
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generative 3D models are deployed in gaming, robotics, and immersive creation, making source attribution critical: given a 3D asset, can we identify whether and which generative model created it? This problem faces two core challenges: dispersed attribution signals, where 3D fingerprints are distributed across multi-view, geometric, and frequency-domain cues; and realistic deployment constraints, where scarce labels, degraded prompts, and mixed real/synthetic assets undermine attribution reliability. To systematically study this problem, we construct, to the best of our knowledge, the first passive source attribution benchmark for modern generated assets, covering 22 representative 3D generators under standard, few-shot, and realistic deployment protocols. Based on this benchmark, we find that generative 3D models leave two types of stable fingerprints: cross-view inconsistency and structural artifacts reflected in geometric statistics and frequency-domain cues. To capture these dispersed signals, we propose a hierarchical multi-view multi-modal Transformer that fuses appearance, geometric, and frequency-domain features within each view and models global relationships across views. Extensive experiments demonstrate strong performance, achieving 97.22% accuracy under full supervision and 77.17% accuracy with only 1% training data, corresponding to fewer than five samples per generator. These results show that modern 3D generators leave stable and attributable fingerprints, establishing a new benchmark and methodological foundation for trustworthy 3D content provenance.
Abstract（参考訳）: 生成型3Dモデルは、ゲーム、ロボティクス、没入型生成にデプロイされ、ソース属性が重要になる。この問題は、分散帰属信号、多視点、幾何学的、周波数領域のキューに3D指紋が分散される、現実的な配置制約、ラベルの不足、劣化したプロンプト、混合現実/合成資産が帰属の信頼性を損なう、という2つの主要な課題に直面している。この問題を体系的に研究するために、我々は、我々の知る限り、現代の資産に対する最初の受動的ソース属性ベンチマークを構築し、22の代表的3Dジェネレータを標準、少数ショット、および現実的な展開プロトコルでカバーした。このベンチマークに基づいて、生成型3Dモデルには2種類の安定な指紋が残されていることが判明した: 横方向の不整合と構造的アーティファクトは、幾何学統計学と周波数領域の手がかりに反映されている。これらの分散信号を捉えるために,各ビュー内の外観,幾何学的,周波数領域の特徴を融合させる階層型マルチモード変換器を提案し,ビュー間のグローバルな関係をモデル化する。大規模な実験は強い性能を示し、全監督下で97.22%の精度を達成し、わずか1%の訓練データで77.17%の精度を達成した。これらの結果から,現代の3Dジェネレータは安定かつ帰属可能な指紋を残し,信頼性の高い3Dコンテンツ作成のための新しいベンチマークと方法論の基礎を確立した。

論文の概要: Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models

関連論文リスト