Fugu-MT 論文翻訳(概要): GS-CLIP: Zero-shot 3D Anomaly Detection by Geometry-Aware Prompt and Synergistic View Representation Learning

論文の概要: GS-CLIP: Zero-shot 3D Anomaly Detection by Geometry-Aware Prompt and Synergistic View Representation Learning

arxiv url: http://arxiv.org/abs/2602.19206v1
Date: Sun, 22 Feb 2026 14:30:41 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-24 17:42:02.530544
Title: GS-CLIP: Zero-shot 3D Anomaly Detection by Geometry-Aware Prompt and Synergistic View Representation Learning
Title（参考訳）: GS-CLIP:Geometry-Aware PromptとSynergistic View Representation Learningによるゼロショット3次元異常検出
Authors: Zehao Deng, An Liu, Yan Wang,
Abstract要約: 3D異常検出は、ターゲットのトレーニングデータなしでターゲットデータセット内の異常を検出することを目的とした、新たなタスクである。現在の方法は、3Dポイントクラウドを2D表現に投影することでCLIPに適応するが、それらは課題に直面している。本研究では,2段階の学習プロセスを通じて幾何学的異常を識別するゲノメトリ・アウェア・プロンプトとシネジスティック・ビュー表現学習フレームワークを提案する。
参考スコア（独自算出の注目度）: 11.364765496753074
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Zero-shot 3D Anomaly Detection is an emerging task that aims to detect anomalies in a target dataset without any target training data, which is particularly important in scenarios constrained by sample scarcity and data privacy concerns. While current methods adapt CLIP by projecting 3D point clouds into 2D representations, they face challenges. The projection inherently loses some geometric details, and the reliance on a single 2D modality provides an incomplete visual understanding, limiting their ability to detect diverse anomaly types. To address these limitations, we propose the Geometry-Aware Prompt and Synergistic View Representation Learning (GS-CLIP) framework, which enables the model to identify geometric anomalies through a two-stage learning process. In stage 1, we dynamically generate text prompts embedded with 3D geometric priors. These prompts contain global shape context and local defect information distilled by our Geometric Defect Distillation Module (GDDM). In stage 2, we introduce Synergistic View Representation Learning architecture that processes rendered and depth images in parallel. A Synergistic Refinement Module (SRM) subsequently fuses the features of both streams, capitalizing on their complementary strengths. Comprehensive experimental results on four large-scale public datasets show that GS-CLIP achieves superior performance in detection. Code can be available at https://github.com/zhushengxinyue/GS-CLIP.
Abstract（参考訳）: Zero-shot 3D Anomaly Detectionは、ターゲットデータセットの異常をターゲットのトレーニングデータなしで検出することを目的とした、新たなタスクである。現在のメソッドは、3Dポイントクラウドを2D表現に投影することでCLIPに適応するが、それらは課題に直面している。プロジェクションは本質的に幾何的な細部が失われ、単一の2次元モードへの依存は不完全な視覚的理解をもたらし、多様な異常な型を検出する能力を制限する。これらの制約に対処するために,2段階の学習プロセスを通じて幾何学的異常を識別できるGeometry-Aware Prompt and Synergistic View Representation Learning (GS-CLIP)フレームワークを提案する。ステージ1では、3次元幾何学的先行情報に埋め込まれたテキストプロンプトを動的に生成する。これらのプロンプトは、我々のGeometric Defect Distillation Module (GDDM)によって蒸留された、グローバルな形状コンテキストと局所的な欠陥情報を含んでいる。ステージ2では、描画と深度画像を並列に処理するSynergistic View Representation Learningアーキテクチャを導入する。その後、SRM(Synergistic Refinement Module)が両方のストリームの特徴を融合させ、相補的な強みを生かした。 4つの大規模公開データセットの総合的な実験結果から,GS-CLIPは検出性能に優れていた。コードはhttps://github.com/zhushengxinyue/GS-CLIPで入手できる。

論文の概要: GS-CLIP: Zero-shot 3D Anomaly Detection by Geometry-Aware Prompt and Synergistic View Representation Learning

関連論文リスト