Fugu-MT 論文翻訳(概要): Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection

論文の概要: Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection

arxiv url: http://arxiv.org/abs/2603.21511v1
Date: Mon, 23 Mar 2026 02:59:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-24 19:11:39.456912
Title: Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection
Title（参考訳）: バック・トゥ・ポイント:ゼロショット3次元異常検出のための点線モデル探索
Authors: Kaiqiang Li, Gang Li, Mingle Zhou, Min Li, Delong Han, Jin Wan,
Abstract要約: ゼロショット(ZS)3次元異常検出は信頼性の高い産業検査に不可欠である。既存のアプローチでは、3Dポイントクラウドを2Dイメージにレンダリングし、異常検出のためにトレーニング済みのビジョンランゲージモデル(VLM)を活用する。我々は、3Dポイントクラウドとテキスト埋め込みを効果的に整合させる新しいフレームワークであるBTPを提案する。
参考スコア（独自算出の注目度）: 13.401003727539125
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Zero-shot (ZS) 3D anomaly detection is crucial for reliable industrial inspection, as it enables detecting and localizing defects without requiring any target-category training data. Existing approaches render 3D point clouds into 2D images and leverage pre-trained Vision-Language Models (VLMs) for anomaly detection. However, such strategies inevitably discard geometric details and exhibit limited sensitivity to local anomalies. In this paper, we revisit intrinsic 3D representations and explore the potential of pre-trained Point-Language Models (PLMs) for ZS 3D anomaly detection. We propose BTP (Back To Point), a novel framework that effectively aligns 3D point cloud and textual embeddings. Specifically, BTP aligns multi-granularity patch features with textual representations for localized anomaly detection, while incorporating geometric descriptors to enhance sensitivity to structural anomalies. Furthermore, we introduce a joint representation learning strategy that leverages auxiliary point cloud data to improve robustness and enrich anomaly semantics. Extensive experiments on Real3D-AD and Anomaly-ShapeNet demonstrate that BTP achieves superior performance in ZS 3D anomaly detection. Code will be available at \href{https://github.com/wistful-8029/BTP-3DAD}{https://github.com/wistful-8029/BTP-3DAD}.
Abstract（参考訳）: ゼロショット(ZS)3D異常検出は、目標カテゴリのトレーニングデータを必要としない欠陥の検出と位置決めを可能にするため、信頼性の高い産業検査に不可欠である。既存のアプローチでは、3Dポイントクラウドを2Dイメージにレンダリングし、異常検出のためにトレーニング済みのビジョンランゲージモデル(VLM)を活用する。しかし、このような戦略は必然的に幾何学的詳細を捨て、局所的な異常に対して限定的な感度を示す。本稿では,固有3次元表現を再検討し,ZS 3D異常検出のための事前学習されたポイント・ランゲージ・モデル(PLM)の可能性を探る。我々は、3Dポイントクラウドとテキスト埋め込みを効果的に整合させる新しいフレームワークであるBTP(Back To Point)を提案する。具体的には、BTPは、多粒度パッチの特徴を局所的な異常検出のためのテキスト表現と整合させ、幾何学的記述子を組み込んで構造的異常に対する感受性を高める。さらに,補助点クラウドデータを活用する共同表現学習戦略を導入し,ロバスト性の向上と異常セマンティクスの充実を図る。 Real3D-ADとAnomaly-ShapeNetの大規模な実験により、BTPはZS 3D異常検出において優れた性能を発揮することが示された。コードは \href{https://github.com/wistful-8029/BTP-3DAD}{https://github.com/wistful-8029/BTP-3DAD} で入手できる。

論文の概要: Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection

関連論文リスト