Fugu-MT Paper Translation (Abstract): Multi-View Hierarchical Graph Neural Network for Sketch-Based 3D Shape Retrieval

Paper abstract: Multi-View Hierarchical Graph Neural Network for Sketch-Based 3D Shape Retrieval

arxiv url: http://arxiv.org/abs/2604.18019v1
Date: Mon, 20 Apr 2026 09:46:00 GMT
Status: Translation complete
System update date: 2026-04-21 21:52:52.792445
Title: Multi-View Hierarchical Graph Neural Network for Sketch-Based 3D Shape Retrieval
Title (reference translation): Multi-View Hierarchical Graph Neural Network for Sketch-Based 3D Shape Retrieval
Authors: Hang Cheng, Muyan He, Mingyu Fan, Chengfeng Xie, Xi Cheng, Long Zeng,
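Abstract summary: Sketch-based 3D shape retrieval aims to retrieve 3D shapes that are consistent with the category of a hand-drawn sketch. This paper proposes Multi-View Hierarchical Graph Neural Network (MV-HGNN), a novel framework for SBSR. Under both category-level and zero-shot settings, MV-HGNN outperforms state-of-the-art methods.
Reference score (independently computed attention measure): 8.680040031590362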
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Sketch-based 3D shape retrieval (SBSR) aims to retrieve 3D shapes that are consistent with the category of the input hand-drawn sketch. The core challenge of this task lies in two aspects. First, existing methods typically employ simplified aggregation strategies for independently encoded 3D multi-view features, which ignore the geometric relationships between views and multi-level details, resulting in weak 3D representations. Second, traditional SBSR methods are limited to seen categories, leading to poor performance in zero-shot scenarios. To address these challenges, we propose Multi-View Hierarchical Graph Neural Network (MV-HGNN), a novel framework for SBSR. Specifically, we construct a view-level graph and capture adjacent geometric dependencies and cross-view message passing via local graph convolution and global attention. A view selector is further introduced to perform hierarchical graph coarsening, enabling a progressively larger receptive field for graph convolution and mitigating the interference of redundant views, which leads to a more discriminative hierarchical 3D representation. To enable category-agnostic alignment and mitigate overfitting to seen classes, we leverage CLIP text embeddings as semantic prototypes and project both sketch and 3D features into a shared semantic space. We use a two-stage training strategy for category-level retrieval and a one-stage strategy for zero-shot retrieval under the same model architecture. Under both category-level and zero-shot settings, extensive experiments on two public benchmarks demonstrate that MV-HGNN outperforms state-of-the-art methods.
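The view-level graph, local graph convolution, global attention, and view-selector coarsening described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the ring-shaped camera layout, the unweighted mean aggregation, the projection-free attention, the L2-norm view score, and the names `ring_adjacency`, `select_views`, and `hierarchical_descriptor` are all assumptions made for illustration.

```python
import math
import random

def ring_adjacency(n):
    """Neighbours of each view on a ring of n cameras (assumed layout), self included."""
    return [sorted({(i - 1) % n, i, (i + 1) % n}) for i in range(n)]

def local_graph_conv(feats, adj):
    """Mean-aggregate each view's features over its neighbours (no learned weights)."""
    dim = len(feats[0])
    return [[sum(feats[j][d] for j in nbrs) / len(nbrs) for d in range(dim)]
            for nbrs in adj]

def global_attention(feats):
    """Single-head dot-product self-attention over all views (no learned projections)."""
    dim = len(feats[0])
    out = []
    for q in feats:
        logits = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in feats]
        m = max(logits)
        w = [math.exp(l - m) for l in logits]
        z = sum(w)
        out.append([sum(w[j] / z * feats[j][d] for j in range(len(feats)))
                    for d in range(dim)])
    return out

def select_views(feats, keep):
    """Hypothetical view selector: keep the `keep` highest-norm views, in ring order."""
    scores = [math.sqrt(sum(x * x for x in f)) for f in feats]
    top = sorted(sorted(range(len(feats)), key=lambda i: -scores[i])[:keep])
    return [feats[i] for i in top]

def max_readout(feats):
    """Pool a set of view features into one vector by element-wise maximum."""
    return [max(f[d] for f in feats) for d in range(len(feats[0]))]

def hierarchical_descriptor(feats, keeps=(6, 3)):
    """Run local + global mixing at each level, coarsen, and concatenate the readouts."""
    levels = []
    for keep in (None,) + tuple(keeps):
        if keep is not None:
            feats = select_views(feats, keep)  # coarsen: fewer views, wider neighbourhoods
        adj = ring_adjacency(len(feats))
        local = local_graph_conv(feats, adj)
        glob = global_attention(feats)
        feats = [[a + b for a, b in zip(l, g)] for l, g in zip(local, glob)]
        levels.append(max_readout(feats))
    return [x for level in levels for x in level]

random.seed(0)
views = [[random.gauss(0, 1) for _ in range(8)] for _ in range(12)]  # 12 views, 8-dim each
descriptor = hierarchical_descriptor(views)
print(len(descriptor))  # 24: three levels of 8-dim readouts
```

Because each coarsening step drops views and rebuilds the ring over the survivors, a neighbour at a deeper level spans a wider angular range of the original cameras, which is one simple way to read the "progressively larger receptive field" mentioned in the abstract.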
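Abstract (reference translation): Sketch-based 3D shape retrieval (SBSR) aims to retrieve 3D shapes that are consistent with the category of a hand-drawn sketch. Existing methods apply simplified aggregation strategies to independently encoded 3D multi-view features, ignoring the geometric relationships between views and multi-level details, which results in weak 3D representations. At the same time, conventional SBSR methods are limited to seen categories and perform poorly in zero-shot scenarios. To address these challenges, we propose Multi-View Hierarchical Graph Neural Network (MV-HGNN), a novel framework for SBSR. Specifically, we construct a view-level graph and capture adjacent geometric dependencies and cross-view message passing through local graph convolution and global attention. A view selector is further introduced to perform hierarchical graph coarsening, which enlarges the receptive field of graph convolution and mitigates interference from redundant views, yielding a more discriminative hierarchical 3D representation. To enable category-agnostic alignment, we use CLIP text embeddings as semantic prototypes and project sketch and 3D features into a shared semantic space. We use a two-stage training strategy for category-level retrieval and a one-stage strategy for zero-shot retrieval under the same model architecture. Extensive experiments on two public benchmarks show that MV-HGNN outperforms state-of-the-art methods under both category-level and zero-shot settings.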
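The idea of using text embeddings as semantic prototypes and projecting sketch and 3D features into one shared space can be sketched as follows. This is an illustration under stated assumptions, not the paper's training objective: the prototypes here are placeholder vectors rather than real CLIP outputs, the projection is a plain linear map, the loss is an ordinary cross-entropy over cosine similarities, and the temperature value and the names `project`, `prototype_loss`, and `retrieve` are hypothetical.

```python
import math

def normalize(v):
    """Scale a vector to unit L2 norm (zero vectors are returned unchanged)."""
    n = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / n for x in v]

def project(feat, weight):
    """Linear map into the semantic space; `weight` stands in for a learned head."""
    return normalize([sum(w * x for w, x in zip(row, feat)) for row in weight])

def cosine(a, b):
    """Cosine similarity of two unit-norm vectors."""
    return sum(x * y for x, y in zip(a, b))

def prototype_loss(embedding, prototypes, label, temperature=0.07):
    """Cross-entropy over similarities to class prototypes (temperature is an assumption)."""
    logits = [cosine(embedding, p) / temperature for p in prototypes]
    m = max(logits)
    log_z = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_z - logits[label]

def retrieve(sketch_emb, shape_embs):
    """Rank 3D shapes by similarity to the sketch in the shared space."""
    return sorted(range(len(shape_embs)), key=lambda i: -cosine(sketch_emb, shape_embs[i]))

# Placeholder prototypes standing in for text embeddings of two class names.
prototypes = [normalize(p) for p in ([1.0, 0.0, 0.0], [0.0, 1.0, 0.0])]
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]  # stand-in projection

sketch = project([0.9, 0.1, 0.0], identity)
shapes = [project(f, identity) for f in ([0.1, 1.0, 0.0], [1.0, 0.2, 0.0])]

ranking = retrieve(sketch, shapes)
print(ranking)  # [1, 0]: the second shape is closest to the sketch
```

Since both modalities are pulled toward the same fixed prototypes, a sketch and a shape of an unseen class can still be compared directly by cosine similarity at test time, which is the property the zero-shot setting relies on.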


Related papers