Fugu-MT 論文翻訳(概要): Reconstruction of a 3D wireframe from a single line drawing via generative depth estimation

論文の概要: Reconstruction of a 3D wireframe from a single line drawing via generative depth estimation

arxiv url: http://arxiv.org/abs/2604.13549v1
Date: Wed, 15 Apr 2026 06:52:04 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-16 20:38:32.422951
Title: Reconstruction of a 3D wireframe from a single line drawing via generative depth estimation
Title（参考訳）: 生成深度推定による単線描画からの3次元ワイヤフレームの再構成
Authors: Elton Cao, Hod Lipson,
Abstract要約: 2Dフリーハンドスケッチを3Dモデルに変換することは、コンピュータビジョンにおいて重要な課題である。本研究では, 条件付き深度推定タスクとして, フレーミング再構成による生成手法を提案する。我々は、100万以上の画像深度対の巨大なデータセットを使用して、我々のアプローチを訓練し、評価する。
参考スコア（独自算出の注目度）: 5.596217605600402
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The conversion of 2D freehand sketches into 3D models remains a pivotal challenge in computer vision, bridging the gap between human creativity and digital fabrication. Traditional line drawing reconstruction relies on brittle symbolic logic, while modern approaches are constrained by rigid parametric modeling, limiting users to predefined CAD primitives. We propose a generative approach by framing reconstruction as a conditional dense depth estimation task. To achieve this, we implement a Latent Diffusion Model (LDM) with a ControlNet-style conditioning framework to resolve the inherent ambiguities of orthographic projections. To support an iterative "sketch-reconstruct-sketch" workflow, we introduce a graph-based BFS masking strategy to simulate partial depth cues. We train and evaluate our approach using a massive dataset of over one million image-depth pairs derived from the ABC Dataset. Our framework demonstrates robust performance across varying shape complexities, providing a scalable pipeline for converting sparse 2D line drawings into dense 3D representations, effectively allowing users to "draw in 3D" without the rigid constraints of traditional CAD.
Abstract（参考訳）: 2Dのフリーハンドスケッチを3Dモデルに変換することは、コンピュータビジョンにおいて重要な課題であり、人間の創造性とデジタル製作のギャップを埋めている。伝統的な線描再構成は脆い記号論理に依存し、現代のアプローチは厳密なパラメトリックモデリングによって制約され、ユーザーは事前に定義されたCADプリミティブに制限される。本研究では, 条件付き深度推定タスクとして, フレーミング再構成による生成手法を提案する。これを実現するために、直交射影の固有の曖昧さを解決するために、制御ネットスタイルの条件付けフレームワークを備えた潜在拡散モデル(LDM)を実装した。反復的な「スケッチ・リコンストラクト・スケッチ」ワークフローをサポートするため、グラフベースのBFSマスキング戦略を導入し、部分深度をシミュレートする。 ABCデータセットから得られた100万以上の画像深度対の膨大なデータセットを用いて、我々のアプローチを訓練し、評価する。筆者らのフレームワークは, 様々な形状の複雑度にまたがる堅牢な性能を示し, スパース2次元線描画を高密度な3次元表現に変換するためのスケーラブルなパイプラインを提供し, 従来のCADの厳密な制約を伴わずに, ユーザが「3Dで描画できる」。

論文の概要: Reconstruction of a 3D wireframe from a single line drawing via generative depth estimation

関連論文リスト