Fugu-MT 論文翻訳(概要): UniMesh: Unifying 3D Mesh Understanding and Generation

論文の概要: UniMesh: Unifying 3D Mesh Understanding and Generation

arxiv url: http://arxiv.org/abs/2604.17472v1
Date: Sun, 19 Apr 2026 14:53:38 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.542253
Title: UniMesh: Unifying 3D Mesh Understanding and Generation
Title（参考訳）: UniMesh: 3Dメッシュの理解と生成の統合
Authors: Peng Huang, Yifeng Chen, Zeyu Zhang, Hao Tang,
Abstract要約: 一つのアーキテクチャ内で3D生成と理解を共同で学習する統合フレームワークUniMeshを提案する。まず,拡散に基づく画像生成を暗黙の形状デコーダでブリッジし,クロスモデルインタフェースとして機能する新しいメッシュヘッドを提案する。第2に、ユーザ主導のセマンティックメッシュ編集を可能にする反復推論の幾何学的インスタンス化であるChain of Mesh(CoM)を開発する。第3に,3次元キャプションなどの高レベルタスクにおける障害の診断と修正を行うために,アクタ評価器を用いた自己反射機構を組み込んだ。
参考スコア（独自算出の注目度）: 17.488670315445983
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Recent advances in 3D vision have led to specialized models for either 3D understanding (e.g., shape classification, segmentation, reconstruction) or 3D generation (e.g., synthesis, completion, and editing). However, these tasks are often tackled in isolation, resulting in fragmented architectures and representations that hinder knowledge transfer and holistic scene modeling. To address these challenges, we propose UniMesh, a unified framework that jointly learns 3D generation and understanding within a single architecture. First, we introduce a novel Mesh Head that acts as a cross model interface, bridging diffusion based image generation with implicit shape decoders. Second, we develop Chain of Mesh (CoM), a geometric instantiation of iterative reasoning that enables user driven semantic mesh editing through a closed loop latent, prompting, and re generation cycle. Third, we incorporate a self reflection mechanism based on an Actor Evaluator Self reflection triad to diagnose and correct failures in high level tasks like 3D captioning. Experimental results demonstrate that UniMesh not only achieves competitive performance on standard benchmarks but also unlocks novel capabilities in iterative editing and mutual enhancement between generation and understanding. Code: https://github.com/AIGeeksGroup/UniMesh. Website: https://aigeeksgroup.github.io/UniMesh.
Abstract（参考訳）: 近年の3次元視覚の進歩は、3次元理解(例えば、形状分類、セグメンテーション、再構成)または3次元生成(例えば、合成、完成、編集)の特別なモデルに繋がった。しかしながら、これらのタスクは独立して取り組まれることが多く、結果として、知識伝達や全体論的シーンモデリングを妨げる、断片化されたアーキテクチャや表現が生まれる。これらの課題に対処するために、単一のアーキテクチャ内で3D生成と理解を共同で学習する統合フレームワークであるUniMeshを提案する。まず,拡散に基づく画像生成を暗黙の形状デコーダでブリッジし,クロスモデルインタフェースとして機能する新しいメッシュヘッドを提案する。第二にChain of Mesh(CoM)を開発した。これは反復推論の幾何学的インスタンス化であり、クローズドループ遅延、プロンプト、再生成サイクルによるユーザ駆動セマンティックメッシュの編集を可能にする。第3に,3次元キャプションなどの高レベルタスクにおける障害の診断と修正を行うために,アクタ評価器を用いた自己反射機構を組み込んだ。実験結果から、UniMeshは標準ベンチマーク上での競争性能を達成するだけでなく、反復的な編集や、生成と理解の相互強化における新たな能力も解放することが示された。コード:https://github.com/AIGeeksGroup/UniMesh ウェブサイト:https://aigeeksgroup.github.io/UniMesh

論文の概要: UniMesh: Unifying 3D Mesh Understanding and Generation

関連論文リスト