Fugu-MT 論文翻訳(概要): PR-CAD: Progressive Refinement for Unified Controllable and Faithful Text-to-CAD Generation with Large Language Models

論文の概要: PR-CAD: Progressive Refinement for Unified Controllable and Faithful Text-to-CAD Generation with Large Language Models

arxiv url: http://arxiv.org/abs/2604.19773v1
Date: Fri, 27 Mar 2026 12:13:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-04 02:32:14.065627
Title: PR-CAD: Progressive Refinement for Unified Controllable and Faithful Text-to-CAD Generation with Large Language Models
Title（参考訳）: PR-CAD:大規模言語モデルを用いた統一制御可能で忠実なテキスト・ツー・CAD生成のためのプログレッシブ・リファインメント
Authors: Jiyuan An, Jiachen Zhao, Fan Chen, Liner Yang, Zhenghao Liu, Hongyan Wang, Weihua An, Meishan Zhang, Erhong Yang,
Abstract要約: PR-CADは、制御可能で忠実なテキスト・ツー・CADモデリングのための生成と編集を統一するプログレッシブ・リファインメント・フレームワークである。データセットは、編集操作の種類を体系的に定義し、高度にヒューマンライクなインタラクションデータを生成する。パブリックベンチマークでは、PR-CADは生成シナリオと改善シナリオの両方において最先端の制御性と忠実性を達成する。
参考スコア（独自算出の注目度）: 32.278474349549704
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The construction of CAD models has traditionally relied on labor-intensive manual operations and specialized expertise. Recent advances in large language models (LLMs) have inspired research into text-to-CAD generation. However, existing approaches typically treat generation and editing as disjoint tasks, limiting their practicality. We propose PR-CAD, a progressive refinement framework that unifies generation and editing for controllable and faithful text-to-CAD modeling. To support this, we curate a high-fidelity interaction dataset spanning the full CAD lifecycle, encompassing multiple CAD representations as well as both qualitative and quantitative descriptions. The dataset systematically defines the types of edit operations and generates highly human-like interaction data. Building on a CAD representation tailored for LLMs, we propose a reinforcement learning-enhanced reasoning framework that integrates intent understanding, parameter estimation, and precise edit localization into a single agent. This enables an "all-in-one" solution for both design creation and refinement. Extensive experiments demonstrate strong mutual reinforcement between generation and editing tasks, and across qualitative and quantitative modalities. On public benchmarks, PR-CAD achieves state-of-the-art controllability and faithfulness in both generation and refinement scenarios, while also proving user-friendly and significantly improving CAD modeling efficiency.
Abstract（参考訳）: CADモデルの構築は、伝統的に労働集約的な手作業と専門的な専門知識に依存してきた。大規模言語モデル(LLM)の最近の進歩は、テキスト・ツー・CAD生成の研究にインスピレーションを与えている。しかし、既存のアプローチは通常、生成と編集を非結合的なタスクとして扱い、実用性を制限する。制御可能で忠実なテキスト・ツー・CADモデリングのための生成と編集を統一するプログレッシブ・リファインメント・フレームワークであるPR-CADを提案する。これをサポートするために,CADのライフサイクル全体にわたる高忠実な相互作用データセットをキュレートし,複数のCAD表現と質的および定量的な記述を包含する。データセットは、編集操作の種類を体系的に定義し、高度にヒューマンライクなインタラクションデータを生成する。 LLMに適したCAD表現に基づいて、意図理解、パラメータ推定、高精度な編集ローカライゼーションを単一のエージェントに統合する強化学習強化推論フレームワークを提案する。これによって"オールインワン"なソリューションが,設計生成と改良の両方に実現される。大規模な実験は、生成タスクと編集タスク、質的および定量的なモダリティの間に強い相互強化を示す。パブリックベンチマークでは、PR-CADは、生成シナリオと改善シナリオの両方において、最先端の制御性と忠実性を達成すると同時に、ユーザフレンドリでCADモデリング効率を大幅に改善する。

論文の概要: PR-CAD: Progressive Refinement for Unified Controllable and Faithful Text-to-CAD Generation with Large Language Models

関連論文リスト