Fugu-MT 論文翻訳(概要): Semi-supervised Latent Disentangled Diffusion Model for Textile Pattern Generation

論文の概要: Semi-supervised Latent Disentangled Diffusion Model for Textile Pattern Generation

arxiv url: http://arxiv.org/abs/2603.16747v1
Date: Tue, 17 Mar 2026 16:24:07 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-18 17:42:07.416316
Title: Semi-supervised Latent Disentangled Diffusion Model for Textile Pattern Generation
Title（参考訳）: 繊維パターン生成のための半教師付き潜伏拡散モデル
Authors: Chenggong Hu, Yi Wang, Mengqi Xue, Haofei Zhang, Jie Song, Li Sun,
Abstract要約: 繊維パターン生成(TPG)は,所定の衣服画像に基づいて微細な繊維パターンを合成することを目的としている。既存のイメージ・ツー・イメージのモデルは、このタスクの自然な候補であるようだ。しかし、直接適用すると、これらの手法は不誠実な結果をもたらすことが多い。忠実かつ高忠実なTPGのための新しい手法SLDDM-TPGを提案する。
参考スコア（独自算出の注目度）: 22.45428755321959
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Textile pattern generation (TPG) aims to synthesize fine-grained textile pattern images based on given clothing images. Although previous studies have not explicitly investigated TPG, existing image-to-image models appear to be natural candidates for this task. However, when applied directly, these methods often produce unfaithful results, failing to preserve fine-grained details due to feature confusion between complex textile patterns and the inherent non-rigid texture distortions in clothing images. In this paper, we propose a novel method, SLDDM-TPG, for faithful and high-fidelity TPG. Our method consists of two stages: (1) a latent disentangled network (LDN) that resolves feature confusion in clothing representations and constructs a multi-dimensional, independent clothing feature space; and (2) a semi-supervised latent diffusion model (S-LDM), which receives guidance signals from LDN and generates faithful results through semi-supervised diffusion training, combined with our designed fine-grained alignment strategy. Extensive evaluations show that SLDDM-TPG reduces FID by 4.1 and improves SSIM by up to 0.116 on our CTP-HD dataset, and also demonstrate good generalization on the VITON-HD dataset.
Abstract（参考訳）: 繊維パターン生成(TPG)は,所定の衣服画像に基づいて微細な繊維パターンを合成することを目的としている。これまでの研究では、TPGについて明確には研究されていないが、既存の画像と画像のモデルは、この課題の自然な候補であるようだ。しかし、直接適用すると、複雑な織物パターンと衣料画像の固有の非剛性テクスチャ歪みの相違により、細かな細部を保たず、しばしば不誠実な結果をもたらす。本稿では,忠実かつ高忠実なTPGのための新しい手法であるSLDDM-TPGを提案する。提案手法は,(1)服飾表現の特徴的混乱を解消し,多次元の独立した衣服特徴空間を構築する潜伏不整合ネットワーク(LDN),(2)LDNからの誘導信号を受信し,半教師付き拡散訓練により忠実な結果を生成する半教師付き潜伏拡散モデル(S-LDM)の2段階からなる。 CTP-HDデータセットでは,SLDDM-TPGはFIDを4.1削減し,SSIMを0.116改善し,VITON-HDデータセットでは良好な一般化を示した。

論文の概要: Semi-supervised Latent Disentangled Diffusion Model for Textile Pattern Generation

関連論文リスト