Fugu-MT 論文翻訳(概要): Dual-Diffusional Generative Fashion Recommendation

論文の概要: Dual-Diffusional Generative Fashion Recommendation

arxiv url: http://arxiv.org/abs/2605.17357v1
Date: Sun, 17 May 2026 09:52:18 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:47.91863
Title: Dual-Diffusional Generative Fashion Recommendation
Title（参考訳）: Dual-Diffusional Generative Fashion Recommendation
Authors: Mingzhe Yu, Lei Wu, Qianru Sun, Yunshan Ma,
Abstract要約: 本稿ではDualFashion, Dual-Diffusional Generative Fashion Recommendation Architectureを提案する。提案アーキテクチャは,ファッションアイテムイメージとテキスト記述の両方を生成する。パーソナライズされたFill-in-the-BlankおよびGenerative Outfit Recommendationタスク間のiFashionとPolyvore-Uの実験は、DualFashionが高いパフォーマンスを達成することを示す。
参考スコア（独自算出の注目度）: 34.16599050803813
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Personalized generative recommender systems have emerged as a promising solution for fashion recommendation. However, existing methods primarily rely on implicit visual embeddings from historical interactions, which often contain preference-irrelevant information and result in insufficient user behavior modeling. Moreover, these models typically generate only item images, providing limited interpretability. To address these limitations, we propose DualFashion, a Dual-Diffusional Generative Fashion Recommendation Architecture that jointly models image and text modalities for personalized and explainable recommendation. DualFashion adopts a dual-diffusion Transformer with image and text branches, where structured attribute-level captions and visual outfit information are jointly used as conditioning signals to model user behavior. The proposed architecture produces both fashion item images and textual descriptions, ensuring visual compatibility while providing explicit semantic interpretability. Furthermore, we introduce a text-augmented fine-tuning strategy that enhances generation diversity and enables effective cross-modal knowledge transfer without incurring heavy computational costs. Extensive experiments on iFashion and Polyvore-U across Personalized Fill-in-the-Blank and Generative Outfit Recommendation tasks demonstrate that DualFashion achieves strong performance in behavior modeling, interpretability, and efficiency compared to state-of-the-art methods. Our code and model checkpoints are available at https://github.com/LinkMingzhe/DualFashion.
Abstract（参考訳）: ファッションレコメンデーションのための有望なソリューションとして、パーソナライズされた生成レコメンデーションシステムが現れている。しかし、既存の手法は主に歴史的相互作用からの暗黙的な視覚的埋め込みに依存しており、それはしばしば嗜好に無関係な情報を含んでおり、結果としてユーザー行動のモデリングが不十分である。さらに、これらのモデルは典型的にはアイテムイメージのみを生成し、限定的な解釈性を提供する。これらの制約に対処するため、DualFashionは、パーソナライズされた説明可能なレコメンデーションのために、画像とテキストのモダリティを共同でモデル化する、Dual-Diffusional Generative Fashion Recommendation Architectureを提案する。 DualFashionは、イメージとテキストブランチを備えたデュアル拡散トランスフォーマーを採用しており、構造化属性レベルのキャプションと視覚的な服装情報は、ユーザの振る舞いをモデル化するための条件付け信号として共同で使用される。提案アーキテクチャは、ファッションアイテムの画像とテキスト記述の両方を生成し、明示的な意味論的解釈性を提供しながら、視覚的互換性を確保する。さらに,テキストによる微調整戦略を導入し,生成の多様性を高め,計算コストを伴わずに効果的なクロスモーダルな知識伝達を実現する。パーソナライズされたFill-in-the-BlankおよびGenerative Outfit Recommendationタスク間でのiFashionとPolyvore-Uに関する大規模な実験により、DualFashionは、最先端の手法と比較して、動作モデリング、解釈可能性、効率性において高いパフォーマンスを達成することが示された。私たちのコードとモデルチェックポイントはhttps://github.com/LinkMingzhe/DualFashion.comで公開されています。

論文の概要: Dual-Diffusional Generative Fashion Recommendation

関連論文リスト