Fugu-MT 論文翻訳(概要): A2BFR: Attribute-Aware Blind Face Restoration

論文の概要: A2BFR: Attribute-Aware Blind Face Restoration

arxiv url: http://arxiv.org/abs/2603.29423v1
Date: Tue, 31 Mar 2026 08:29:04 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-01 15:25:03.353233
Title: A2BFR: Attribute-Aware Blind Face Restoration
Title（参考訳）: A2BFR:属性対応のブラインド顔復元
Authors: Chenxin Zhu, Yushun Fang, Lu Liu, Shibo Yin, Xiaohong Liu, Xiaoyun Zhang, Qiang Hu, Guangtao Zhai,
Abstract要約: ブラインド顔復元(BFR)は、劣化した入力から高品質な顔画像の復元を目的としている。近年の拡散型BFR法は知覚品質を向上させるが制御不能である。本稿では,属性を意識したBFRフレームワークであるA$2$BFRを提案する。
参考スコア（独自算出の注目度）: 48.06304429343643
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Blind face restoration (BFR) aims to recover high-quality facial images from degraded inputs, yet its inherently ill-posed nature leads to ambiguous and uncontrollable solutions. Recent diffusion-based BFR methods improve perceptual quality but remain uncontrollable, whereas text-guided face editing enables attribute manipulation without reliable restoration. To address these issues, we propose A$^2$BFR, an attribute-aware blind face restoration framework that unifies high-fidelity reconstruction with prompt-controllable generation. Built upon a Diffusion Transformer backbone with unified image-text cross-modal attention, A$^2$BFR jointly conditions the denoising trajectory on both degraded inputs and textual prompts. To inject semantic priors, we introduce attribute-aware learning, which supervises denoising latents using facial attribute embeddings extracted by an attribute-aware encoder. To further enhance prompt controllability, we introduce semantic dual-training, which leverages the pairwise attribute variations in our newly curated AttrFace-90K dataset to enforce attribute discrimination while preserving fidelity. Extensive experiments demonstrate that A$^2$BFR achieves state-of-the-art performance in both restoration fidelity and instruction adherence, outperforming diffusion-based BFR baselines by -0.0467 LPIPS and +52.58% attribute accuracy, while enabling fine-grained, prompt-controllable restoration even under severe degradations.
Abstract（参考訳）: ブラインド顔復元(BFR)は、劣化した入力から高品質な顔画像を復元することを目的としている。近年の拡散型BFR法では知覚品質が向上するが制御不能である。これらの問題に対処するために,属性対応のブラインドフェイス復元フレームワークであるA$^2$BFRを提案する。 Diffusion Transformerのバックボーン上に構築され、画像-テキストのクロスモーダルな注意を統一したもので、A$^2$BFRは劣化した入力とテキストプロンプトの両方にデノイングの軌跡を共同で条件付けする。本稿では,属性認識エンコーダによって抽出された顔属性の埋め込みを用いて,潜伏者の聴覚を指導する属性認識学習を提案する。そこで我々は,新たにキュレートしたAttrFace-90Kデータセットの2値属性の変動を利用したセマンティックデュアルトレーニングを導入する。大規模な実験により、A$^2$BFRは、修復の忠実さと命令の付着性の両方において最先端のパフォーマンスを達成し、拡散ベースのBFRベースラインを-0.0467 LPIPSと+52.58%の属性精度で上回り、深刻な劣化下でもきめ細かな、即時制御可能な復元を可能にした。

論文の概要: A2BFR: Attribute-Aware Blind Face Restoration

関連論文リスト