Fugu-MT 論文翻訳(概要): SHAMISA: SHAped Modeling of Implicit Structural Associations for Self-supervised No-Reference Image Quality Assessment

論文の概要: SHAMISA: SHAped Modeling of Implicit Structural Associations for Self-supervised No-Reference Image Quality Assessment

arxiv url: http://arxiv.org/abs/2603.13669v1
Date: Sat, 14 Mar 2026 00:37:26 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.329078
Title: SHAMISA: SHAped Modeling of Implicit Structural Associations for Self-supervised No-Reference Image Quality Assessment
Title（参考訳）: SHAMISA:自己監督型非参照画像品質評価のための意図しない構造関連のSHApedモデリング
Authors: Mahdi Naseri, Zhou Wang,
Abstract要約: No-Reference Image Quality Assessment (NR-IQA) は、素質の基準画像にアクセスすることなく知覚品質を推定することを目的としている。本研究では,非コントラストな自己監督型フレームワークであるSHAMISAを提案する。
参考スコア（独自算出の注目度）: 6.175621390241037
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: No-Reference Image Quality Assessment (NR-IQA) aims to estimate perceptual quality without access to a reference image of pristine quality. Learning an NR-IQA model faces a fundamental bottleneck: its need for a large number of costly human perceptual labels. We propose SHAMISA, a non-contrastive self-supervised framework that learns from unlabeled distorted images by leveraging explicitly structured relational supervision. Unlike prior methods that impose rigid, binary similarity constraints, SHAMISA introduces implicit structural associations, defined as soft, controllable relations that are both distortion-aware and content-sensitive, inferred from synthetic metadata and intrinsic feature structure. A key innovation is our compositional distortion engine, which generates an uncountable family of degradations from continuous parameter spaces, grouped so that only one distortion factor varies at a time. This enables fine-grained control over representational similarity during training: images with shared distortion patterns are pulled together in the embedding space, while severity variations produce structured, predictable shifts. We integrate these insights via dual-source relation graphs that encode both known degradation profiles and emergent structural affinities to guide the learning process throughout training. A convolutional encoder is trained under this supervision and then frozen for inference, with quality prediction performed by a linear regressor on its features. Extensive experiments on synthetic, authentic, and cross-dataset NR-IQA benchmarks demonstrate that SHAMISA achieves strong overall performance with improved cross-dataset generalization and robustness, all without human quality annotations or contrastive losses.
Abstract（参考訳）: No-Reference Image Quality Assessment (NR-IQA) は、素質の基準画像にアクセスすることなく知覚品質を推定することを目的としている。 NR-IQAモデルの学習は基本的なボトルネックに直面している。本研究では,非コントラストな自己監督型フレームワークであるSHAMISAを提案する。厳密な二項類似性制約を課す従来の方法とは異なり、SHAMISAは、合成メタデータや本質的な特徴構造から推定される歪みを認識し、内容に敏感な、柔らかく制御可能な関係として定義された暗黙的な構造関連を導入している。重要な革新は、連続パラメータ空間から無数の劣化の族を生成する構成歪みエンジンであり、同時に1つの歪み係数だけが変化するようにグループ化された。これにより、トレーニング中の表現的類似性に対するきめ細かい制御が可能となり、共有歪みパターンのイメージは埋め込み空間にまとめられ、重度変動は構造化され予測可能なシフトを生み出す。これらの知見を、既知の劣化プロファイルと創発的構造親和性の両方を符号化した二重ソース関係グラフを通じて統合し、学習過程をトレーニングを通してガイドする。畳み込みエンコーダは、この監督の下で訓練され、それから推論のために凍結され、線形回帰器がその特徴について品質予測を行う。合成, 認証, クロスデータセット NR-IQA ベンチマークの広範な実験により, SHAMISA は, 人間の品質アノテーションや対照的な損失を伴わずに, クロスデータセットの一般化と堅牢性の向上により, 高い総合的な性能を達成できることを示した。

論文の概要: SHAMISA: SHAped Modeling of Implicit Structural Associations for Self-supervised No-Reference Image Quality Assessment

関連論文リスト