Fugu-MT Paper Translation (Summary): Perceptual Quality Assessment of Virtual Reality Videos in the Wild

Paper summary: Perceptual Quality Assessment of Virtual Reality Videos in the Wild

arxiv url: http://arxiv.org/abs/2206.08751v2
Date: Mon, 3 Jul 2023 06:41:26 GMT
Status: Translation complete
Last updated in system: 2023-07-04 16:08:57.217011
Title: Perceptual Quality Assessment of Virtual Reality Videos in the Wild
Title (reference translation): Perceptual quality assessment of virtual reality videos in the wild
Authors: Wen Wen, Mu Li, Yiru Yao, Xiangjie Sui, Yabin Zhang, Long Lan, Yuming Fang, Kede Ma
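Abstract summary: Existing panoramic video databases consider only synthetic distortions, assume fixed viewing conditions, and are limited in size. We construct the VR Video Quality in the Wild (VRVQW) database. We conduct a formal psychophysical experiment to record scanpaths and perceived quality scores from 139 participants under two different viewing conditions.
Reference score (independently computed attention score): 50.33693148440248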
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Investigating how people perceive virtual reality videos in the wild (i.e., those captured by everyday users) is a crucial and challenging task in VR-related applications due to complex authentic distortions localized in space and time. Existing panoramic video databases only consider synthetic distortions, assume fixed viewing conditions, and are limited in size. To overcome these shortcomings, we construct the VR Video Quality in the Wild (VRVQW) database, which is one of the first of its kind, and contains 502 user-generated videos with diverse content and distortion characteristics. Based on VRVQW, we conduct a formal psychophysical experiment to record the scanpaths and perceived quality scores from 139 participants under two different viewing conditions. We provide a thorough statistical analysis of the recorded data, observing significant impact of viewing conditions on both human scanpaths and perceived quality. Moreover, we develop an objective quality assessment model for VR videos based on pseudocylindrical representation and convolution. Results on the proposed VRVQW show that our method is superior to existing video quality assessment models, only underperforming viewport-based models that otherwise rely on human scanpaths for projection. Last, we explore the additional use of the VRVQW dataset to benchmark saliency detection techniques, highlighting the need for further research. We have made the database and code available at https://github.com/limuhit/VR-Video-Quality-in-the-Wild.
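Abstract (reference translation): Examining how people perceive virtual reality videos in the wild is an important and difficult task in VR-related applications, because the authentic distortions involved are complex and localized in space and time. Existing panoramic video databases consider only synthetic distortions, assume fixed viewing conditions, and are limited in size. To overcome these shortcomings, we built the VR Video Quality in the Wild (VRVQW) database. Based on VRVQW, we ran a formal psychophysical experiment that recorded scanpaths and quality scores from 139 participants under two different viewing conditions. A detailed statistical analysis of the recorded data shows that viewing conditions affect both human scanpaths and perceived quality. We also built an objective quality assessment model for VR videos based on pseudocylindrical representation and convolution. Results on VRVQW show that the proposed method outperforms existing video quality assessment models and falls behind only viewport-based models, which rely on human scanpaths for projection. Finally, we benchmark saliency detection techniques on the VRVQW dataset and highlight the need for further research. The database and code are available at https://github.com/limuhit/VR-Video-Quality-in-the-Wild.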
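The abstract mentions a pseudocylindrical representation without describing it. As a rough illustration only, the sketch below resamples each row of an equirectangular frame to a width proportional to the cosine of its latitude (a sinusoidal-style layout). The functions `row_widths` and `to_pseudocylindrical`, and the choice of a cosine width schedule, are assumptions made for this example and are not taken from the paper or its code.

```python
import numpy as np

def row_widths(height, width):
    """Hypothetical width schedule: each row gets about width * cos(latitude) samples."""
    # Latitude of each row centre, from near +pi/2 (top) to near -pi/2 (bottom).
    lat = (0.5 - (np.arange(height) + 0.5) / height) * np.pi
    return np.maximum(1, np.round(width * np.cos(lat)).astype(int))

def to_pseudocylindrical(erp):
    """Resample a 2-D equirectangular array row by row.

    Returns a list of 1-D arrays, one per row, whose lengths follow row_widths.
    """
    height, width = erp.shape
    widths = row_widths(height, width)
    src_x = (np.arange(width) + 0.5) / width  # source pixel centres in [0, 1)
    rows = []
    for y in range(height):
        dst_x = (np.arange(widths[y]) + 0.5) / widths[y]
        # period=1.0 makes the interpolation wrap around in longitude.
        rows.append(np.interp(dst_x, src_x, erp[y], period=1.0))
    return rows

if __name__ == "__main__":
    frame = np.random.default_rng(0).random((4, 8))  # toy stand-in for a frame
    print([len(r) for r in to_pseudocylindrical(frame)])
```

Rows near the poles keep fewer samples than rows near the equator, which reduces the oversampling that equirectangular frames have at high latitudes.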

Related papers

Adaptive Score Alignment Learning for Continual Perceptual Quality Assessment of 360-Degree Videos in Virtual Reality [20.511561848185444]
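We propose Adaptive Score Alignment Learning (ASAL), a new method for assessing the perceptual quality of VR videos. ASAL integrates a correlation loss and an error loss to improve both alignment with human subjective ratings and the accuracy of perceptual quality prediction. We establish a comprehensive benchmark for VR-VQA and its continual learning (CL) counterpart, introducing new data splits and evaluation metrics.
Paper reference translation (metadata) (2025-02-27T00:29:04Z)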
ESVQA: Perceptual Quality Assessment of Egocentric Spatial Videos [71.62145804686062]
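We introduce the first egocentric spatial video quality assessment database (ESVQAD), comprising 600 egocentric spatial videos and their mean opinion scores (MOS). We propose ESVQAnet, a novel multi-dimensional binocular feature fusion model that integrates binocular spatial, motion, and semantic features to predict perceptual quality. Experimental results show that ESVQAnet outperforms 16 state-of-the-art VQA models on the perceptual quality assessment task.
Paper reference translation (metadata) (2024-12-29T10:13:30Z)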
AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results [120.95863275142727]
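This paper presents the Compressed Video Quality Assessment challenge, held in conjunction with the Advances in Image Manipulation (AIM) workshop at ECCV 2024. The challenge evaluated the performance of VQA methods on a diverse dataset of 459 videos encoded with 14 codecs of various compression standards.
Paper reference translation (metadata) (2024-08-21T20:32:45Z)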
WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models [132.77237314239025]
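Video virtual try-on aims to generate realistic sequences that preserve garment identity and adapt to the person's pose and body shape in the source video. Traditional image-based methods rely on warping and blending and struggle with complex human motion and occlusion. We reconceive video try-on as a process of generating videos conditioned on garment descriptions and human motion. Our solution, WildVidFit, uses image-based controlled diffusion models for a streamlined, one-stage approach.
Paper reference translation (metadata) (2024-07-15T11:21:03Z)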
KVQ: Kwai Video Quality Assessment for Short-form Videos [24.5291786508361]
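We build the first large-scale Kaleidoscope short Video database for Quality assessment (KVQ), comprising 600 user-uploaded short videos and 3600 processed videos. We then propose a video quality evaluator, KSVQE, which identifies quality-determining semantics using the content understanding of large vision-language models.
Paper reference translation (metadata) (2024-02-11T14:37:54Z)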
Exploring the Effectiveness of Video Perceptual Representation in Blind Video Quality Assessment [55.65173181828863]
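We propose a temporal perceptual quality index (TPQI) that measures temporal distortion by describing the graphic morphology of the representation. Experiments show that TPQI is an effective way of predicting subjective temporal quality.
Paper reference translation (metadata) (2022-07-08T07:30:51Z)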
Perceptual Quality Assessment of Omnidirectional Images [81.76416696753947]
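We first build an omnidirectional IQA (OIQA) database containing 16 source images and 320 distorted images. We then conduct a subjective quality assessment study on the OIQA database in a VR environment. The original and distorted omnidirectional images, the subjective quality ratings, and the head and eye movement data together constitute the OIQA database.
Paper reference translation (metadata) (2022-07-06T13:40:38Z)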
Blindly Assess Quality of In-the-Wild Videos via Quality-aware Pre-training and Motion Perception [32.87570883484805]
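We propose transferring knowledge from image quality assessment (IQA) databases and from large-scale action recognition with rich motion patterns. We train the proposed model on the target VQA databases using a mixed list-wise ranking loss function.
Paper reference translation (metadata) (2021-08-19T05:29:19Z)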
Unified Quality Assessment of In-the-Wild Videos with Mixed Datasets Training [20.288424566444224]
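We focus on automatically assessing the quality of in-the-wild videos in computer vision applications. We borrow intuitions from human perception to improve the performance of the quality assessment model. We propose a mixed datasets training strategy for training a single VQA model with multiple datasets.
Paper reference translation (metadata) (2020-11-09T09:22:57Z)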
Perceptual Quality Assessment of Omnidirectional Images as Moving Camera Videos [49.217528156417906]
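Two types of VR viewing conditions are crucial in determining users' viewing behaviors and the perceived quality of a panorama. We first transform an omnidirectional image into several video representations using the viewing behaviors of different users under different viewing conditions. We then leverage advanced 2D full-reference video quality models to compute the perceived quality.
Paper reference translation (metadata) (2020-05-21T10:03:40Z)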
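The ASAL entry in the list above describes combining a correlation loss with an error loss. One common way to write such an objective is sketched below. The specific choices here (one minus the Pearson correlation as the correlation term, mean squared error as the error term, and the `weight` parameter) are assumptions for illustration and are not confirmed by that paper's summary.

```python
import numpy as np

def plcc_loss(pred, mos, eps=1e-8):
    """Correlation term: 1 - Pearson correlation between predictions and MOS."""
    p = pred - pred.mean()
    m = mos - mos.mean()
    denom = np.sqrt((p ** 2).sum() * (m ** 2).sum()) + eps
    return 1.0 - (p * m).sum() / denom

def combined_loss(pred, mos, weight=0.5):
    """Hypothetical combination: correlation term plus weighted mean squared error."""
    mse = np.mean((pred - mos) ** 2)
    return plcc_loss(pred, mos) + weight * mse

if __name__ == "__main__":
    # Toy values for illustration only, not measured data.
    mos = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
    pred = np.array([1.2, 1.9, 3.4, 3.8, 4.9])
    print(combined_loss(pred, mos))
```

The correlation term rewards predictions that rise and fall together with the subjective scores, while the error term keeps predictions on the same numerical scale as the scores.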

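The related papers list is generated automatically from the titles and abstracts of papers on this site.

This page contains information about the specified paper.
The operator of this site does not guarantee the quality of this site (including all information and translations) and accepts no responsibility for any outcome arising from the use of this site (including all information and translations).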