Fugu-MT 論文翻訳(概要): BVI-Mamba: Video Enhancement Using a Visual State-Space Model for Low-Light and Underwater Environments

論文の概要: BVI-Mamba: Video Enhancement Using a Visual State-Space Model for Low-Light and Underwater Environments

arxiv url: http://arxiv.org/abs/2604.23655v1
Date: Sun, 26 Apr 2026 11:03:26 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:07.484526
Title: BVI-Mamba: Video Enhancement Using a Visual State-Space Model for Low-Light and Underwater Environments
Title（参考訳）: BVI-Mamba:低照度・水中環境のための視覚状態空間モデルによる映像強調
Authors: Guoxi Huang, Ruirui Lin, Yini Li, David R. Bull, Nantheera Anantrasirichai,
Abstract要約: Visual Mambaは、Visual State Space (VSS)モデルを利用することで、メモリ使用量と計算時間を削減するように設計されている。 Visual Mambaのテクニックは、トランスフォーマーと畳み込みベースのモデルで、低照度と水中のビデオエンハンスメントタスクの両方で優れている。
参考スコア（独自算出の注目度）: 8.830666229299736
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Videos captured in low-light and underwater conditions often suffer from distortions such as noise, low contrast, color imbalance, and blur. These issues not only limit visibility but also degrade automatic tasks like detection. Post-processing is typically required but can be time-consuming. AI-based tools for video enhancement also demand significantly more computational resources compared to image-based methods. This paper introduces a novel framework, Visual Mamba, designed to reduce memory usage and computational time by leveraging the Visual State Space (VSS) model. The framework consists of two modules: (i) a feature alignment module, where spatio-temporal displacement between input frames is registered in the feature space, and (ii) an enhancement module, where noise removal and brightness adjustment are performed using a UNet-like architecture, with all convolutional layers replaced by VSS blocks. Experimental results show that the Visual Mamba technique outperforms Transformer and convolution-based models in both low-light and underwater video enhancement tasks. Code is available on line at https://github.com/russellllaputa/BVI-Mamba.
Abstract（参考訳）: 低照度や水中で撮影されたビデオは、ノイズ、低コントラスト、色の不均衡、ぼやけなどの歪みに悩まされることが多い。これらの問題は可視性を制限するだけでなく、検出などの自動タスクも低下させる。後処理は一般的に必要だが、時間を要することがある。ビデオエンハンスメントのためのAIベースのツールも、画像ベースの方法に比べてはるかに多くの計算リソースを必要としている。本稿では、Visual State Space(VSS)モデルを利用して、メモリ使用量と計算時間を短縮する新しいフレームワーク、Visual Mambaを紹介する。フレームワークは2つのモジュールで構成されています。一入力フレーム間の時空間変位が特徴空間に登録されている特徴アライメントモジュール (II)UNetのようなアーキテクチャを用いてノイズ除去と輝度調整を行う拡張モジュールで,すべての畳み込み層をVSSブロックに置き換える。実験の結果,視覚マンバ法は低照度・水中ビデオ強調作業においてトランスフォーマーと畳み込みに基づくモデルより優れていた。コードはhttps://github.com/russellllaputa/BVI-Mamba.comで公開されている。

論文の概要: BVI-Mamba: Video Enhancement Using a Visual State-Space Model for Low-Light and Underwater Environments

関連論文リスト