Fugu-MT 論文翻訳(概要): Enhancing Infrared Vision: Progressive Prompt Fusion Network and Benchmark

論文の概要: Enhancing Infrared Vision: Progressive Prompt Fusion Network and Benchmark

arxiv url: http://arxiv.org/abs/2510.09343v1
Date: Fri, 10 Oct 2025 12:55:54 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 00:38:48.997549
Title: Enhancing Infrared Vision: Progressive Prompt Fusion Network and Benchmark
Title（参考訳）: 赤外線ビジョンの強化 - プログレッシブ・プロンプト・フュージョン・ネットワークとベンチマーク
Authors: Jinyuan Liu, Zihang Chen, Zhu Liu, Zhiying Jiang, Long Ma, Xin Fan, Risheng Liu,
Abstract要約: 既存の赤外線画像強調法は、個々の劣化に対処することに焦点を当てている。一般的にRGBセンサーに適用されるオールインワンエンハンスメント法は、有効性が制限されることが多い。
参考スコア（独自算出の注目度）: 58.61079960074608
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We engage in the relatively underexplored task named thermal infrared image enhancement. Existing infrared image enhancement methods primarily focus on tackling individual degradations, such as noise, contrast, and blurring, making it difficult to handle coupled degradations. Meanwhile, all-in-one enhancement methods, commonly applied to RGB sensors, often demonstrate limited effectiveness due to the significant differences in imaging models. In sight of this, we first revisit the imaging mechanism and introduce a Progressive Prompt Fusion Network (PPFN). Specifically, the PPFN initially establishes prompt pairs based on the thermal imaging process. For each type of degradation, we fuse the corresponding prompt pairs to modulate the model's features, providing adaptive guidance that enables the model to better address specific degradations under single or multiple conditions. In addition, a Selective Progressive Training (SPT) mechanism is introduced to gradually refine the model's handling of composite cases to align the enhancement process, which not only allows the model to remove camera noise and retain key structural details, but also enhancing the overall contrast of the thermal image. Furthermore, we introduce the most high-quality, multi-scenarios infrared benchmark covering a wide range of scenarios. Extensive experiments substantiate that our approach not only delivers promising visual results under specific degradation but also significantly improves performance on complex degradation scenes, achieving a notable 8.76\% improvement. Code is available at https://github.com/Zihang-Chen/HM-TIR.
Abstract（参考訳）: 我々は熱赤外画像強調という比較的未探索の課題に従事している。既存の赤外線画像強調法は主にノイズ、コントラスト、ぼやけなどの個々の劣化に対処することに焦点を当てており、複合劣化の処理が困難である。一方、RGBセンサーに一般的に適用されるオールインワンエンハンスメント法は、画像モデルに顕著な違いがあるため、効果が限られていることが多い。そこで我々はまず,撮像機構を再考し,プログレッシブ・プロンプト・フュージョン・ネットワーク(PPFN)を導入する。具体的には、PPFNは、最初に熱イメージングプロセスに基づいてプロンプトペアを確立する。各種類の劣化に対して、対応するプロンプトペアを融合してモデルの特徴を変調し、モデルが単一または複数の条件下での特定の劣化によりよく対処できる適応的なガイダンスを提供する。さらに、SPT機構を導入し、合成ケースの扱いを徐々に洗練し、拡張プロセスを整列させ、モデルがカメラノイズを除去し、重要な構造的詳細を保持するだけでなく、熱画像の全体的なコントラストを高める。さらに,様々なシナリオをカバーする,高品質なマルチシナリオ赤外線ベンチマークについても紹介する。広汎な実験は、我々のアプローチが特定の劣化下で有望な視覚的結果をもたらすだけでなく、複雑な劣化シーンのパフォーマンスも著しく向上し、注目すべき8.766%の改善を実現していることを裏付けている。コードはhttps://github.com/Zihang-Chen/HM-TIRで入手できる。

論文の概要: Enhancing Infrared Vision: Progressive Prompt Fusion Network and Benchmark

関連論文リスト