Fugu-MT 論文翻訳(概要): ER-LoRA: Effective-Rank Guided Adaptation for Weather-Generalized Depth Estimation

論文の概要: ER-LoRA: Effective-Rank Guided Adaptation for Weather-Generalized Depth Estimation

arxiv url: http://arxiv.org/abs/2509.00665v1
Date: Sun, 31 Aug 2025 02:24:00 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 15:17:03.333624
Title: ER-LoRA: Effective-Rank Guided Adaptation for Weather-Generalized Depth Estimation
Title（参考訳）: ER-LoRA:気象一般化深度推定のための効果的なリンク誘導適応
Authors: Weilong Yan, Xin Zhang, Robby T. Tan,
Abstract要約: 本稿では,2種類の有効ランク(エントロピーランクと安定ランク)に基づいて,VFM(Vision Foundation Models)の事前学習重量を構造的に分解するSTM(Selecting-Tuning-Maintaining)戦略を紹介する。チューニングフェーズでは、エントロピーランクとフルチューニングウェイトに基づいて、適切なランク数と、初期化のためのタスク認識特異方向を適応的に選択する。この設計は、事前訓練されたVFMの強い一般化能力を保ちながら、柔軟なタスク適応を保証する。
参考スコア（独自算出の注目度）: 33.632587382356824
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Monocular depth estimation under adverse weather conditions (e.g.\ rain, fog, snow, and nighttime) remains highly challenging due to the lack of reliable ground truth and the difficulty of learning from unlabeled real-world data. Existing methods often rely on synthetic adverse data with pseudo-labels, which suffer from domain gaps, or employ self-supervised learning, which violates photometric assumptions in adverse scenarios. In this work, we propose to achieve weather--generalized depth estimation by Parameter--Efficient Fine--Tuning (PEFT) of Vision Foundation Models (VFMs), using only a small amount of high--visibility (normal) data. While PEFT has shown strong performance in semantic tasks such as segmentation, it remains underexplored for geometry--centric tasks like depth estimation -- especially in terms of balancing effective adaptation with the preservation of pretrained knowledge. To this end, we introduce the Selecting--Tuning--Maintaining (STM) strategy, which structurally decomposes the pretrained weights of VFMs based on two kinds of effective ranks (entropy--rank and stable--rank). In the tuning phase, we adaptively select the proper rank number as well as the task--aware singular directions for initialization, based on the entropy--rank and full--tuned weight; while in the maintaining stage, we enforce a principal direction regularization based on the stable--rank. This design guarantees flexible task adaptation while preserving the strong generalization capability of the pretrained VFM. Extensive experiments on four real--world benchmarks across diverse weather conditions demonstrate that STM not only outperforms existing PEFT methods and full fine--tuning but also surpasses methods trained with adverse synthetic data, and even the depth foundation model
Abstract（参考訳）: 悪天候条件(例:雨、霧、雪、夜間)下での単眼深度推定は、信頼できる地上の真実の欠如と、ラベルのない実世界のデータから学ぶことの難しさにより、非常に難しいままである。既存の手法は、ドメインギャップに苦しむ擬似ラベルによる合成不良データや、負のシナリオにおける測光的仮定に反する自己教師付き学習に頼っていることが多い。本研究では,視覚基礎モデル(VFM)のパラメータ-効率的な微細チューニング(PEFT)による気象一般化深度推定を,少量の高可視(正規)データのみを用いて行うことを提案する。 PEFTはセグメンテーションのようなセグメンテーションタスクにおいて強い性能を示してきたが、特に事前訓練された知識の保存と効果的な適応のバランスの点で、深さ推定のような幾何学中心のタスクについては、まだ探索されていない。そこで我々は,2種類の有効ランク(エントロピーランクと安定ランク)に基づいて,VFMの事前学習重量を構造的に分解するSelecting--Tuning-Maintaining(STM)戦略を導入する。チューニングフェーズでは、エントロピーランクとフルチューニングウェイトに基づいて、適切なランク数とタスク認識特異方向を適応的に選択し、維持段階においては、安定ランクに基づいて主方向正則化を実施する。この設計は、事前訓練されたVFMの強い一般化能力を保ちながら、柔軟なタスク適応を保証する。多様な気象条件をまたいだ4つの実世界のベンチマークによる大規模な実験により、STMは既存のPEFT法と完全な微調整法を上回るだけでなく、有害な合成データで訓練された手法、さらには深度基礎モデルよりも優れていることが示された。

論文の概要: ER-LoRA: Effective-Rank Guided Adaptation for Weather-Generalized Depth Estimation

関連論文リスト