Fugu-MT 論文翻訳(概要): WeatherReasonSeg: A Benchmark for Weather-Aware Reasoning Segmentation in Visual Language Models

論文の概要: WeatherReasonSeg: A Benchmark for Weather-Aware Reasoning Segmentation in Visual Language Models

arxiv url: http://arxiv.org/abs/2603.17680v1
Date: Wed, 18 Mar 2026 12:57:18 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-19 18:32:57.704664
Title: WeatherReasonSeg: A Benchmark for Weather-Aware Reasoning Segmentation in Visual Language Models
Title（参考訳）: WeatherReasonSeg:ビジュアル言語モデルにおける天気認識推論セグメンテーションのベンチマーク
Authors: Wanjun Du, Zifeng Yuan, Tingting Chen, Fucai Ke, Beibei Lin, Shunli Zhang,
Abstract要約: 本稿では,悪天候条件下での推論に基づくセグメンテーションにおけるVLM性能評価のためのベンチマークであるWeatherReasonSegを紹介する。まず,既存のセグメンテーションデータセットに重度レベルの異なる合成気象を適用して,制御可能な推論データセットを構築する。第二に、実世界の複雑さを捉えるために、意味的に一貫したクエリで現実世界の悪天候推論セグメンテーションデータセットをキュレートする。
参考スコア（独自算出の注目度）: 15.970876520993954
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Existing vision-language models (VLMs) have demonstrated impressive performance in reasoning-based segmentation. However, current benchmarks are primarily constructed from high-quality images captured under idealized conditions. This raises a critical question: when visual cues are severely degraded by adverse weather conditions such as rain, snow, or fog, can VLMs sustain reliable reasoning segmentation capabilities? In response to this challenge, we introduce WeatherReasonSeg, a benchmark designed to evaluate VLM performance in reasoning-based segmentation under adverse weather conditions. It consists of two complementary components. First, we construct a controllable reasoning dataset by applying synthetic weather with varying severity levels to existing segmentation datasets, enabling fine-grained robustness analysis. Second, to capture real-world complexity, we curate a real-world adverse-weather reasoning segmentation dataset with semantically consistent queries generated via mask-guided LLM prompting. We further broaden the evaluation scope across five reasoning dimensions, including functionality, application scenarios, structural attributes, interactions, and requirement matching. Extensive experiments across diverse VLMs reveal two key findings: (1) VLM performance degrades monotonically with increasing weather severity, and (2) different weather types induce distinct vulnerability patterns. We hope WeatherReasonSeg will serve as a foundation for advancing robust, weather-aware reasoning.
Abstract（参考訳）: 既存の視覚言語モデル (VLM) は推論に基づくセグメンテーションにおいて顕著な性能を示した。しかし、現在のベンチマークは主に理想化された条件下で撮影された高品質な画像から構築されている。雨、雪、霧などの悪天候によって視覚的手がかりがひどく劣化した場合、VLMは信頼できる推論セグメンテーション能力を維持することができるのか? この課題に対応するために、悪天候条件下での推論に基づくセグメンテーションにおいて、VLMの性能を評価するために設計されたベンチマークであるWeatherReasonSegを紹介する。 2つの相補的な構成要素から構成される。まず、既存のセグメンテーションデータセットに様々な重大度レベルの合成気象を適用して、制御可能な推論データセットを構築し、きめ細かいロバスト性解析を可能にする。第二に、現実世界の複雑さを捉えるために、マスク誘導LPMプロンプトによって意味的に一貫したクエリによって、現実世界の悪天候推論セグメンテーションデータセットをキュレートする。さらに、機能、アプリケーションシナリオ、構造的属性、インタラクション、要求マッチングを含む5つの推論領域で評価範囲を広げます。 1) VLMの性能は、天候の重大度の増加とともに単調に低下し、(2)異なる気象タイプは、異なる脆弱性パターンを誘発する。 WeatherReasonSegが、堅牢で気象に配慮した推論の基盤になることを期待している。

論文の概要: WeatherReasonSeg: A Benchmark for Weather-Aware Reasoning Segmentation in Visual Language Models

関連論文リスト