Fugu-MT 論文翻訳(概要): Bridging Geographic Bias in Urban Streetscape Inference via Lifelong Learning with Visual-Semantic Pivoting

論文の概要: Bridging Geographic Bias in Urban Streetscape Inference via Lifelong Learning with Visual-Semantic Pivoting

arxiv url: http://arxiv.org/abs/2606.15055v1
Date: Sat, 13 Jun 2026 01:52:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-16 16:21:32.749751
Title: Bridging Geographic Bias in Urban Streetscape Inference via Lifelong Learning with Visual-Semantic Pivoting
Title（参考訳）: ビジュアルセマンティックピボットを用いた生涯学習による都市景観推論における地理バイアスのブリッジ
Authors: Xinze Zhang,
Abstract要約: 都市景観の視覚的認識は、景観計画、公衆衛生、場所作りにおけるエビデンスに基づく決定の基盤となっている。しかし、いくつかのよく写真化されたメトロポリスで訓練されたモデルは、体系的に非表示の地区を誤認した。このギャップに対処するHVSP-LLは、ビジュアル・セマンティック・ピボット・モジュールとエクイティ・アウェア・リハーサル・メカニズムを結合した生涯学習フレームワークである。
参考スコア（独自算出の注目度）: 2.538209532048867
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Visual perception of urban streetscapes underpins evidence-based decisions in landscape planning, public health, and place-making. Yet models trained on a few well-photographed metropolises systematically misjudge underrepresented districts, propagating geographic bias into downstream policy. We address this gap with HVSP-LL, a lifelong learning framework that couples a stratified visual-semantic pivoting module with an equity-aware rehearsal mechanism. The pivoting module organises landscape concepts along a three-tier ontology (macro structure, meso composition, micro element) and aligns image features to learnable semantic anchors at each tier, providing transferable representations that resist distributional drift. The lifelong adaptation component sequentially absorbs new urban regions while constraining inter-region perception gaps through a worst-region sample-reweighting objective and a structurally-aware exemplar buffer. We evaluate HVSP-LL on a panoramic streetscape benchmark assembled from twelve cities across four continents and seven perceptual dimensions. The framework attains 0.834 Spearman correlation on the held-out city sequence, an absolute 6.1 point improvement over the strongest continual baseline, and shrinks the inter-city perception gap to 0.094 -- a 38% reduction relative to the strongest continual baseline (0.151) and a 57% reduction relative to a representative regularisation baseline (0.218). Ablations confirm that each tier of the pivoting hierarchy contributes monotonically, and the equity-aware rehearsal converts mean backward transfer from -0.038 (without retention) to +0.013, eliminating catastrophic forgetting on the held-out sequence. Our results indicate that hierarchical anchoring is a practical pathway toward geographically equitable streetscape inference at city scale.
Abstract（参考訳）: 都市景観の視覚的認識は、景観計画、公衆衛生、場所作りにおけるエビデンスに基づく決定の基盤となっている。しかし、いくつかのよく写真化されたメトロポリスで訓練されたモデルは、体系的に非表示の地区を誤認し、下流政策に地理的偏見を伝播させた。 HVSP-LLは、階層化された視覚的セマンティックなピボットモジュールと、エクイティを意識したリハーサル機構を結合した、生涯学習フレームワークである。ピボットモジュールは、3層オントロジー(マクロ構造、メソ組成、マイクロ要素)に沿ってランドスケープの概念を編成し、各階層における学習可能なセマンティックアンカーに画像特徴を整列させ、分散ドリフトに抵抗する伝達可能な表現を提供する。寿命適応成分は、最低領域サンプル再重み付け目標と構造的に認識可能な模範バッファとを介して、地域間知覚ギャップを拘束しながら、新しい都市領域を順次吸収する。我々は,4大陸12都市と7つの知覚次元からなるパノラマストリートスケープベンチマークを用いて,HVSP-LLを評価した。このフレームワークは、ホールドアウト都市シーケンス上のスピアマン相関値0.834、最強連続ベースラインに対する絶対6.1点改善値、都市間認識ギャップを0.094に縮小し、最強連続ベースライン(0.151)に対して38%、代表正規化ベースライン(0.218)に対して57%縮小する。校正階層の各階層は単調に寄与し、エクイティ・アウェアのリハーサルは−0.038から+0.013への後退平均を変換し、ホールドアウトシーケンスにおける破滅的な忘れをなくす。以上の結果から,階層的アンカーは都市規模での地理的に均等な街路景観推定への実践的経路であることが示唆された。

論文の概要: Bridging Geographic Bias in Urban Streetscape Inference via Lifelong Learning with Visual-Semantic Pivoting

関連論文リスト