Fugu-MT 論文翻訳(概要): Supercharging Thermal Gaussian Splatting with Depth Estimation

論文の概要: Supercharging Thermal Gaussian Splatting with Depth Estimation

arxiv url: http://arxiv.org/abs/2605.30328v1
Date: Thu, 28 May 2026 17:57:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-30 02:45:56.739079
Title: Supercharging Thermal Gaussian Splatting with Depth Estimation
Title（参考訳）: 深さ推定による過給熱ガウス平滑化
Authors: Manoj Biswanath, Chenxin Cai, Hannah Schieber, Daniel Roth, Benjamin Busam,
Abstract要約: 本研究では, 熱画像のみを用いた温度-深度ガウス散乱法(TDg)を提案する。平均的に、学習された知覚的イメージパッチ類似度(LPIPS)、構造的類似度指標(SSIM)、TDgのピーク信号対雑音比(PSNR)などのレンダリング品質指標は、ベースラインMSMG値よりも1.12%、0.034%、0.01%良い。
参考スコア（独自算出の注目度）: 17.284356190513005
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Efficient and robust 3D scene representation is crucial in autonomous driving, robotics, and related fields. While RGB images provide valuable content for 3D reconstruction, other modalities like thermal or depth can enable additional information on the environment. Lately, novel view synthesis methods like 3D Gaussian Splatting have started using multiple modalities to further boost their performance. But fusing or combining multimodal data can make the process slower and can bring in additional challenges. Therefore, our project aims to use single modality based on thermal infrared domain, by removing the reliance on visible light as much as possible. This single modality can be expected to be faster as it does not rely on multimodal data. We propose a method, Thermal-to-Depth Gaussian Splatting (TDg), that uses only thermal images and depth estimation in its architecture to derive the radiance fields. Our TDg method outperforms the MSMG (Multiple Single-Modal Gaussians) baseline in most cases on our test datasets, RGBT-Scenes and ThermalMix. On average, the rendering quality metrics such as learned perceptual image patch similarity (LPIPS), structural similarity index measure (SSIM), and peak signal-to-noise ratio (PSNR) of TDg are 1.12%, 0.034%, and 0.01% better than the baseline MSMG values. It also reduces the training time significantly, by 12 mins 47 secs (55% improvement). Overall, our method is successful in deriving these thermal radiance fields, which can ultimately have several applications, such as identifying heat sources critical in surveillance, search or rescue operations, and industrial inspections where temperature is widely used to monitor machines.
Abstract（参考訳）: 効率的で堅牢な3Dシーン表現は、自律運転、ロボット工学、および関連する分野において不可欠である。 RGB画像は3D再構成に有用なコンテンツを提供するが、熱や深度といった他のモダリティは環境に関する追加情報を可能にする。近年,3次元ガウス・スプレイティングのような新しいビュー合成手法が,その性能向上のために複数のモダリティの使用を開始している。しかし、マルチモーダルデータの融合や結合はプロセスを遅くし、さらなる課題をもたらす可能性がある。そこで本研究の目的は、可視光への依存を極力取り除き、熱赤外領域に基づく単一モードの利用である。この単一のモダリティは、マルチモーダルデータに依存しないため、より高速であることが期待できる。本研究では, 熱画像のみを用いた温度-深度ガウス散乱法(TDg)を提案する。我々のTDg法はMSMG(Multiple Single-Modal Gaussian)ベースラインよりも優れている。平均的に、学習された知覚的イメージパッチ類似度(LPIPS)、構造的類似度指標(SSIM)、TDgのピーク信号対雑音比(PSNR)などのレンダリング品質指標は、ベースラインMSMG値よりも1.12%、0.034%、0.01%良い。また、トレーニング時間を12分47秒(55%改善)で大幅に短縮する。本手法は, 監視, 探索, 救助活動に不可欠な熱源の同定, 機器の温度監視に広く用いられている産業検査など, 最終的にいくつかの応用が期待できる熱放射界の導出に成功している。

論文の概要: Supercharging Thermal Gaussian Splatting with Depth Estimation

関連論文リスト