Fugu-MT 論文翻訳(概要): Bin Latent Transformer (BiLT): A shift-invariant autoencoder for calibration-free spectral unmixing of turbid media

論文の概要: Bin Latent Transformer (BiLT): A shift-invariant autoencoder for calibration-free spectral unmixing of turbid media

arxiv url: http://arxiv.org/abs/2605.11829v1
Date: Tue, 12 May 2026 09:17:43 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-13 21:48:56.747478
Title: Bin Latent Transformer (BiLT): A shift-invariant autoencoder for calibration-free spectral unmixing of turbid media
Title（参考訳）: Bin Latent Transformer (BiLT): 振動媒体のキャリブレーションフリースペクトルアンミックスのためのシフト不変オートエンコーダ
Authors: Martin Hohmann,
Abstract要約: この研究は、高密度エンコーダをクロスアテンションスキャナーに置き換えるBin Latent Transformer (BiLT)-Autoencoderを導入している。物理制約付き線形デコーダは、強制吸収/散乱分離と3段階のカリキュラム拡張戦略により、アーキテクチャを完備する。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The accurate recovery of constituent-level optical properties from integrating sphere measurements is a central analytical challenge in pharmaceutical analysis, food science, and biomedical diagnostics. Neural network autoencoders can extract spectrally resolved absorption and scattering coefficients for each constituent without prior knowledge, but their fully connected encoders bind learned features to absolute wavelength indices, causing accuracy loss under spectrometer calibration drift or hardware exchange. This work introduces the Bin Latent Transformer (BiLT)-Autoencoder, in which the dense encoder is replaced by a cross-attention scanner: 16 learnable probe vectors query a convolutional feature map, aggregating morphological spectral information independently of absolute wavelength position. A physics-constrained linear decoder with enforced absorption/scattering separation and a three-phase curriculum augmentation strategy complete the architecture. On a liquid phantom benchmark (intralipid and two ink absorbers; 496 samples), the model achieves $R^2 = 0.979$ and $0.975$ for $μ_a(λ)$ and $μ_s'(λ)$, respectively, on held-out test spectra, maintaining $R^2 > 0.90$ for $μ_a$ and $R^2 \approx 0.99$ for $μ_s'$ across the full tested shift range of $\pm 10$ spectral bands. The model generalises to a simulated spectrometer with a broader instrument line shape (${\approx}24$nm FWHM) without retraining, retaining $R^2 \approx 0.96$ and $0.974$ for the two channels. Attention map analysis reveals a physically interpretable two-component probe strategy: sparse anchor probes at absorption-edge wavelengths combined with a diffuse, SNR-driven ensemble at the high-transmittance long-wavelength region, which recruits additional probes dynamically under noise to provide implicit spectral averaging.
Abstract（参考訳）: 球面測定の統合による構成レベルの光学特性の正確な回復は、医薬品分析、食品科学、生医学的診断における中心的な分析課題である。ニューラルネットワークオートエンコーダは、事前の知識なしに各構成成分のスペクトル分解吸収と散乱係数を抽出することができるが、完全に連結されたエンコーダは、学習した特徴を絶対波長の指標に結合させ、分光計のキャリブレーションドリフトやハードウェア交換による精度の低下を引き起こす。この研究は、高密度エンコーダをクロスアテンションスキャナーに置き換えるBin Latent Transformer (BiLT)-Autoencoderを導入し、16個の学習可能なプローブベクトルが畳み込み特徴マップをクエリし、絶対波長位置とは無関係に形態学的スペクトル情報を集約する。物理制約付き線形デコーダの吸収・散乱分離と3段階のカリキュラム拡張戦略によりアーキテクチャが完成する。 R^2 = 0.979$と$0.975$ for $μ_a(λ)$と$μ_s'(λ)$はそれぞれ、ホールドアウトテストスペクトル上で、$R^2 > 0.90$ for $μ_a$と$R^2 \approx 0.99$ for $μ_s'$の完全なテストシフト範囲で達成される。このモデルは、より広い計器線形状の模擬分光器({\approx}24$nm FWHM)に再トレーニングすることなく一般化し、R^2 \approx 0.96$と0.974$を2つのチャネルに保持する。アテンションマップ解析により、物理的に解釈可能な2成分プローブ戦略が明らかにされる: 吸収端波長でのスパースアンカープローブと拡散性、SNR駆動の高透過長波長領域でのアンサンブルは、暗黙のスペクトル平均化を提供するために、雑音下で動的に追加プローブを募集する。

論文の概要: Bin Latent Transformer (BiLT): A shift-invariant autoencoder for calibration-free spectral unmixing of turbid media

関連論文リスト