Fugu-MT 論文翻訳(概要): Phaedra: Learning High-Fidelity Discrete Tokenization for the Physical Science

論文の概要: Phaedra: Learning High-Fidelity Discrete Tokenization for the Physical Science

arxiv url: http://arxiv.org/abs/2602.03915v1
Date: Tue, 03 Feb 2026 17:12:57 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-05 19:45:11.225236
Title: Phaedra: Learning High-Fidelity Discrete Tokenization for the Physical Science
Title（参考訳）: Phaedra: 物理科学のための高忠実離散化学習
Authors: Levi Lingsch, Georgios Kissas, Johannes Jakubik, Siddhartha Mishra,
Abstract要約: 物理空間およびスペクトル空間におけるPDE特性の忠実度を測定するために,画像トークンの集合の精度について検討する。これらが細部と精密な大きさの両方を捉えるのに苦労しているという観察に基づいて、我々はPaedraを提案する。 Phaedraは、さまざまなPDEデータセットの再構成を一貫して改善することを示した。
参考スコア（独自算出の注目度）: 16.904886444697347
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Tokens are discrete representations that allow modern deep learning to scale by transforming high-dimensional data into sequences that can be efficiently learned, generated, and generalized to new tasks. These have become foundational for image and video generation and, more recently, physical simulation. As existing tokenizers are designed for the explicit requirements of realistic visual perception of images, it is necessary to ask whether these approaches are optimal for scientific images, which exhibit a large dynamic range and require token embeddings to retain physical and spectral properties. In this work, we investigate the accuracy of a suite of image tokenizers across a range of metrics designed to measure the fidelity of PDE properties in both physical and spectral space. Based on the observation that these struggle to capture both fine details and precise magnitudes, we propose Phaedra, inspired by classical shape-gain quantization and proper orthogonal decomposition. We demonstrate that Phaedra consistently improves reconstruction across a range of PDE datasets. Additionally, our results show strong out-of-distribution generalization capabilities to three tasks of increasing complexity, namely known PDEs with different conditions, unknown PDEs, and real-world Earth observation and weather data.
Abstract（参考訳）: トークンは、高次元データを効率よく学習し、生成し、新しいタスクに一般化できるシーケンスに変換することで、現代のディープラーニングのスケールを可能にする離散表現である。これらは画像とビデオ生成の基礎となり、最近では物理シミュレーションも行われている。既存のトークン化器は、画像のリアルな視覚的知覚の明確な要件のために設計されているため、これらのアプローチが、大きなダイナミックレンジを示し、物理的およびスペクトル特性を維持するためにトークン埋め込みを必要とする科学画像に最適かどうかを問う必要がある。本研究では,物理空間とスペクトル空間の両方におけるPDE特性の忠実度を測定するために,一連の画像トークン化器の精度について検討する。これらが細部と正確な大きさの両方を捉えるのに苦労しているという観察に基づいて、古典的な形状のゲイン量子化と適切な直交分解にインスパイアされたファエドラを提案する。 Phaedraは、さまざまなPDEデータセットの再構成を一貫して改善することを示した。さらに, この結果から, 複雑性を増大させる3つのタスク, 既知のPDE, 未知のPDE, 実世界の地球観測・気象データに対して, 分布外一般化の強い能力を示した。

論文の概要: Phaedra: Learning High-Fidelity Discrete Tokenization for the Physical Science

関連論文リスト