Fugu-MT 論文翻訳(概要): MEDAL: Manifold Embedding Distillation via Autoencoder Learning

論文の概要: MEDAL: Manifold Embedding Distillation via Autoencoder Learning

arxiv url: http://arxiv.org/abs/2605.24244v1
Date: Fri, 22 May 2026 21:45:12 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:17.771391
Title: MEDAL: Manifold Embedding Distillation via Autoencoder Learning
Title（参考訳）: MEDAL:オートエンコーダ学習による人工内包蒸留
Authors: Irene Chang, Tarek M. Zikry, Genevera I. Allen,
Abstract要約: 低次元埋め込みは高次元データの視覚的要約として広く使われ、下流の科学的発見を可能にする。しかし、t-SNE や UMAP などの一般的な非線形次元減少法は、視覚的魅力のみに基づいて選択されることが多い。我々は,再利用可能なエンコーダ・デコーダモデルに適合する多様体を蒸留する新しいフレームワーク MEDAL を開発した。
参考スコア（独自算出の注目度）: 5.046831208137848
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Low-dimensional embeddings are widely used as visual summaries of high-dimensional data and to enable downstream scientific discoveries. Yet, popular nonlinear dimension reduction methods, such as t-SNE and UMAP, are often selected based on visual appeal alone and without rigorous quantitative validation. A major reason is that manifold embeddings typically do not provide an out-of-sample map nor an inverse back to the original feature space; this makes held-out validation, the gold standard in supervised learning, all but impossible. To address these challenges, we develop a novel framework, MEDAL (Manifold Embedding Distillation via Autoencoder Learning), which distills a fitted manifold embedding into a reusable encoder--decoder model. MEDAL trains a constrained autoencoder whose bottleneck exactly matches any teacher embedding while the decoder reconstructs the original input; this yields an explicit map for new samples, an approximate inverse, and a pointwise reconstruction-based measure of distortion in the manifold space. This converts static manifold embeddings into models that can be evaluated on held-out data, enabling quantitative validation including comparing different dimension reduction methods as well as hyperparameter tuning. Across multiple benchmark and scientific case studies, we show that MEDAL enables held-out validation to determine optimal manifold embeddings and hyperparameters, reveals biologically coherent regions that are difficult to preserve in two dimensional embeddings, and detects distribution shift when new samples are mapped into a fixed reference manifold. MEDAL provides a general validation wrapper to any existing dimension reduction technique that will improve the rigor and
Abstract（参考訳）: 低次元埋め込みは高次元データの視覚的要約として広く使われ、下流の科学的発見を可能にする。しかし、t-SNE や UMAP などの一般的な非線形次元減少法は、視覚的魅力だけで、厳密な定量的検証を行なわずに選択されることが多い。主な理由は、多様体の埋め込みは通常、外接写像や元の特徴空間への逆戻りを提供しないからである。これらの課題に対処するため、我々は、再利用可能なエンコーダ-デコーダモデルに適合する多様体を蒸留する新しいフレームワーク、MEDAL(Manifold Embedding Distillation via Autoencoder Learning)を開発した。 MEDALは制約付きオートエンコーダを訓練し、ボトルネックは教師の埋め込みと正確に一致し、デコーダは元の入力を再構築する。これにより、静的な多様体の埋め込みをホールドアウトデータで評価できるモデルに変換し、異なる次元還元法の比較やハイパーパラメータチューニングを含む定量的検証を可能にする。複数のベンチマークおよび科学的ケーススタディにおいて、MEDALは、最適多様体の埋め込みとハイパーパラメータを決定するためのホールドアウト検証を可能にし、2次元の埋め込みでは保存が難しい生物学的に一貫性のある領域を明らかにし、新しいサンプルを固定基準多様体にマッピングした場合の分布変化を検出する。 MEDALは、リガーを改良する既存の次元縮小技術に対して、一般的な検証ラッパーを提供する。

論文の概要: MEDAL: Manifold Embedding Distillation via Autoencoder Learning

関連論文リスト