Fugu-MT 論文翻訳(概要): Encoder-Decoder Manifold Alignment for Idempotent Generation

論文の概要: Encoder-Decoder Manifold Alignment for Idempotent Generation

arxiv url: http://arxiv.org/abs/2606.22304v1
Date: Sun, 21 Jun 2026 02:14:37 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-25 19:18:30.6033
Title: Encoder-Decoder Manifold Alignment for Idempotent Generation
Title（参考訳）: エンコーダ・デコーダ・マニフォールド・アライメント
Authors: Dareen Alharthi, Abdul Waheed, Bhiksha Raj,
Abstract要約: この失敗の主な理由は、エンコーダとデコーダによって学習された多様体間の幾何学的ミスマッチである。我々は、エンコーダとデコーダに同じ基礎となるデータ多様体の一貫性のある表現を学習させ、このギャップを明示的に埋める新しいトレーニングフレームワークを提案する。
参考スコア（独自算出の注目度）: 51.959208893243776
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recently, several learning paradigms have been introduced to enforce idempotency in generative models. The goal is to ensure that repeated application of a model leaves samples unchanged once they lie on the target data manifold. In practice, however, many of these approaches fail to achieve exact fixed points, leading to instability and drift under repeated application. In this work, we argue that a key reason for this failure is a geometric mismatch between the manifolds learned by the encoder and decoder. The encoder projects inputs onto one latent manifold, while the decoder implicitly learns to reconstruct data from a different manifold. This discrepancy prevents the model from learning truly idempotent mappings. To address this issue, we propose a new training framework that explicitly closes this gap by forcing the encoder and decoder to learn consistent representations of the same underlying data manifold. By aligning the geometry of these components, our method encourages stable projections. Empirically, we show that our approach achieves significantly lower idempotency error and consistently regenerates identical outputs under repeated application, compared to existing methods. We demonstrate the effectiveness of the proposed framework on both image generation and image editing tasks. Finally, we show that enforcing idempotency in this manner improves identity preservation and information stability, leading to more realistic and controllable generative editing models.
Abstract（参考訳）: 近年, 生成モデルにおいて, イデオポトシーを強制する学習パラダイムがいくつか導入されている。目標は、モデルの繰り返し適用が、ターゲットデータ多様体上に置かれると、サンプルが変更されないようにすることです。しかし、実際には、これらのアプローチの多くは正確な固定点を達成することができず、繰り返し適用された場合の不安定性とドリフトにつながる。この研究において、この失敗の鍵となる理由は、エンコーダとデコーダによって学習された多様体間の幾何学的ミスマッチであると主張する。エンコーダは1つの潜在多様体に入力を投影し、デコーダは異なる多様体からデータを暗黙的に再構成することを学習する。この不一致は、モデルが真の等等写像を学習するのを防ぐ。この問題に対処するために、エンコーダとデコーダに同じ基礎となるデータ多様体の一貫性のある表現を学習させ、このギャップを明示的に埋める新しいトレーニングフレームワークを提案する。これらの成分の幾何を整列させることで、安定な射影を奨励する。実験により,本手法は既往の手法と比較して,Idempotencyエラーを著しく低減し,同一の出力を連続的に再現することを示す。本稿では,画像生成作業と画像編集作業におけるフレームワークの有効性を示す。最後に、この方法でのイデオロシティの強制は、アイデンティティの保存と情報の安定性を改善し、より現実的で制御可能な生成編集モデルをもたらすことを示す。

論文の概要: Encoder-Decoder Manifold Alignment for Idempotent Generation

関連論文リスト