Diffusion & Adversarial Schrödinger Bridges via Iterative Proportional Markovian Fitting
- URL: http://arxiv.org/abs/2410.02601v3
- Date: Fri, 20 Jun 2025 15:25:47 GMT
- Title: Diffusion & Adversarial Schrödinger Bridges via Iterative Proportional Markovian Fitting
- Authors: Sergei Kholkin, Grigoriy Ksenofontov, David Li, Nikita Kornilov, Nikita Gushchin, Alexandra Suvorikova, Alexey Kroshnin, Evgeny Burnaev, Alexander Korotin,
- Abstract summary: Iterative Markovian Fitting (IMF) procedure successfully solves the Schr"odinger Bridge (SB) problem.<n>We show a close connection between IMF and the Iterative Proportional Fitting (IPF) procedure.<n>We refer to this combined approach as the Iterative Proportional Markovian Fitting (IPMF) procedure.
- Score: 87.37278888311839
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The Iterative Markovian Fitting (IMF) procedure, which iteratively projects onto the space of Markov processes and the reciprocal class, successfully solves the Schr\"odinger Bridge (SB) problem. However, an efficient practical implementation requires a heuristic modification - alternating between fitting forward and backward time diffusion at each iteration. This modification is crucial for stabilizing training and achieving reliable results in applications such as unpaired domain translation. Our work reveals a close connection between the modified version of IMF and the Iterative Proportional Fitting (IPF) procedure - a foundational method for the SB problem, also known as Sinkhorn's algorithm. Specifically, we demonstrate that the heuristic modification of the IMF effectively integrates both IMF and IPF procedures. We refer to this combined approach as the Iterative Proportional Markovian Fitting (IPMF) procedure. Through theoretical and empirical analysis, we establish the convergence of IPMF procedure under various settings, contributing to developing a unified framework for solving SB problems. Moreover, from a practical standpoint, the IPMF procedure enables a flexible trade-off between image similarity and generation quality, offering a new mechanism for tailoring models to specific tasks.
Related papers
- Learning to Fuse: Modality-Aware Adaptive Scheduling for Robust Multimodal Foundation Models [0.0]
Modality-Aware Adaptive Fusion Scheduling (MA-AFS) learns to dynamically modulate the contribution of each modality on a per-instance basis.<n>Our work highlights the importance of adaptive fusion and opens a promising direction toward reliable and uncertainty-aware multimodal learning.
arXiv Detail & Related papers (2025-06-15T05:57:45Z) - Locality Preserving Markovian Transition for Instance Retrieval [59.16243976912006]
We introduce the Locality Preserving Markovian Transition (LPMT) framework, which employs a long-term thermodynamic transition process with multiple states for accurate manifold distance measurement.<n>The proposed LPMT first integrates diffusion processes across separate graphs using Bidirectional Collaborative Diffusion (BCD) to establish strong similarity relationships.<n>After, Locality State Embedding (LSE) encodes each instance into a distribution for enhanced local consistency.<n>These distributions are interconnected via the Thermodynamic Markovian Transition (TMT) process, enabling efficient global retrieval while maintaining local effectiveness.
arXiv Detail & Related papers (2025-06-05T16:07:31Z) - Federated Smoothing ADMM for Localization [9.25126455172971]
Federated systems are characterized by distributed data, non-smoothity, and non-smoothness.
We propose a robust algorithm to tackle the scalability and outlier issues inherent in such environments.
To validate the reliability of the proposed algorithm, we show that it converges to a stationary point.
numerical simulations highlight its superior performance in convergence resilience compared to existing state-of-the-art localization methods.
arXiv Detail & Related papers (2025-03-12T16:01:34Z) - Modality Agnostic Heterogeneous Face Recognition with Switch Style Modulators [4.910937238451485]
We introduce a novel framework designed to train a modality-agnostic HFR method capable of handling multiple modalities during inference.
We achieve this by implementing a computationally efficient automatic routing mechanism called Switch Style Modulation Blocks (SSMB)
Our proposed SSMB can be trained end-to-end and seamlessly integrated into pre-trained face recognition models, transforming them into modality-agnostic HFR models.
arXiv Detail & Related papers (2024-07-11T16:21:48Z) - Adversarial Schrödinger Bridge Matching [66.39774923893103]
Iterative Markovian Fitting (IMF) procedure alternates between Markovian and reciprocal projections of continuous-time processes.
We propose a novel Discrete-time IMF (D-IMF) procedure in which learning of processes is replaced by learning just a few transition probabilities in discrete time.
We show that our D-IMF procedure can provide the same quality of unpaired domain translation as the IMF, using only several generation steps instead of hundreds.
arXiv Detail & Related papers (2024-05-23T11:29:33Z) - Light and Optimal Schrödinger Bridge Matching [67.93806073192938]
We propose a novel procedure to learn Schr"odinger Bridges (SB) which we call the textbf Schr"odinger bridge matching.
We show that the optimal bridge matching objective coincides with the recently discovered energy-based modeling (EBM) objectives to learn EOT/SB.
We develop a light solver (which we call LightSB-M) to implement optimal matching in practice using the mixture parameterization of the Schr"odinger potential.
arXiv Detail & Related papers (2024-02-05T17:17:57Z) - Transport meets Variational Inference: Controlled Monte Carlo Diffusions [5.5654189024307685]
We present a principled and systematic framework for sampling and generative modelling centred around divergences on path space.
Our work culminates in the development of the emphControlled Monte Carlo Diffusion sampler (CMCD) for Bayesian computation.
arXiv Detail & Related papers (2023-07-03T14:28:36Z) - Faster Video Moment Retrieval with Point-Level Supervision [70.51822333023145]
Video Moment Retrieval (VMR) aims at retrieving the most relevant events from an untrimmed video with natural language queries.
Existing VMR methods suffer from two defects: massive expensive temporal annotations and complicated cross-modal interaction modules.
We propose a novel method termed Cheaper and Faster Moment Retrieval (CFMR)
arXiv Detail & Related papers (2023-05-23T12:53:50Z) - Diffusion Bridge Mixture Transports, Schr\"odinger Bridge Problems and
Generative Modeling [4.831663144935879]
We propose a novel sampling-based iterative algorithm, the iterated diffusion bridge mixture (IDBM) procedure, aimed at solving the dynamic Schr"odinger bridge problem.
The IDBM procedure exhibits the attractive property of realizing a valid transport between the target probability measures at each iteration.
arXiv Detail & Related papers (2023-04-03T12:13:42Z) - Relational Reasoning via Set Transformers: Provable Efficiency and
Applications to MARL [154.13105285663656]
A cooperative Multi-A gent R einforcement Learning (MARL) with permutation invariant agents framework has achieved tremendous empirical successes in real-world applications.
Unfortunately, the theoretical understanding of this MARL problem is lacking due to the curse of many agents and the limited exploration of the relational reasoning in existing works.
We prove that the suboptimality gaps of the model-free and model-based algorithms are independent of and logarithmic in the number of agents respectively, which mitigates the curse of many agents.
arXiv Detail & Related papers (2022-09-20T16:42:59Z) - Robust Phi-Divergence MDPs [13.555107578858307]
We develop a novel solution framework for robust MDPs with s-rectangular ambiguity sets.
We show that the associated s-rectangular robust MDPs can be solved substantially faster than with state-of-the-art commercial solvers.
arXiv Detail & Related papers (2022-05-27T19:08:55Z) - A unified algorithm framework for mean-variance optimization in
discounted Markov decision processes [7.510742715895749]
This paper studies the risk-averse mean-variance optimization in infinite-horizon discounted Markov decision processes (MDPs)
We introduce a pseudo mean to transform the untreatable MDP to a standard one with a redefined reward function in standard form.
We propose a unified algorithm framework with a bilevel optimization structure for the discounted mean-variance optimization.
arXiv Detail & Related papers (2022-01-15T02:19:56Z) - Permutation Invariant Policy Optimization for Mean-Field Multi-Agent
Reinforcement Learning: A Principled Approach [128.62787284435007]
We propose the mean-field proximal policy optimization (MF-PPO) algorithm, at the core of which is a permutation-invariant actor-critic neural architecture.
We prove that MF-PPO attains the globally optimal policy at a sublinear rate of convergence.
In particular, we show that the inductive bias introduced by the permutation-invariant neural architecture enables MF-PPO to outperform existing competitors.
arXiv Detail & Related papers (2021-05-18T04:35:41Z) - Deep Multimodal Fusion by Channel Exchanging [87.40768169300898]
This paper proposes a parameter-free multimodal fusion framework that dynamically exchanges channels between sub-networks of different modalities.
The validity of such exchanging process is also guaranteed by sharing convolutional filters yet keeping separate BN layers across modalities, which, as an add-on benefit, allows our multimodal architecture to be almost as compact as a unimodal network.
arXiv Detail & Related papers (2020-11-10T09:53:20Z) - Gaussian MRF Covariance Modeling for Efficient Black-Box Adversarial
Attacks [86.88061841975482]
We study the problem of generating adversarial examples in a black-box setting, where we only have access to a zeroth order oracle.
We use this setting to find fast one-step adversarial attacks, akin to a black-box version of the Fast Gradient Sign Method(FGSM)
We show that the method uses fewer queries and achieves higher attack success rates than the current state of the art.
arXiv Detail & Related papers (2020-10-08T18:36:51Z) - A Hybrid Evolutionary Algorithm for Reliable Facility Location Problem [10.668347198815438]
The reliable facility location problem (RFLP) plays a vital role in the decision-making and management of modern supply chain and logistics.
We propose a novel model for the RFLP. Instead of assuming allocating a fixed number of facilities to each customer as in the existing works, we set the number of allocated facilities as an independent variable.
To handle it, we propose EAMLS, a hybrid evolutionary algorithm, which combines a memorable local search (MLS) method and an evolutionary algorithm (EA)
arXiv Detail & Related papers (2020-06-27T11:31:55Z) - Understanding Nesterov's Acceleration via Proximal Point Method [52.99237600875452]
The proximal point method (PPM) is often used as a building block for designing optimization algorithms.
In this work, we use the PPM method to provide conceptually simple derivations along with convergence analyses of different versions of Nesterov's accelerated gradient method (AGM)
arXiv Detail & Related papers (2020-05-17T17:17:18Z) - State-only Imitation with Transition Dynamics Mismatch [16.934888672659824]
Imitation Learning (IL) is a popular paradigm for training agents to achieve complicated goals by leveraging expert behavior.
We present a new state-only IL algorithm in this paper.
We show that our algorithm is particularly effective when there is a transition dynamics mismatch between the expert and imitator MDPs.
arXiv Detail & Related papers (2020-02-27T02:27:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.