Land-then-transport: A Flow Matching-Based Generative Decoder for Wireless Image Transmission
- URL: http://arxiv.org/abs/2601.07512v1
- Date: Mon, 12 Jan 2026 13:09:37 GMT
- Title: Land-then-transport: A Flow Matching-Based Generative Decoder for Wireless Image Transmission
- Authors: Jingwen Fu, Ming Xiao, Mikael Skoglund, Dong In Kim
- Abstract summary: We propose a flow-matching generative decoder for low-latency decoding. Experiments show consistent gains over JPEG2000+LDPC, DeepJSCC, and diffusion-based baselines. LTT provides a deterministic, physically interpretable, and efficient framework for generative wireless image decoding.
- Score: 38.71668959954467
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Due to strict rate and reliability demands, wireless image transmission remains difficult for both classical layered designs and joint source-channel coding (JSCC), especially under low latency. Diffusion-based generative decoders can deliver strong perceptual quality by leveraging learned image priors, but iterative stochastic denoising leads to high decoding delay. To enable low-latency decoding, we propose a flow-matching (FM) generative decoder under a new land-then-transport (LTT) paradigm that tightly integrates the physical wireless channel into a continuous-time probability flow. For AWGN channels, we build a Gaussian smoothing path whose noise schedule indexes effective noise levels, and derive a closed-form teacher velocity field along this path. A neural-network student vector field is trained by conditional flow matching, yielding a deterministic, channel-aware ODE decoder with complexity linear in the number of ODE steps. At inference, it only needs an estimate of the effective noise variance to set the ODE starting time. We further show that Rayleigh fading and MIMO channels can be mapped, via linear MMSE equalization and singular-value-domain processing, to AWGN-equivalent channels with calibrated starting times. Therefore, the same probability path and trained velocity field can be reused for Rayleigh and MIMO without retraining. Experiments on MNIST, Fashion-MNIST, and DIV2K over AWGN, Rayleigh, and MIMO demonstrate consistent gains over JPEG2000+LDPC, DeepJSCC, and diffusion-based baselines, while achieving good perceptual quality with only a few ODE steps. Overall, LTT provides a deterministic, physically interpretable, and computation-efficient framework for generative wireless image decoding across diverse channels.
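The "land, then transport" idea in the abstract can be illustrated with a toy sketch: estimate the effective noise level, map it to a starting time on a Gaussian smoothing path, then integrate a deterministic ODE back to time zero. Everything specific below is an assumption made for illustration and is not taken from the paper: the linear schedule `sigma(t) = SIGMA_MAX * t`, the scalar Gaussian prior, the fixed-step Euler solver, and the names `landing_time`, `velocity`, and `decode`. The closed-form `velocity` stands in for a trained network.

```python
import numpy as np

SIGMA_MAX = 2.0  # assumed: largest noise std covered by the schedule
PRIOR_STD = 1.0  # assumed toy prior: each pixel x0 ~ N(0, PRIOR_STD^2)


def sigma(t):
    """Assumed linear noise schedule on t in [0, 1]."""
    return SIGMA_MAX * t


def landing_time(noise_std):
    """'Land': pick t_start so that sigma(t_start) matches the noise std."""
    return float(np.clip(noise_std / SIGMA_MAX, 0.0, 1.0))


def velocity(x, t):
    """Velocity of the path x_t = x0 + sigma(t) * eps under the toy prior.

    In general v(x, t) = sigma'(t) / sigma(t) * (x - E[x0 | x_t = x]).
    For the Gaussian prior this simplifies to the expression below,
    which is also well defined at t = 0.
    """
    s = sigma(t)
    return SIGMA_MAX * s / (PRIOR_STD**2 + s**2) * x


def decode(y, noise_std, num_steps=8):
    """'Transport': Euler-integrate dx/dt = v(x, t) from t_start down to 0."""
    t_start = landing_time(noise_std)
    ts = np.linspace(t_start, 0.0, num_steps + 1)
    x = np.array(y, dtype=float)
    for t_hi, t_lo in zip(ts[:-1], ts[1:]):
        x = x + (t_lo - t_hi) * velocity(x, t_hi)  # step is negative in time
    return x


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    noise_std = 0.5
    x0 = rng.normal(0.0, PRIOR_STD, size=5)
    y = x0 + rng.normal(0.0, noise_std, size=5)  # AWGN channel output
    print("received:", y)
    print("decoded :", decode(y, noise_std, num_steps=8))
```

In this toy setting the exact ODE solution is a rescaling of the received signal by `PRIOR_STD / sqrt(PRIOR_STD**2 + noise_std**2)`, so the Euler output can be checked against it. The cost grows linearly with `num_steps`, matching the abstract's statement about complexity in the number of ODE steps.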
Related papers
- Context Video Semantic Transmission with Variable Length and Rate Coding over MIMO Channels [49.624608869195065]
We propose the context video semantic transmission (CVST) framework for wireless video transmission. We learn a context-channel correlation map to explicitly formulate the relationships between feature groups and multiple input multiple output (MIMO) subchannels. We demonstrate substantial performance gains over various standardized separated coding methods and recent wireless video semantic communication approaches.
arXiv Detail & Related papers (2025-12-23T10:48:43Z)
- Consistency Flow Model Achieves One-step Denoising Error Correction Codes [28.89866643527586]
We introduce the Error Correction Consistency Flow Model (ECCFM) for high-fidelity one-step decoding. ECCFM attains lower bit-error rates (BER) than autoregressive and diffusion-based baselines. It delivers inference 30x to 100x faster than denoising diffusion decoders.
arXiv Detail & Related papers (2025-12-01T08:07:51Z)
- Unsupervised Radio Map Construction in Mixed LoS/NLoS Indoor Environments [34.91945910235526]
This paper aims to recover the data collection trajectory directly from the channel propagation sequence. The proposed method achieves an average localization accuracy of 0.65 meters in an indoor environment.
arXiv Detail & Related papers (2025-10-09T09:53:24Z)
- SING: Semantic Image Communications using Null-Space and INN-Guided Diffusion Models [52.40011613324083]
Joint source-channel coding systems (DeepJSCC) have recently demonstrated remarkable performance in wireless image transmission. Existing methods focus on minimizing distortion between the transmitted image and the reconstructed version at the receiver, often overlooking perceptual quality. We propose SING, a novel framework that formulates the recovery of high-quality images from corrupted reconstructions as an inverse problem.
arXiv Detail & Related papers (2025-03-16T12:32:11Z)
- Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion [56.38386580040991]
Consistency Trajectory Model (CTM) is a generalization of Consistency Models (CM).
CTM enables the efficient combination of adversarial training and denoising score matching loss to enhance performance.
Unlike CM, CTM's access to the score function can streamline the adoption of established controllable/conditional generation methods.
arXiv Detail & Related papers (2023-10-01T05:07:17Z)
- Channelformer: Attention based Neural Solution for Wireless Channel Estimation and Effective Online Training [1.0499453838486013]
We propose an encoder-decoder neural architecture (called Channelformer) to achieve improved channel estimation.
We employ multi-head attention in the encoder and a residual convolutional neural architecture as the decoder.
We also propose an effective online training method based on the fifth generation (5G) new radio (NR) configuration for modern communication systems.
arXiv Detail & Related papers (2023-02-08T23:18:23Z)
- Non-Coherent Over-the-Air Decentralized Gradient Descent [0.0]
Implementing Decentralized Gradient Descent in wireless systems is challenging due to noise, fading, and limited bandwidth.
This paper introduces a scalable DGD algorithm that eliminates the need for scheduling, topology information, or CSI.
arXiv Detail & Related papers (2022-11-19T19:15:34Z)
- Denoising Diffusion Error Correction Codes [92.10654749898927]
Recently, neural decoders have demonstrated their advantage over classical decoding techniques.
Recent state-of-the-art neural decoders suffer from high complexity and lack the important iterative scheme characteristic of many legacy decoders.
We propose to employ denoising diffusion models for the soft decoding of linear codes at arbitrary block lengths.
arXiv Detail & Related papers (2022-09-16T11:00:50Z)
- Two-Timescale End-to-End Learning for Channel Acquisition and Hybrid Precoding [94.40747235081466]
We propose an end-to-end deep learning-based joint transceiver design algorithm for millimeter wave (mmWave) massive multiple-input multiple-output (MIMO) systems.
We develop a DNN architecture that maps the received pilots into feedback bits at the receiver, and then further maps the feedback bits into the hybrid precoder at the transmitter.
arXiv Detail & Related papers (2021-10-22T20:49:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.