Robust Image Semantic Coding with Learnable CSI Fusion Masking over MIMO Fading Channels
- URL: http://arxiv.org/abs/2406.07389v1
- Date: Thu, 30 May 2024 08:23:17 GMT
- Title: Robust Image Semantic Coding with Learnable CSI Fusion Masking over MIMO Fading Channels
- Authors: Bingyan Xie, Yongpeng Wu, Yuxuan Shi, Wenjun Zhang, Shuguang Cui, Merouane Debbah,
- Abstract summary: Existing semantic communication frameworks mainly consider single-input single-output Gaussian channels or Rayleigh fading channels.
We propose a learnable CSI fusion semantic communication framework, where CSI is treated as side information by the semantic extractor.
Experiment results testify the superiority of LCFSC over traditional schemes and state-of-the-art Swin Transformer-based semantic communication frameworks.
- Score: 44.15994739018646
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Though achieving marvelous progress in various scenarios, existing semantic communication frameworks mainly consider single-input single-output Gaussian channels or Rayleigh fading channels, neglecting the widely-used multiple-input multiple-output (MIMO) channels, which hinders the application into practical systems. One common solution to combat MIMO fading is to utilize feedback MIMO channel state information (CSI). In this paper, we incorporate MIMO CSI into system designs from a new perspective and propose the learnable CSI fusion semantic communication (LCFSC) framework, where CSI is treated as side information by the semantic extractor to enhance the semantic coding. To avoid feature fusion due to abrupt combination of CSI with features, we present a non-invasive CSI fusion multi-head attention module inside the Swin Transformer. With the learned attention masking map determined by both source and channel states, more robust attention distribution could be generated. Furthermore, the percentage of mask elements could be flexibly adjusted by the learnable mask ratio, which is produced based on the conditional variational interference in an unsupervised manner. In this way, CSI-aware semantic coding is achieved through learnable CSI fusion masking. Experiment results testify the superiority of LCFSC over traditional schemes and state-of-the-art Swin Transformer-based semantic communication frameworks in MIMO fading channels.
Related papers
- Rethinking Transformer-Based Blind-Spot Network for Self-Supervised Image Denoising [94.09442506816724]
Blind-spot networks (BSN) have been prevalent neural architectures in self-supervised image denoising (SSID)
We build a Transformer-based Blind-Spot Network (TBSN) which shows strong local fitting and global perspective abilities.
arXiv Detail & Related papers (2024-04-11T15:39:10Z) - A Low-Overhead Incorporation-Extrapolation based Few-Shot CSI Feedback Framework for Massive MIMO Systems [45.22132581755417]
Accurate channel state information (CSI) is essential for downlink precoding in frequency division duplexing (FDD) massive multiple-input multiple-output (MIMO) systems.
However, obtaining CSI through feedback from the user equipment (UE) becomes challenging with the increasing scale of antennas and subcarriers.
Deep learning-based methods have emerged for compressing CSI but these methods require substantial collected samples.
Existing deep learning methods also suffer from dramatically growing feedback overhead owing to their focus on full-dimensional CSI feedback.
We propose a low-overhead-Extrapolation based Few-Shot CSI
arXiv Detail & Related papers (2023-12-07T06:01:47Z) - Joint Channel Estimation and Feedback with Masked Token Transformers in
Massive MIMO Systems [74.52117784544758]
This paper proposes an encoder-decoder based network that unveils the intrinsic frequency-domain correlation within the CSI matrix.
The entire encoder-decoder network is utilized for channel compression.
Our method outperforms state-of-the-art channel estimation and feedback techniques in joint tasks.
arXiv Detail & Related papers (2023-06-08T06:15:17Z) - Co-attention Propagation Network for Zero-Shot Video Object Segmentation [91.71692262860323]
Zero-shot object segmentation (ZS-VOS) aims to segment objects in a video sequence without prior knowledge of these objects.
Existing ZS-VOS methods often struggle to distinguish between foreground and background or to keep track of the foreground in complex scenarios.
We propose an encoder-decoder-based hierarchical co-attention propagation network (HCPN) capable of tracking and segmenting objects.
arXiv Detail & Related papers (2023-04-08T04:45:48Z) - Overview of Deep Learning-based CSI Feedback in Massive MIMO Systems [77.0986534024972]
Deep learning (DL)-based CSI feedback refers to CSI compression and reconstruction by a DL-based autoencoder and can greatly reduce feedback overhead.
The focus is on novel neural network architectures and utilization of communication expert knowledge to improve CSI feedback accuracy.
arXiv Detail & Related papers (2022-06-29T03:28:57Z) - PolarDenseNet: A Deep Learning Model for CSI Feedback in MIMO Systems [18.646674391114548]
We propose an AI-based CSI feedback based on an auto-encoder architecture that encodes the CSI at UE into a low-dimensional latent space and decodes it back at the base station.
Our simulation results show that the AI-based proposed architecture outperforms the state-of-the-art high-resolution linear combination codebook.
arXiv Detail & Related papers (2022-02-02T19:04:49Z) - Recursive CSI Quantization of Time-Correlated MIMO Channels by Deep
Learning Classification [12.670741983987943]
In frequency division duplex (FDD) multiple-input multiple-output (MIMO) wireless communications, limited channel state information (CSI) feedback is a central tool.
To achieve a given CSI quality, the CSI quantization codebook size has to grow exponentially with the number of antennas.
arXiv Detail & Related papers (2020-09-28T18:19:02Z) - Distributed Deep Convolutional Compression for Massive MIMO CSI Feedback [9.959844922120524]
Massive multiple-input multiple-output (MIMO) systems require downlink channel state information (CSI) at the base station (BS) to achieve spatial diversity and multiplexing gains.
In this paper, we propose a deep learning (DL)-based CSI compression scheme, called DeepCMC, composed of convolutional layers followed by quantization and entropy coding blocks.
DeepCMC is trained to minimize a weighted rate-distortion cost, which enables a trade-off between the CSI quality and its feedback overhead.
arXiv Detail & Related papers (2020-03-07T12:33:31Z) - DeepSIC: Deep Soft Interference Cancellation for Multiuser MIMO
Detection [98.43451011898212]
In multiuser multiple-input multiple-output (MIMO) setups, where multiple symbols are simultaneously transmitted, accurate symbol detection is challenging.
We propose a data-driven implementation of the iterative soft interference cancellation (SIC) algorithm which we refer to as DeepSIC.
DeepSIC learns to carry out joint detection from a limited set of training samples without requiring the channel to be linear.
arXiv Detail & Related papers (2020-02-08T18:31:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.