Wireless Transmission of Images With The Assistance of Multi-level
Semantic Information
- URL: http://arxiv.org/abs/2202.04754v2
- Date: Fri, 8 Dec 2023 14:16:43 GMT
- Title: Wireless Transmission of Images With The Assistance of Multi-level
Semantic Information
- Authors: Zhenguo Zhang, Qianqian Yang, Shibo He, Mingyang Sun, Jiming Chen
- Abstract summary: MLSC-image is a multi-level semantic aware communication system for wireless image transmission.
We employ a pretrained image caption to capture the text semantics and a pretrained image segmentation model to obtain the segmentation semantics.
The numerical results validate the effectiveness and efficiency of the proposed semantic communication system.
- Score: 16.640928669609934
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Semantic-oriented communication has been considered as a promising to boost
the bandwidth efficiency by only transmitting the semantics of the data. In
this paper, we propose a multi-level semantic aware communication system for
wireless image transmission, named MLSC-image, which is based on the deep
learning techniques and trained in an end to end manner. In particular, the
proposed model includes a multilevel semantic feature extractor, that extracts
both the highlevel semantic information, such as the text semantics and the
segmentation semantics, and the low-level semantic information, such as local
spatial details of the images. We employ a pretrained image caption to capture
the text semantics and a pretrained image segmentation model to obtain the
segmentation semantics. These high-level and low-level semantic features are
then combined and encoded by a joint semantic and channel encoder into symbols
to transmit over the physical channel. The numerical results validate the
effectiveness and efficiency of the proposed semantic communication system,
especially under the limited bandwidth condition, which indicates the
advantages of the high-level semantics in the compression of images.
Related papers
- Trustworthy Image Semantic Communication with GenAI: Explainablity, Controllability, and Efficiency [59.15544887307901]
Image semantic communication (ISC) has garnered significant attention for its potential to achieve high efficiency in visual content transmission.
Existing ISC systems based on joint source-channel coding face challenges in interpretability, operability, and compatibility.
We propose a novel trustworthy ISC framework that employs Generative Artificial Intelligence (GenAI) for multiple downstream inference tasks.
arXiv Detail & Related papers (2024-08-07T14:32:36Z) - Semantic Communication Enhanced by Knowledge Graph Representation Learning [11.68356846628016]
This paper investigates the advantages of representing and processing semantic knowledge extracted into graphs within the emerging paradigm of semantic communications.
We propose sending semantic symbols solely equivalent to node embeddings through the wireless channel and inferring the complete knowledge graph at the receiver.
arXiv Detail & Related papers (2024-07-27T20:57:10Z) - Semantic Similarity Score for Measuring Visual Similarity at Semantic Level [5.867765921443141]
We propose a semantic evaluation metric -- SeSS (Semantic Similarity Score) based on Scene Graph Generation and graph matching.
The metric can measure the semantic-level differences in semantic-level information of images and can be used for evaluation in visual semantic communication systems.
arXiv Detail & Related papers (2024-06-06T08:51:26Z) - Deep Joint Semantic Coding and Beamforming for Near-Space Airship-Borne Massive MIMO Network [70.63240823677182]
Near-space airship-borne communication network urgently needs reliable and efficient Airship-to-X link.
This paper proposes to integrate semantic communication with massive multiple-input multiple-output (MIMO) technology.
arXiv Detail & Related papers (2024-05-30T09:46:59Z) - Transformer-Aided Semantic Communications [28.63893944806149]
We employ vision transformers specifically for the purpose of compression and compact representation of the input image.
Through the use of the attention mechanism inherent in transformers, we create an attention mask.
We evaluate the effectiveness of our proposed framework using the TinyImageNet dataset.
arXiv Detail & Related papers (2024-05-02T17:50:53Z) - Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks [2.2997117992292764]
This study proposes a multi-modal image transmission method that leverages various types of semantic information for efficient semantic communication.
The proposed method extracts multi-modal semantic information from an original image and transmits only that to a receiver.
The receiver generates multiple images using an image-generation model and selects an output image based on semantic similarity.
arXiv Detail & Related papers (2024-04-17T11:42:39Z) - Communication-Efficient Framework for Distributed Image Semantic
Wireless Transmission [68.69108124451263]
Federated learning-based semantic communication (FLSC) framework for multi-task distributed image transmission with IoT devices.
Each link is composed of a hierarchical vision transformer (HVT)-based extractor and a task-adaptive translator.
Channel state information-based multiple-input multiple-output transmission module designed to combat channel fading and noise.
arXiv Detail & Related papers (2023-08-07T16:32:14Z) - Edge Guided GANs with Multi-Scale Contrastive Learning for Semantic
Image Synthesis [139.2216271759332]
We propose a novel ECGAN for the challenging semantic image synthesis task.
The semantic labels do not provide detailed structural information, making it challenging to synthesize local details and structures.
The widely adopted CNN operations such as convolution, down-sampling, and normalization usually cause spatial resolution loss.
We propose a novel contrastive learning method, which aims to enforce pixel embeddings belonging to the same semantic class to generate more similar image content.
arXiv Detail & Related papers (2023-07-22T14:17:19Z) - Towards Semantic Communications: Deep Learning-Based Image Semantic
Coding [42.453963827153856]
We conceive the semantic communications for image data that is much more richer in semantics and bandwidth sensitive.
We propose an reinforcement learning based adaptive semantic coding (RL-ASC) approach that encodes images beyond pixel level.
Experimental results demonstrate that the proposed RL-ASC is noise robust and could reconstruct visually pleasant and semantic consistent image.
arXiv Detail & Related papers (2022-08-08T12:29:55Z) - CRIS: CLIP-Driven Referring Image Segmentation [71.56466057776086]
We propose an end-to-end CLIP-Driven Referring Image framework (CRIS)
CRIS resorts to vision-language decoding and contrastive learning for achieving the text-to-pixel alignment.
Our proposed framework significantly outperforms the state-of-the-art performance without any post-processing.
arXiv Detail & Related papers (2021-11-30T07:29:08Z) - CTNet: Context-based Tandem Network for Semantic Segmentation [77.4337867789772]
This work proposes a novel Context-based Tandem Network (CTNet) by interactively exploring the spatial contextual information and the channel contextual information.
To further improve the performance of the learned representations for semantic segmentation, the results of the two context modules are adaptively integrated.
arXiv Detail & Related papers (2021-04-20T07:33:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.