Semantic Communication Enabled Holographic Video Processing and Transmission
- URL: http://arxiv.org/abs/2510.13408v1
- Date: Wed, 15 Oct 2025 11:06:48 GMT
- Title: Semantic Communication Enabled Holographic Video Processing and Transmission
- Authors: Jingkai Ying, Zhiyuan Qi, Yulong Feng, Zhijin Qin, Zhu Han, Rahim Tafazolli, Yonina C. Eldar
- Abstract summary: This article provides an overview of holographic video communication and outlines the requirements of a holographic video communication system. Key technologies, including semantic sampling, joint semantic-channel coding, and semantic-aware transmission, are designed based on the proposed architecture.
- Score: 80.02919983620494
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Holographic video communication is considered a paradigm shift in visual communications, becoming increasingly popular for its ability to offer immersive experiences. This article provides an overview of holographic video communication and outlines the requirements of a holographic video communication system. Particularly, following a brief review of semantic communication, an architecture for a semantic-enabled holographic video communication system is presented. Key technologies, including semantic sampling, joint semantic-channel coding, and semantic-aware transmission, are designed based on the proposed architecture. Two related use cases are presented to demonstrate the performance gain of the proposed methods. Finally, potential research topics are discussed to pave the way for the realization of semantic-enabled holographic video communications.
Related papers
- Multi-Modal Semantic Communication [39.55262791529245]
We propose a novel Multi-Modal Semantic Communication framework that integrates text-based user queries to guide the information extraction process. Our proposed system employs a cross-modal attention mechanism that fuses visual features with language embeddings to produce soft relevance scores. At the receiver, the patches are reconstructed and combined to preserve task-critical information.
arXiv Detail & Related papers (2025-12-17T18:47:22Z) - Large Generative Model-assisted Talking-face Semantic Communication System [55.42631520122753]
This study introduces a Large Generative Model-assisted Talking-face Semantic Communication (LGM-TSC) system.
Generative Semantic Extractor (GSE) at the transmitter converts semantically sparse talking-face videos into texts with high information density.
Private Knowledge Base (KB) based on the Large Language Model (LLM) for semantic disambiguation and correction.
Generative Semantic Reconstructor (GSR) that utilizes BERT-VITS2 and SadTalker models to transform text back into a high-QoE talking-face video.
arXiv Detail & Related papers (2024-11-06T12:45:46Z) - Trustworthy Image Semantic Communication with GenAI: Explainablity, Controllability, and Efficiency [59.15544887307901]
Image semantic communication (ISC) has garnered significant attention for its potential to achieve high efficiency in visual content transmission.
Existing ISC systems based on joint source-channel coding face challenges in interpretability, operability, and compatibility.
We propose a novel trustworthy ISC framework that employs Generative Artificial Intelligence (GenAI) for multiple downstream inference tasks.
arXiv Detail & Related papers (2024-08-07T14:32:36Z) - Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation [29.87407471246318]
This research delves into the complexities of synchronizing facial movements and creating visually appealing, temporally consistent animations.
Our innovative approach embraces the end-to-end diffusion paradigm and introduces a hierarchical audio-driven visual synthesis module.
The proposed hierarchical audio-driven visual synthesis offers adaptive control over expression and pose diversity, enabling more effective personalization tailored to different identities.
arXiv Detail & Related papers (2024-06-13T04:33:20Z) - Semantic Face Compression for Metaverse: A Compact 3D Descriptor Based Approach [15.838410034900138]
We envision a new metaverse communication paradigm for virtual avatar faces, and develop the semantic face compression with compact 3D facial descriptors.
The proposed scheme is expected to enable numerous applications, such as digital human communication based on machine analysis.
arXiv Detail & Related papers (2023-09-24T13:39:50Z) - CrossTalk: Enhancing Communication and Collaboration in Videoconferencing with Intent Recognition from Conversational Speech [3.333406057333272]
We envision digital communication media as proactive facilitators that can provide unobtrusive assistance to enhance communication and collaboration.
We propose three key design concepts to explore the systematic integration of intelligence into communication and collaboration.
We developed CrossTalk, a videoconferencing system that instantiates these concepts, which was found to enable a more fluid and flexible communication and collaboration experience.
arXiv Detail & Related papers (2023-08-07T05:40:01Z) - Cognitive Semantic Communication Systems Driven by Knowledge Graph: Principle, Implementation, and Performance Evaluation [74.38561925376996]
Two cognitive semantic communication frameworks are proposed for the single-user and multiple-user communication scenarios.
An effective semantic correction algorithm is proposed by mining the inference rule from the knowledge graph.
For the multi-user cognitive semantic communication system, a message recovery algorithm is proposed to distinguish messages of different users.
arXiv Detail & Related papers (2023-03-15T12:01:43Z) - Wireless End-to-End Image Transmission System using Semantic Communications [4.2421412410466575]
The research shows that the resource gain in the form of bandwidth saving is immense when transmitting the semantic segmentation map through the physical channel.
The research studies the effect of physical channel distortions and quantization noise on semantic communication-based multimedia content transmission.
arXiv Detail & Related papers (2023-02-27T12:33:53Z) - Communication Beyond Transmitting Bits: Semantics-Guided Source and Channel Coding [7.080957878208516]
"Semantic communications" offers a promising research direction.
Injecting semantic guidance into the coded transmission design to achieve semantics-aware communications shows great potential for breakthrough in effectiveness and reliability.
This article sheds light on semantics-guided source and channel coding as a transmission paradigm of semantic communications.
arXiv Detail & Related papers (2022-08-04T06:12:55Z) - Cross-Modal Graph with Meta Concepts for Video Captioning [101.97397967958722]
We propose Cross-Modal Graph (CMG) with meta concepts for video captioning.
To cover the useful semantic concepts in video captions, we weakly learn the corresponding visual regions for text descriptions.
We construct holistic video-level and local frame-level video graphs with the predicted predicates to model video sequence structures.
arXiv Detail & Related papers (2021-08-14T04:00:42Z) - Neuro-Symbolic Representations for Video Captioning: A Case for Leveraging Inductive Biases for Vision and Language [148.0843278195794]
We propose a new model architecture for learning multi-modal neuro-symbolic representations for video captioning.
Our approach uses a dictionary learning-based method of learning relations between videos and their paired text descriptions.
arXiv Detail & Related papers (2020-11-18T20:21:19Z)