Multi-Stage Residual Hiding for Image-into-Audio Steganography
- URL: http://arxiv.org/abs/2101.01872v1
- Date: Wed, 6 Jan 2021 05:01:45 GMT
- Title: Multi-Stage Residual Hiding for Image-into-Audio Steganography
- Authors: Wenxue Cui, Shaohui Liu, Feng Jiang, Yongliang Liu, Debin Zhao
- Abstract summary: We present a cross-modal steganography method for hiding image content into audio carriers.
The proposed framework makes the controlling of payload capacity more flexible.
Experiments suggest that modifications to the carrier are unnoticeable by human listeners.
- Score: 40.669605041776954
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: The widespread application of audio communication technologies has speeded up
audio data flowing across the Internet, which made it a popular carrier for
covert communication. In this paper, we present a cross-modal steganography
method for hiding image content into audio carriers while preserving the
perceptual fidelity of the cover audio. In our framework, two multi-stage
networks are designed: the first network encodes the decreasing multilevel
residual errors inside different audio subsequences with the corresponding
stage sub-networks, while the second network decodes the residual errors from
the modified carrier with the corresponding stage sub-networks to produce the
final revealed results. The multi-stage design of proposed framework not only
make the controlling of payload capacity more flexible, but also make hiding
easier because of the gradual sparse characteristic of residual errors.
Qualitative experiments suggest that modifications to the carrier are
unnoticeable by human listeners and that the decoded images are highly
intelligible.
Related papers
- Noise Adaption Network for Morse Code Image Classification [0.3031375888004876]
The transmission of Morse code images faces challenges due to diverse noises and distortions.
Existing methodologies predominantly concentrate on categorizing Morse code images affected by a single type of noise.
We propose a novel two-stage approach, termed the Noise Adaptation Network (NANet), for Morse code image classification.
arXiv Detail & Related papers (2024-10-24T22:24:52Z) - TMFNet: Two-Stream Multi-Channels Fusion Networks for Color Image Operation Chain Detection [9.346492393908322]
We propose a novel two-stream multi-channels fusion network for color image operation chain detection.
The proposed method achieves state-of-the-art generalization ability while maintaining robustness to JPEG compression.
arXiv Detail & Related papers (2024-09-12T02:04:26Z) - High-Fidelity Speech Synthesis with Minimal Supervision: All Using
Diffusion Models [56.00939852727501]
Minimally-supervised speech synthesis decouples TTS by combining two types of discrete speech representations.
Non-autoregressive framework enhances controllability, and duration diffusion model enables diversified prosodic expression.
arXiv Detail & Related papers (2023-09-27T09:27:03Z) - Deep Semantic Statistics Matching (D2SM) Denoising Network [70.01091467628068]
We introduce the Deep Semantic Statistics Matching (D2SM) Denoising Network.
It exploits semantic features of pretrained classification networks, then it implicitly matches the probabilistic distribution of clear images at the semantic feature space.
By learning to preserve the semantic distribution of denoised images, we empirically find our method significantly improves the denoising capabilities of networks.
arXiv Detail & Related papers (2022-07-19T14:35:42Z) - Timbre Transfer with Variational Auto Encoding and Cycle-Consistent
Adversarial Networks [0.6445605125467573]
This research project investigates the application of deep learning to timbre transfer, where the timbre of a source audio can be converted to the timbre of a target audio with minimal loss in quality.
The adopted approach combines Variational Autoencoders with Generative Adversarial Networks to construct meaningful representations of the source audio and produce realistic generations of the target audio.
arXiv Detail & Related papers (2021-09-05T15:06:53Z) - WaveFill: A Wavelet-based Generation Network for Image Inpainting [57.012173791320855]
WaveFill is a wavelet-based inpainting network that decomposes images into multiple frequency bands.
WaveFill decomposes images by using discrete wavelet transform (DWT) that preserves spatial information naturally.
It applies L1 reconstruction loss to the low-frequency bands and adversarial loss to high-frequency bands, hence effectively mitigate inter-frequency conflicts.
arXiv Detail & Related papers (2021-07-23T04:44:40Z) - Deep Neural Networks based Invisible Steganography for Audio-into-Image
Algorithm [0.0]
The integrity of both image and audio is well preserved, while the maximum length of the hidden audio is significantly improved.
We employ a joint deep neural network architecture consisting of two sub-models: the first network hides the secret audio into an image, and the second one is responsible for decoding the image to obtain the original audio.
arXiv Detail & Related papers (2021-02-18T06:13:05Z) - D-Unet: A Dual-encoder U-Net for Image Splicing Forgery Detection and
Localization [108.8592577019391]
Image splicing forgery detection is a global binary classification task that distinguishes the tampered and non-tampered regions by image fingerprints.
We propose a novel network called dual-encoder U-Net (D-Unet) for image splicing forgery detection, which employs an unfixed encoder and a fixed encoder.
In an experimental comparison study of D-Unet and state-of-the-art methods, D-Unet outperformed the other methods in image-level and pixel-level detection.
arXiv Detail & Related papers (2020-12-03T10:54:02Z) - Image Denoising for Strong Gaussian Noises With Specialized CNNs for
Different Frequency Components [4.010371060637209]
In machine learning approach to image denoising a network is trained to recover a clean image from a noisy one.
In this paper a novel structure is proposed based on training multiple specialized networks.
arXiv Detail & Related papers (2020-11-26T23:20:25Z) - Identity Enhanced Residual Image Denoising [61.75610647978973]
We learn a fully-convolutional network model that consists of a Chain of Identity Mapping Modules and residual on the residual architecture for image denoising.
The proposed network produces remarkably higher numerical accuracy and better visual image quality than the classical state-of-the-art and CNN algorithms.
arXiv Detail & Related papers (2020-04-26T04:52:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.