Deep Neural Networks based Invisible Steganography for Audio-into-Image
Algorithm
- URL: http://arxiv.org/abs/2102.09173v1
- Date: Thu, 18 Feb 2021 06:13:05 GMT
- Title: Deep Neural Networks based Invisible Steganography for Audio-into-Image
Algorithm
- Authors: Quang Pham Huu, Thoi Hoang Dinh, Ngoc N. Tran, Toan Pham Van and Thanh
Ta Minh
- Abstract summary: The integrity of both image and audio is well preserved, while the maximum length of the hidden audio is significantly improved.
We employ a joint deep neural network architecture consisting of two sub-models: the first network hides the secret audio into an image, and the second one is responsible for decoding the image to obtain the original audio.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: In the last few years, steganography has attracted increasing attention from
a large number of researchers since its applications are expanding further than
just the field of information security. The most traditional method is based on
digital signal processing, such as least significant bit encoding. Recently,
there have been some new approaches employing deep learning to address the
problem of steganography. However, most of the existing approaches are designed
for image-in-image steganography. In this paper, the use of deep learning
techniques to hide secret audio into the digital images is proposed. We employ
a joint deep neural network architecture consisting of two sub-models: the
first network hides the secret audio into an image, and the second one is
responsible for decoding the image to obtain the original audio. Extensive
experiments are conducted with a set of 24K images and the VIVOS Corpus audio
dataset. Through experimental results, it can be seen that our method is more
effective than traditional approaches. The integrity of both image and audio is
well preserved, while the maximum length of the hidden audio is significantly
improved.
Related papers
- Noise Adaption Network for Morse Code Image Classification [0.3031375888004876]
The transmission of Morse code images faces challenges due to diverse noises and distortions.
Existing methodologies predominantly concentrate on categorizing Morse code images affected by a single type of noise.
We propose a novel two-stage approach, termed the Noise Adaptation Network (NANet), for Morse code image classification.
arXiv Detail & Related papers (2024-10-24T22:24:52Z) - THInImg: Cross-modal Steganography for Presenting Talking Heads in
Images [14.09277898001307]
Cross-modal Steganography is the practice of concealing secret signals in publicly available cover signals unobtrusively.
We propose THInImg, which manages to hide lengthy audio data inside an identity image by leveraging the properties of human face.
THInImg can present up to 80 seconds of high quality talking-head video (including audio) in an identity image with 160x160 resolution.
arXiv Detail & Related papers (2023-11-28T19:11:01Z) - Align, Adapt and Inject: Sound-guided Unified Image Generation [50.34667929051005]
We propose a unified framework 'Align, Adapt, and Inject' (AAI) for sound-guided image generation, editing, and stylization.
Our method adapts input sound into a sound token, like an ordinary word, which can plug and play with existing Text-to-Image (T2I) models.
Our proposed AAI outperforms other text and sound-guided state-of-the-art methods.
arXiv Detail & Related papers (2023-06-20T12:50:49Z) - Human-imperceptible, Machine-recognizable Images [76.01951148048603]
A major conflict is exposed relating to software engineers between better developing AI systems and distancing from the sensitive training data.
This paper proposes an efficient privacy-preserving learning paradigm, where images are encrypted to become human-imperceptible, machine-recognizable''
We show that the proposed paradigm can ensure the encrypted images have become human-imperceptible while preserving machine-recognizable information.
arXiv Detail & Related papers (2023-06-06T13:41:37Z) - System Fingerprint Recognition for Deepfake Audio: An Initial Dataset
and Investigation [51.06875680387692]
We present the first deepfake audio dataset for system fingerprint recognition (SFR)
We collected the dataset from the speech synthesis systems of seven Chinese vendors that use the latest state-of-the-art deep learning technologies.
arXiv Detail & Related papers (2022-08-21T05:15:40Z) - Deep Audio Waveform Prior [19.826973437576395]
We show that existing SOTA architectures for audio source separation contain deep priors even when working with the raw waveform.
A network with relevant deep priors is likely to generate a cleaner version of the signal before converging on the corrupted signal.
arXiv Detail & Related papers (2022-07-21T12:25:03Z) - Voice-assisted Image Labelling for Endoscopic Ultrasound Classification
using Neural Networks [48.732863591145964]
We propose a multi-modal convolutional neural network architecture that labels endoscopic ultrasound (EUS) images from raw verbal comments provided by a clinician during the procedure.
Our results show a prediction accuracy of 76% at image level on a dataset with 5 different labels.
arXiv Detail & Related papers (2021-10-12T21:22:24Z) - Multi-Stage Residual Hiding for Image-into-Audio Steganography [40.669605041776954]
We present a cross-modal steganography method for hiding image content into audio carriers.
The proposed framework makes the controlling of payload capacity more flexible.
Experiments suggest that modifications to the carrier are unnoticeable by human listeners.
arXiv Detail & Related papers (2021-01-06T05:01:45Z) - Multi-Image Steganography Using Deep Neural Networks [9.722040907570072]
Steganography is the science of hiding a secret message within an ordinary public message.
We aim to utilize deep neural networks for the encoding and decoding of multiple secret images inside a single cover image of the same resolution.
arXiv Detail & Related papers (2021-01-02T01:51:38Z) - Dual Adversarial Network: Toward Real-world Noise Removal and Noise
Generation [52.75909685172843]
Real-world image noise removal is a long-standing yet very challenging task in computer vision.
We propose a novel unified framework to deal with the noise removal and noise generation tasks.
Our method learns the joint distribution of the clean-noisy image pairs.
arXiv Detail & Related papers (2020-07-12T09:16:06Z) - Deep Learning on Image Denoising: An overview [92.07378559622889]
We offer a comparative study of deep techniques in image denoising.
We first classify the deep convolutional neural networks (CNNs) for additive white noisy images.
Next, we compare the state-of-the-art methods on public denoising datasets in terms of quantitative and qualitative analysis.
arXiv Detail & Related papers (2019-12-31T05:03:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.