When Visible-to-Thermal Facial GAN Beats Conditional Diffusion
- URL: http://arxiv.org/abs/2302.09395v1
- Date: Sat, 18 Feb 2023 18:02:31 GMT
- Title: When Visible-to-Thermal Facial GAN Beats Conditional Diffusion
- Authors: Catherine Ordun, Edward Raff, Sanjay Purushotham
- Abstract summary: Telemedicine applications could benefit from thermal imagery, but conventional computers are reliant on RGB cameras and lack thermal sensors.
We propose the Visible-to-Thermal Facial GAN (VTF-GAN) that is specifically designed to generate high-resolution thermal faces.
Results show that VTF-GAN achieves high quality, crisp, and perceptually realistic thermal faces using a combined set of patch, temperature, perceptual, and Fourier Transform losses.
- Score: 36.33347149799959
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Thermal facial imagery offers valuable insight into physiological states such
as inflammation and stress by detecting emitted radiation in the infrared
spectrum, which is unseen in the visible spectra. Telemedicine applications
could benefit from thermal imagery, but conventional computers are reliant on
RGB cameras and lack thermal sensors. As a result, we propose the
Visible-to-Thermal Facial GAN (VTF-GAN) that is specifically designed to
generate high-resolution thermal faces by learning both the spatial and
frequency domains of facial regions, across spectra. We compare VTF-GAN against
several popular GAN baselines and the first conditional Denoising Diffusion
Probabilistic Model (DDPM) for VT face translation (VTF-Diff). Results show
that VTF-GAN achieves high quality, crisp, and perceptually realistic thermal
faces using a combined set of patch, temperature, perceptual, and Fourier
Transform losses, compared to all baselines including diffusion.
Related papers
- Hypergraph-Guided Disentangled Spectrum Transformer Networks for
Near-Infrared Facial Expression Recognition [31.783671943393344]
We give the first attempt to deep NIR facial expression recognition and proposed a novel method called near-infrared facial expression transformer (NFER-Former)
NFER-Former disentangles the expression information and spectrum information from the input image, so that the expression features can be extracted without the interference of spectrum variation.
We have constructed a large NIR-VIS Facial Expression dataset that includes 360 subjects to better validate the efficiency of NFER-Former.
arXiv Detail & Related papers (2023-12-10T15:15:50Z) - Learning Domain and Pose Invariance for Thermal-to-Visible Face
Recognition [6.454199265634863]
We propose a novel Domain and Pose Invariant Framework that simultaneously learns domain and pose invariant representations.
Our proposed framework is composed of modified networks for extracting the most correlated intermediate representations from off-pose thermal and frontal visible face imagery.
Although DPIF focuses on learning to match off-pose thermal to frontal visible faces, we also show that DPIF enhances performance when matching frontal thermal face images to frontal visible face images.
arXiv Detail & Related papers (2022-11-17T05:24:02Z) - A Synthesis-Based Approach for Thermal-to-Visible Face Verification [105.63410428506536]
This paper presents an algorithm that achieves state-of-the-art performance on the ARL-VTF and TUFTS multi-spectral face datasets.
We also present MILAB-VTF(B), a challenging multi-spectral face dataset composed of paired thermal and visible videos.
arXiv Detail & Related papers (2021-08-21T17:59:56Z) - Thermal Image Super-Resolution Using Second-Order Channel Attention with
Varying Receptive Fields [4.991042925292453]
We introduce a system to efficiently reconstruct thermal images.
The restoration of thermal images is critical for applications that involve safety, search and rescue, and military operations.
arXiv Detail & Related papers (2021-07-30T22:17:51Z) - Heterogeneous Face Frontalization via Domain Agnostic Learning [74.86585699909459]
We propose a domain agnostic learning-based generative adversarial network (DAL-GAN) which can synthesize frontal views in the visible domain from thermal faces with pose variations.
DAL-GAN consists of a generator with an auxiliary classifier and two discriminators which capture both local and global texture discriminations for better synthesis.
arXiv Detail & Related papers (2021-07-17T20:41:41Z) - Simultaneous Face Hallucination and Translation for Thermal to Visible
Face Verification using Axial-GAN [74.22129648654783]
We introduce the task of thermal-to-visible face verification from low-resolution thermal images.
We propose Axial-Generative Adversarial Network (Axial-GAN) to synthesize high-resolution visible images for matching.
arXiv Detail & Related papers (2021-04-13T22:34:28Z) - A Large-Scale, Time-Synchronized Visible and Thermal Face Dataset [62.193924313292875]
We present the DEVCOM Army Research Laboratory Visible-Thermal Face dataset (ARL-VTF)
With over 500,000 images from 395 subjects, the ARL-VTF dataset represents to the best of our knowledge, the largest collection of paired visible and thermal face images to date.
This paper presents benchmark results and analysis on thermal face landmark detection and thermal-to-visible face verification by evaluating state-of-the-art models on the ARL-VTF dataset.
arXiv Detail & Related papers (2021-01-07T17:17:12Z) - Multi-Scale Thermal to Visible Face Verification via Attribute Guided
Synthesis [55.29770222566124]
We use attributes extracted from visible images to synthesize attribute-preserved visible images from thermal imagery for cross-modal matching.
A novel multi-scale generator is proposed to synthesize the visible image from the thermal image guided by the extracted attributes.
A pre-trained VGG-Face network is leveraged to extract features from the synthesized image and the input visible image for verification.
arXiv Detail & Related papers (2020-04-20T01:45:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.