All-optical image classification through unknown random diffusers using
a single-pixel diffractive network
- URL: http://arxiv.org/abs/2208.03968v1
- Date: Mon, 8 Aug 2022 08:26:08 GMT
- Title: All-optical image classification through unknown random diffusers using
a single-pixel diffractive network
- Authors: Yi Luo, Bijie Bai, Yuhang Li, Ege Cetintas, Aydogan Ozcan
- Abstract summary: classification of an object behind a random and unknown scattering medium sets a challenging task for computational imaging and machine vision fields.
Recent deep learning-based approaches demonstrated the classification of objects using diffuser-distorted patterns collected by an image sensor.
Here, we present an all-optical processor to directly classify unknown objects through unknown, random phase diffusers using broadband illumination detected with a single pixel.
- Score: 13.7472825798265
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Classification of an object behind a random and unknown scattering medium
sets a challenging task for computational imaging and machine vision fields.
Recent deep learning-based approaches demonstrated the classification of
objects using diffuser-distorted patterns collected by an image sensor. These
methods demand relatively large-scale computing using deep neural networks
running on digital computers. Here, we present an all-optical processor to
directly classify unknown objects through unknown, random phase diffusers using
broadband illumination detected with a single pixel. A set of transmissive
diffractive layers, optimized using deep learning, forms a physical network
that all-optically maps the spatial information of an input object behind a
random diffuser into the power spectrum of the output light detected through a
single pixel at the output plane of the diffractive network. We numerically
demonstrated the accuracy of this framework using broadband radiation to
classify unknown handwritten digits through random new diffusers, never used
during the training phase, and achieved a blind testing accuracy of 88.53%.
This single-pixel all-optical object classification system through random
diffusers is based on passive diffractive layers that process broadband input
light and can operate at any part of the electromagnetic spectrum by simply
scaling the diffractive features proportional to the wavelength range of
interest. These results have various potential applications in, e.g.,
biomedical imaging, security, robotics, and autonomous driving.
Related papers
- Deep Learning Based Speckle Filtering for Polarimetric SAR Images. Application to Sentinel-1 [51.404644401997736]
We propose a complete framework to remove speckle in polarimetric SAR images using a convolutional neural network.
Experiments show that the proposed approach offers exceptional results in both speckle reduction and resolution preservation.
arXiv Detail & Related papers (2024-08-28T10:07:17Z) - Diffusion Facial Forgery Detection [56.69763252655695]
This paper introduces DiFF, a comprehensive dataset dedicated to face-focused diffusion-generated images.
We conduct extensive experiments on the DiFF dataset via a human test and several representative forgery detection methods.
The results demonstrate that the binary detection accuracy of both human observers and automated detectors often falls below 30%.
arXiv Detail & Related papers (2024-01-29T03:20:19Z) - Rethinking the Up-Sampling Operations in CNN-based Generative Network
for Generalizable Deepfake Detection [86.97062579515833]
We introduce the concept of Neighboring Pixel Relationships(NPR) as a means to capture and characterize the generalized structural artifacts stemming from up-sampling operations.
A comprehensive analysis is conducted on an open-world dataset, comprising samples generated by tft28 distinct generative models.
This analysis culminates in the establishment of a novel state-of-the-art performance, showcasing a remarkable tft11.6% improvement over existing methods.
arXiv Detail & Related papers (2023-12-16T14:27:06Z) - Unmixing Optical Signals from Undersampled Volumetric Measurements by Filtering the Pixel Latent Variables [5.74378659752939]
Latent Unmixing is a new approach which applies a band-pass filter to the latent space of a multi-spectralal neural network.
It enables better isolation and quantification of individual signal contributions, especially in the context of undersampled distributions.
We showcase the method's practical use in experimental physics through two test cases that highlight the versatility of our approach.
arXiv Detail & Related papers (2023-12-08T20:34:37Z) - Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z) - Probabilistic Deep Metric Learning for Hyperspectral Image
Classification [91.5747859691553]
This paper proposes a probabilistic deep metric learning framework for hyperspectral image classification.
It aims to predict the category of each pixel for an image captured by hyperspectral sensors.
Our framework can be readily applied to existing hyperspectral image classification methods.
arXiv Detail & Related papers (2022-11-15T17:57:12Z) - Time-lapse image classification using a diffractive neural network [0.0]
We show for the first time a time-lapse image classification scheme using a diffractive network.
We show a blind testing accuracy of 62.03% on the optical classification of objects from the CIFAR-10 dataset.
This constitutes the highest inference accuracy achieved so far using a single diffractive network.
arXiv Detail & Related papers (2022-08-23T08:16:30Z) - Analysis of Diffractive Neural Networks for Seeing Through Random
Diffusers [15.017918620413585]
We provide a computer-free, all-optical imaging method for seeing through random, unknown phase diffusers using diffractive neural networks.
By analyzing various diffractive networks designed to image through random diffusers with different correlation lengths, a trade-off between the image reconstruction fidelity and distortion reduction capability of the diffractive network was observed.
arXiv Detail & Related papers (2022-05-01T09:12:24Z) - Multispectral Satellite Data Classification using Soft Computing
Approach [5.3971200250581814]
We propose a grid-density based clustering technique for identification of objects.
We introduce an approach to classify a satellite image data using a rule induction based machine learning algorithm.
arXiv Detail & Related papers (2022-03-21T17:25:09Z) - A photosensor employing data-driven binning for ultrafast image
recognition [0.0]
Pixel binning is a technique widely used in optical image acquisition and spectroscopy.
Here, we push the concept of binning to its limit by combining a large fraction of the sensor elements into a single superpixel.
For a given pattern recognition task, its optimal shape is determined from training data using a machine learning algorithm.
arXiv Detail & Related papers (2021-11-20T15:38:39Z) - Light Stage Super-Resolution: Continuous High-Frequency Relighting [58.09243542908402]
We propose a learning-based solution for the "super-resolution" of scans of human faces taken from a light stage.
Our method aggregates the captured images corresponding to neighboring lights in the stage, and uses a neural network to synthesize a rendering of the face.
Our learned model is able to produce renderings for arbitrary light directions that exhibit realistic shadows and specular highlights.
arXiv Detail & Related papers (2020-10-17T23:40:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.