Deep Multilabel CNN for Forensic Footwear Impression Descriptor
Identification
- URL: http://arxiv.org/abs/2102.05090v1
- Date: Tue, 9 Feb 2021 19:39:28 GMT
- Title: Deep Multilabel CNN for Forensic Footwear Impression Descriptor
Identification
- Authors: Marcin Budka, Akanda Wahid Ul Ashraf, Scott Neville, Alun Mackrill,
Matthew Bennett
- Abstract summary: We employ a deep learning approach to classify footwear impression's features known as emphdescriptors for forensic use cases.
We develop and evaluate a technique for feeding downsampled greyscale impressions to a neural network pre-trained on data from a different domain.
- Score: 0.9786690381850356
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years deep neural networks have become the workhorse of computer
vision. In this paper, we employ a deep learning approach to classify footwear
impression's features known as \emph{descriptors} for forensic use cases.
Within this process, we develop and evaluate an effective technique for feeding
downsampled greyscale impressions to a neural network pre-trained on data from
a different domain. Our approach relies on learnable preprocessing layer paired
with multiple interpolation methods used in parallel. We empirically show that
this technique outperforms using a single type of interpolated image without
learnable preprocessing, and can help to avoid the computational penalty
related to using high resolution inputs, by making more efficient use of the
low resolution inputs. We also investigate the effect of preserving the aspect
ratio of the inputs, which leads to considerable boost in accuracy without
increasing the computational budget with respect to squished rectangular
images. Finally, we formulate a set of best practices for transfer learning
with greyscale inputs, potentially widely applicable in computer vision tasks
ranging from footwear impression classification to medical imaging.
Related papers
- AttentionPainter: An Efficient and Adaptive Stroke Predictor for Scene Painting [82.54770866332456]
Stroke-based Rendering (SBR) aims to decompose an input image into a sequence of parameterized strokes, which can be rendered into a painting that resembles the input image.
We propose AttentionPainter, an efficient and adaptive model for single-step neural painting.
arXiv Detail & Related papers (2024-10-21T18:36:45Z) - ConKeD: Multiview contrastive descriptor learning for keypoint-based retinal image registration [6.618504904743609]
We propose ConKeD, a novel deep learning approach to learn descriptors for retinal image registration.
In contrast to current registration methods, our approach employs a novel multi-positive multi-negative contrastive learning strategy.
Our experimental results demonstrate the benefits of the novel multi-positive multi-negative strategy.
arXiv Detail & Related papers (2024-01-11T13:22:54Z) - Deep Multi-Threshold Spiking-UNet for Image Processing [51.88730892920031]
This paper introduces the novel concept of Spiking-UNet for image processing, which combines the power of Spiking Neural Networks (SNNs) with the U-Net architecture.
To achieve an efficient Spiking-UNet, we face two primary challenges: ensuring high-fidelity information propagation through the network via spikes and formulating an effective training strategy.
Experimental results show that, on image segmentation and denoising, our Spiking-UNet achieves comparable performance to its non-spiking counterpart.
arXiv Detail & Related papers (2023-07-20T16:00:19Z) - Self-Supervised Endoscopic Image Key-Points Matching [1.3764085113103222]
This paper proposes a novel self-supervised approach for endoscopic image matching based on deep learning techniques.
Our method outperformed standard hand-crafted local feature descriptors in terms of precision and recall.
arXiv Detail & Related papers (2022-08-24T10:47:21Z) - On Efficient Transformer and Image Pre-training for Low-level Vision [74.22436001426517]
Pre-training has marked numerous state of the arts in high-level computer vision.
We present an in-depth study of image pre-training.
We find pre-training plays strikingly different roles in low-level tasks.
arXiv Detail & Related papers (2021-12-19T15:50:48Z) - Differentiable Patch Selection for Image Recognition [37.11810982945019]
We propose a differentiable Top-K operator to select the most relevant parts of the input to process high resolution images.
We show results for traffic sign recognition, inter-patch relationship reasoning, and fine-grained recognition without using object/part bounding box annotations.
arXiv Detail & Related papers (2021-04-07T11:15:51Z) - Leveraging Self-Supervision for Cross-Domain Crowd Counting [71.75102529797549]
State-of-the-art methods for counting people in crowded scenes rely on deep networks to estimate crowd density.
We train our network to recognize upside-down real images from regular ones and incorporate into it the ability to predict its own uncertainty.
This yields an algorithm that consistently outperforms state-of-the-art cross-domain crowd counting ones without any extra computation at inference time.
arXiv Detail & Related papers (2021-03-30T12:37:55Z) - Blind deblurring for microscopic pathology images using deep learning
networks [0.0]
We demonstrate a deep-learning-based approach that can alleviate the defocus and motion blur of a microscopic image.
It produces a sharper and cleaner image with retrieved fine details without prior knowledge of the blur type, blur extent and pathological stain.
We test our approach on different types of pathology specimens and demonstrate great performance on image blur correction and the subsequent improvement on the diagnosis outcome of AI algorithms.
arXiv Detail & Related papers (2020-11-24T03:52:45Z) - Learning to Learn Parameterized Classification Networks for Scalable
Input Images [76.44375136492827]
Convolutional Neural Networks (CNNs) do not have a predictable recognition behavior with respect to the input resolution change.
We employ meta learners to generate convolutional weights of main networks for various input scales.
We further utilize knowledge distillation on the fly over model predictions based on different input resolutions.
arXiv Detail & Related papers (2020-07-13T04:27:25Z) - Guided interactive image segmentation using machine learning and color
based data set clustering [0.16683739531034203]
We present a novel approach that combines machine learning based interactive image segmentation using supervoxels with a clustering method for the automated identification of similarly colored images in large data sets.
Our approach solves the problem of significant color variability prevalent and often unavoidable in biological and medical images.
arXiv Detail & Related papers (2020-05-15T17:25:48Z) - BP-DIP: A Backprojection based Deep Image Prior [49.375539602228415]
We propose two image restoration approaches: (i) Deep Image Prior (DIP), which trains a convolutional neural network (CNN) from scratch in test time using the degraded image; and (ii) a backprojection (BP) fidelity term, which is an alternative to the standard least squares loss that is usually used in previous DIP works.
We demonstrate the performance of the proposed method, termed BP-DIP, on the deblurring task and show its advantages over the plain DIP, with both higher PSNR values and better inference run-time.
arXiv Detail & Related papers (2020-03-11T17:09:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.