On the Effect of Pre-Processing and Model Complexity for Plastic
Analysis Using Short-Wave-Infrared Hyper-Spectral Imaging
- URL: http://arxiv.org/abs/2203.11209v1
- Date: Mon, 21 Mar 2022 11:19:11 GMT
- Title: On the Effect of Pre-Processing and Model Complexity for Plastic
Analysis Using Short-Wave-Infrared Hyper-Spectral Imaging
- Authors: Klaas Dijkstra, Maya Aghaei, Femke Jaarsma, Martin Dijkstra, Rudy
Folkersma, Jan Jager, Jaap van de Loosdrecht
- Abstract summary: Computer vision and deep learning enable automated analysis of short-wave-infrared hyper-spectral images of plastics.
In this paper, we show the importance of efficient model selection for resolving the task of hyper-spectral image segmentation.
We introduce the largest, most versatile hyper-spectral dataset of plastic flakes of four primary polymer types.
- Score: 0.11083289076967892
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The importance of plastic waste recycling is undeniable. In this respect,
computer vision and deep learning enable solutions through the automated
analysis of short-wave-infrared hyper-spectral images of plastics. In this
paper, we offer an exhaustive empirical study to show the importance of
efficient model selection for resolving the task of hyper-spectral image
segmentation of various plastic flakes using deep learning. We assess the
complexity level of generic and specialized models and infer their performance
capacity: generic models are often unnecessarily complex. We introduce two
variants of a specialized hyper-spectral architecture, PlasticNet, that
outperforms several well-known segmentation architectures in both performance
as well as computational complexity. In addition, we shed lights on the
significance of signal pre-processing within the realm of hyper-spectral
imaging. To complete our contribution, we introduce the largest, most versatile
hyper-spectral dataset of plastic flakes of four primary polymer types.
Related papers
- Disentangled representations of microscopy images [0.9849635250118911]
This work proposes a Disentangled Representation Learning (DRL) methodology to enhance model interpretability for microscopy image classification.<n>We show how a DRL framework, based on transferring a representation learnt from synthetic data, can provide a good trade-off between accuracy and interpretability in this domain.
arXiv Detail & Related papers (2025-06-25T17:44:37Z) - CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis [75.25966323298003]
Spectral imaging offers promising applications across diverse domains, including medicine and urban scene understanding.
variability in channel dimensionality and captured wavelengths among spectral cameras impede the development of AI-driven methodologies.
We introduce $textbfCARL$, a model for $textbfC$amera-$textbfA$gnostic $textbfR$esupervised $textbfL$ across RGB, multispectral, and hyperspectral imaging modalities.
arXiv Detail & Related papers (2025-04-27T13:06:40Z) - Machine learning surrogate models of many-body dispersion interactions in polymer melts [40.83978401377059]
We introduce a machine learning surrogate model specifically designed to predict MBD forces in polymer melts.
Our model is based on a trimmed SchNet architecture that selectively retains the most relevant atomic connections.
Characterized by high computational efficiency, our surrogate model enables practical incorporation of MBD effects into large-scale molecular simulations.
arXiv Detail & Related papers (2025-03-19T12:15:35Z) - Super-Resolution for Interferometric Imaging: Model Comparisons and Performance Analysis [0.0]
The study evaluates two Super-Resolution models, RCAN and Real-ESRGAN, for their effectiveness in reconstructing high-resolution interferograms from a microparticle-based dataset.
The results demonstrate that RCAN achieves superior numerical precision, making it ideal for applications requiring highly accurate phase map reconstruction.
arXiv Detail & Related papers (2025-02-21T11:50:57Z) - Revealing the Evolution of Order in Materials Microstructures Using Multi-Modal Computer Vision [4.6481041987538365]
Development of high-performance materials for microelectronics depends on our ability to describe and direct property-defining microstructural order.
Here, we demonstrate a multi-modal machine learning (ML) approach to describe order from electron microscopy analysis of the complex oxide La$_1-x$Sr$_x$FeO$_3$.
We observe distinct differences in the performance of uni- and multi-modal models, from which we draw general lessons in describing crystal order using computer vision.
arXiv Detail & Related papers (2024-11-15T02:44:32Z) - HyperDet: Generalizable Detection of Synthesized Images by Generating and Merging A Mixture of Hyper LoRAs [17.88153857572688]
We introduce a novel and generalizable detection framework termed HyperDet.
In this work, we propose a novel objective function that balances the pixel and semantic artifacts effectively.
Our work paves a new way to establish generalizable domain-specific fake image detectors based on pretrained large vision models.
arXiv Detail & Related papers (2024-10-08T13:43:01Z) - UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks [36.61645124563195]
We present UltraPixel, a novel architecture utilizing cascade diffusion models to generate high-quality images at multiple resolutions.
We use semantics-rich representations of lower-resolution images in the later denoising stage to guide the whole generation of highly detailed high-resolution images.
Our model achieves fast training with reduced data requirements, producing photo-realistic high-resolution images.
arXiv Detail & Related papers (2024-07-02T11:02:19Z) - Inter-slice Super-resolution of Magnetic Resonance Images by Pre-training and Self-supervised Fine-tuning [49.197385954021456]
In clinical practice, 2D magnetic resonance (MR) sequences are widely adopted. While individual 2D slices can be stacked to form a 3D volume, the relatively large slice spacing can pose challenges for visualization and subsequent analysis tasks.
To reduce slice spacing, deep-learning-based super-resolution techniques are widely investigated.
Most current solutions require a substantial number of paired high-resolution and low-resolution images for supervised training, which are typically unavailable in real-world scenarios.
arXiv Detail & Related papers (2024-06-10T02:20:26Z) - Memory-efficient High-resolution OCT Volume Synthesis with Cascaded Amortized Latent Diffusion Models [48.87160158792048]
We introduce a cascaded amortized latent diffusion model (CA-LDM) that can synthesis high-resolution OCT volumes in a memory-efficient way.
Experiments on a public high-resolution OCT dataset show that our synthetic data have realistic high-resolution and global features, surpassing the capabilities of existing methods.
arXiv Detail & Related papers (2024-05-26T10:58:22Z) - PLASTIC: Improving Input and Label Plasticity for Sample Efficient
Reinforcement Learning [54.409634256153154]
In Reinforcement Learning (RL), enhancing sample efficiency is crucial.
In principle, off-policy RL algorithms can improve sample efficiency by allowing multiple updates per environment interaction.
Our study investigates the underlying causes of this phenomenon by dividing plasticity into two aspects.
arXiv Detail & Related papers (2023-06-19T06:14:51Z) - GM-NeRF: Learning Generalizable Model-based Neural Radiance Fields from
Multi-view Images [79.39247661907397]
We introduce an effective framework Generalizable Model-based Neural Radiance Fields to synthesize free-viewpoint images.
Specifically, we propose a geometry-guided attention mechanism to register the appearance code from multi-view 2D images to a geometry proxy.
arXiv Detail & Related papers (2023-03-24T03:32:02Z) - Real-time Hyperspectral Imaging in Hardware via Trained Metasurface
Encoders [55.16861072631285]
Hyperspectral imaging has attracted significant attention to identify spectral signatures for image classification and automated pattern recognition in computer vision.
This work introduces Hyplex, a new integrated architecture addressing the limitations discussed above.
Hyplex is a CMOS-compatible, fast hyperspectral camera that replaces bulk optics with nanoscale metasurfaces inversely designed through artificial intelligence.
arXiv Detail & Related papers (2022-04-05T09:52:51Z) - Interpretable Hyperspectral AI: When Non-Convex Modeling meets
Hyperspectral Remote Sensing [57.52865154829273]
Hyperspectral imaging, also known as image spectrometry, is a landmark technique in geoscience remote sensing (RS)
In the past decade efforts have been made to process analyze these hyperspectral (HS) products mainly by means of seasoned experts.
For this reason, it is urgent to develop more intelligent and automatic approaches for various HS RS applications.
arXiv Detail & Related papers (2021-03-02T03:32:10Z) - Unveiling the role of plasticity rules in reservoir computing [0.0]
Reservoir Computing (RC) is an appealing approach in Machine Learning.
We analyze the role that plasticity rules play on the changes that lead to a better performance of RC.
arXiv Detail & Related papers (2021-01-14T19:55:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.