Determination of Trace Organic Contaminant Concentration via Machine
Classification of Surface-Enhanced Raman Spectra
- URL: http://arxiv.org/abs/2402.00197v1
- Date: Wed, 31 Jan 2024 21:49:40 GMT
- Title: Determination of Trace Organic Contaminant Concentration via Machine
Classification of Surface-Enhanced Raman Spectra
- Authors: Vishnu Jayaprakash, Jae Bem You, Chiranjeevi Kanike, Jinfeng Liu,
Christopher McCallum, and Xuehua Zhang
- Abstract summary: We show an approach for predicting the concentration of sample pollutants from messy, unprocessed Raman data using machine learning.
Using standard machine learning models, the concentration of sample pollutants are predicted with more than 80 percent cross-validation accuracy from raw Raman data.
- Score: 0.7029155133139362
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Accurate detection and analysis of traces of persistent organic pollutants in
water is important in many areas, including environmental monitoring and food
quality control, due to their long environmental stability and potential
bioaccumulation. While conventional analysis of organic pollutants requires
expensive equipment, surface enhanced Raman spectroscopy (SERS) has
demonstrated great potential for accurate detection of these contaminants.
However, SERS analytical difficulties, such as spectral preprocessing,
denoising, and substrate-based spectral variation, have hindered widespread use
of the technique. Here, we demonstrate an approach for predicting the
concentration of sample pollutants from messy, unprocessed Raman data using
machine learning. Frequency domain transform methods, including the Fourier and
Walsh Hadamard transforms, are applied to sets of Raman spectra of three model
micropollutants in water (rhodamine 6G, chlorpyrifos, and triclosan), which are
then used to train machine learning algorithms. Using standard machine learning
models, the concentration of sample pollutants are predicted with more than 80
percent cross-validation accuracy from raw Raman data. cross-validation
accuracy of 85 percent was achieved using deep learning for a moderately sized
dataset (100 spectra), and 70 to 80 percent cross-validation accuracy was
achieved even for very small datasets (50 spectra). Additionally, standard
models were shown to accurately identify characteristic peaks via analysis of
their importance scores. The approach shown here has the potential to be
applied to facilitate accurate detection and analysis of persistent organic
pollutants by surface-enhanced Raman spectroscopy.
Related papers
- Interpretable cancer cell detection with phonon microscopy using multi-task conditional neural networks for inter-batch calibration [39.759100498329275]
We present a conditional neural network framework to simultaneously achieve inter-batch calibration.
We validate our approach by training and validating on different experimental batches.
We extend our model to reconstruct denoised signals, enabling physical interpretation of salient features indicating disease state.
arXiv Detail & Related papers (2024-03-26T12:20:10Z) - Artificial Intelligence for reverse engineering: application to
detergents using Raman spectroscopy [0.0]
Development of digital tools and analytical techniques will allow for the identification of potential toxic molecules.
The combination of various digital tools (spectral database, mixture database, experimental design, Chemometrics / Machine Learning algorithmldots) together with different sample preparation methods has enabled the identification of the mixture's constituents.
This strategy is also applicable in the industrial sector for product or raw material control, as well as for quality control purposes.
arXiv Detail & Related papers (2023-10-31T08:16:22Z) - Robust retrieval of material chemical states in X-ray microspectroscopy [10.621361408885765]
We propose a novel data formulation model for X-ray microspectroscopy and develop a dedicated unmixing framework to solve this problem.
Our framework can accurately identify and characterize chemical states in complex and heterogeneous samples, even under challenging conditions.
arXiv Detail & Related papers (2023-08-08T12:17:02Z) - Spectral Analysis of Marine Debris in Simulated and Observed
Sentinel-2/MSI Images using Unsupervised Classification [0.0]
This study uses Radiative Transfer Model (RTM) simulated data and data from the Multispectral Instrument (MSI) of the Sentinel-2 mission in combination with machine learning algorithms.
The results indicate that the spectral behavior of pollutants is influenced by factors such as the type of polymer and pixel coverage percentage.
These insights can guide future research in remote sensing applications for detecting marine plastic pollution.
arXiv Detail & Related papers (2023-06-26T18:46:47Z) - ChemVise: Maximizing Out-of-Distribution Chemical Detection with the
Novel Application of Zero-Shot Learning [60.02503434201552]
This research proposes learning approximations of complex exposures from training sets of simple ones.
We demonstrate this approach to synthetic sensor responses surprisingly improves the detection of out-of-distribution obscured chemical analytes.
arXiv Detail & Related papers (2023-02-09T20:19:57Z) - Evaluation of the potential of Near Infrared Hyperspectral Imaging for
monitoring the invasive brown marmorated stink bug [53.682955739083056]
The brown marmorated stink bug (BMSB), Halyomorpha halys, is an invasive insect pest of global importance that damages several crops.
The present study consists in a preliminary evaluation at the laboratory level of Near Infrared Hyperspectral Imaging (NIR-HSI) as a possible technology to detect BMSB specimens.
arXiv Detail & Related papers (2023-01-19T11:37:20Z) - Exploring Supervised Machine Learning for Multi-Phase Identification and
Quantification from Powder X-Ray Diffraction Spectra [1.0660480034605242]
Powder X-ray diffraction analysis is a critical component of materials characterization methodologies.
Deep learning has become a prime focus for predicting crystallographic parameters and features from X-ray spectra.
Here, we are interested in conventional supervised learning algorithms in lieu of deep learning for multi-label crystalline phase identification.
arXiv Detail & Related papers (2022-11-16T00:36:13Z) - Estimating the concentration of chiral media with bright squeezed light [77.34726150561087]
We quantify the performance of Gaussian probes in estimating the concentration of chiral analytes.
Four-fold precision enhancement is achievable using state-of-the-art squeezing levels and intensity measurements.
arXiv Detail & Related papers (2022-08-21T17:18:10Z) - Unsupervised Machine Learning for Exploratory Data Analysis of Exoplanet
Transmission Spectra [68.8204255655161]
We focus on unsupervised techniques for analyzing spectral data from transiting exoplanets.
We show that there is a high degree of correlation in the spectral data, which calls for appropriate low-dimensional representations.
We uncover interesting structures in the principal component basis, namely, well-defined branches corresponding to different chemical regimes.
arXiv Detail & Related papers (2022-01-07T22:26:33Z) - Automated fragment identification for electron ionisation mass
spectrometry: application to atmospheric measurements of halocarbons [0.0]
Non-target screening consists in searching a sample for all present substances, suspected or unknown.
This approach has been introduced more than a decade ago in the field of water analysis, but is still very scarce for indoor and atmospheric trace gas measurements.
We present data analysis tools to enable automated identification of unknown compounds measured by GC-EI-HRMS.
arXiv Detail & Related papers (2021-03-23T09:35:10Z) - Unassisted Noise Reduction of Chemical Reaction Data Sets [59.127921057012564]
We propose a machine learning-based, unassisted approach to remove chemically wrong entries from data sets.
Our results show an improved prediction quality for models trained on the cleaned and balanced data sets.
arXiv Detail & Related papers (2021-02-02T09:34:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.