Combination of Raman spectroscopy and chemometrics: A review of recent
studies published in the Spectrochimica Acta, Part A: Molecular and
Biomolecular Spectroscopy Journal
- URL: http://arxiv.org/abs/2210.10051v1
- Date: Tue, 18 Oct 2022 13:08:20 GMT
- Title: Combination of Raman spectroscopy and chemometrics: A review of recent
studies published in the Spectrochimica Acta, Part A: Molecular and
Biomolecular Spectroscopy Journal
- Authors: Yulia Khristoforova, Lyudmila Bratchenko, Ivan Bratchenko
- Abstract summary: This review considers the application of Raman spectroscopy in combination with chemometrics to study samples and their changes caused by different factors.
We summarized the best strategies for creating classification models and highlighted some common drawbacks when it comes to the application of chemometrics techniques.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Raman spectroscopy is a promising technique used for noninvasive analysis of
samples in various fields of application due to its ability for fingerprint
probing of samples at the molecular level. Chemometrics methods are widely used
nowadays for better understanding of the recorded spectral fingerprints of
samples and differences in their chemical composition. This review considers a
number of manuscripts published in the Spectrochimica Acta, Part A: Molecular
and Biomolecular Spectroscopy Journal that presented findings regarding the
application of Raman spectroscopy in combination with chemometrics to study
samples and their changes caused by different factors. In 57 reviewed
manuscripts, we analyzed application of chemometrics algorithms, statistical
modeling parameters, utilization of cross validation, sample sizes, as well as
the performance of the proposed classification and regression model. We
summarized the best strategies for creating classification models and
highlighted some common drawbacks when it comes to the application of
chemometrics techniques. According to our estimations, about 70% of the papers
are likely to contain unsupported or invalid data due to insufficient
description of the utilized methods or drawbacks of the proposed classification
models. These drawbacks include: (1) insufficient experimental sample size for
classification/regression to achieve significant and reliable results, (2) lack
of cross validation (or a test set) for verification of the
classifier/regression performance, (3) incorrect division of the spectral data
into the training and the test/validation sets; (4) improper selection of the
PC number to reduce the analyzed spectral data dimension.
Related papers
- ScholarChemQA: Unveiling the Power of Language Models in Chemical Research Question Answering [54.80411755871931]
Question Answering (QA) effectively evaluates language models' reasoning and knowledge depth.
Chemical QA plays a crucial role in both education and research by effectively translating complex chemical information into readily understandable format.
This dataset reflects typical real-world challenges, including an imbalanced data distribution and a substantial amount of unlabeled data that can be potentially useful.
We introduce a QAMatch model, specifically designed to effectively answer chemical questions by fully leveraging our collected data.
arXiv Detail & Related papers (2024-07-24T01:46:55Z) - NovoBench: Benchmarking Deep Learning-based De Novo Peptide Sequencing Methods in Proteomics [58.03989832372747]
We present the first unified benchmark NovoBench for emphde novo peptide sequencing.
It comprises diverse mass spectrum data, integrated models, and comprehensive evaluation metrics.
Recent methods, including DeepNovo, PointNovo, Casanovo, InstaNovo, AdaNovo and $pi$-HelixNovo are integrated into our framework.
arXiv Detail & Related papers (2024-06-16T08:23:21Z) - A Gaussian Process Model for Ordinal Data with Applications to Chemoinformatics [0.0]
We present conditional Gaussian process models to predict ordinal outcomes from chemical experiments.
A novel aspect of our model is that the kernel contains a scaling parameter, that controls the strength of the correlation between elements of the chemical space.
Using molecular fingerprints, a numerical representation of a compound's location within the chemical space, we show that accounting for correlation amongst chemical compounds improves predictive performance.
arXiv Detail & Related papers (2024-05-16T11:18:32Z) - Robust retrieval of material chemical states in X-ray microspectroscopy [10.621361408885765]
We propose a novel data formulation model for X-ray microspectroscopy and develop a dedicated unmixing framework to solve this problem.
Our framework can accurately identify and characterize chemical states in complex and heterogeneous samples, even under challenging conditions.
arXiv Detail & Related papers (2023-08-08T12:17:02Z) - ChemVise: Maximizing Out-of-Distribution Chemical Detection with the
Novel Application of Zero-Shot Learning [60.02503434201552]
This research proposes learning approximations of complex exposures from training sets of simple ones.
We demonstrate this approach to synthetic sensor responses surprisingly improves the detection of out-of-distribution obscured chemical analytes.
arXiv Detail & Related papers (2023-02-09T20:19:57Z) - Rethinking Semi-Supervised Medical Image Segmentation: A
Variance-Reduction Perspective [51.70661197256033]
We propose ARCO, a semi-supervised contrastive learning framework with stratified group theory for medical image segmentation.
We first propose building ARCO through the concept of variance-reduced estimation and show that certain variance-reduction techniques are particularly beneficial in pixel/voxel-level segmentation tasks.
We experimentally validate our approaches on eight benchmarks, i.e., five 2D/3D medical and three semantic segmentation datasets, with different label settings.
arXiv Detail & Related papers (2023-02-03T13:50:25Z) - Comparing Spectroscopy Measurements in the Prediction of in Vitro
Dissolution Profile using Artificial Neural Networks [0.0]
Dissolution testing is part of the target product quality that is essential in approving new products in the pharmaceutical industry.
The prediction of the dissolution profile based on spectroscopic data is an alternative to the current destructive and time-consuming method.
Raman and near-infrared (NIR) spectroscopies are two fast and complementary methods that provide information on the tablets' physical and chemical properties.
arXiv Detail & Related papers (2022-10-19T04:34:04Z) - Unsupervised Machine Learning for Exploratory Data Analysis of Exoplanet
Transmission Spectra [68.8204255655161]
We focus on unsupervised techniques for analyzing spectral data from transiting exoplanets.
We show that there is a high degree of correlation in the spectral data, which calls for appropriate low-dimensional representations.
We uncover interesting structures in the principal component basis, namely, well-defined branches corresponding to different chemical regimes.
arXiv Detail & Related papers (2022-01-07T22:26:33Z) - Gaussian Process Regression for Absorption Spectra Analysis of Molecular
Dimers [68.8204255655161]
We discuss an approach based on a machine learning technique, where the parameters for the numerical calculations are chosen from Gaussian Process Regression (GPR)
This approach does not only quickly converge to an optimal parameter set, but in addition provides information about the complete parameter space.
We find that indeed the GPR gives reliable results which are in agreement with direct calculations of these parameters using quantum chemical methods.
arXiv Detail & Related papers (2021-12-14T17:46:45Z) - Investigating Manifold Neighborhood size for Nonlinear Analysis of LIBS
Amino Acid Spectra [0.0]
classification and identification of amino acids in aqueous solutions is important in the study of biomacromolecules.
Current methods for LIBS spectral analysis achieves promising results using PCA, a linear method.
We developed an information theoretic method for measurement of LIBS energy spectra, implemented manifold methods for nonlinear dimensionality reduction, and found while clustering results were not statistically significantly different, nonlinear methods lead to increased classification accuracy.
arXiv Detail & Related papers (2021-05-25T17:17:00Z) - Automated fragment identification for electron ionisation mass
spectrometry: application to atmospheric measurements of halocarbons [0.0]
Non-target screening consists in searching a sample for all present substances, suspected or unknown.
This approach has been introduced more than a decade ago in the field of water analysis, but is still very scarce for indoor and atmospheric trace gas measurements.
We present data analysis tools to enable automated identification of unknown compounds measured by GC-EI-HRMS.
arXiv Detail & Related papers (2021-03-23T09:35:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.