Quantitative Prediction on the Enantioselectivity of Multiple Chiral
Iodoarene Scaffolds Based on Whole Geometry
- URL: http://arxiv.org/abs/2103.14065v1
- Date: Thu, 25 Mar 2021 20:08:56 GMT
- Title: Quantitative Prediction on the Enantioselectivity of Multiple Chiral
Iodoarene Scaffolds Based on Whole Geometry
- Authors: Prema Dhorma Lama, Surendra Kumar, Kang Kim, Sangjin Ahn, Mi-hyun Kim
- Abstract summary: We introduce a predictive workflow for the extension of the reaction scope of chiral catalysts across name reactions.
Whole geometry descriptors were encoded from DFT optimized 3D structures of multiple catalyst scaffolds.
For the consensus prediction of ensemble models, this global descriptor can be compared with sterimol parameters and noncovalent interaction.
- Score: 4.042350304426974
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The mechanistic underpinnings of asymmetric catalysis at atomic levels
provide shortcuts for developing the potential value of chiral catalysts beyond
the current state-of-the-art. In the enantioselective redox transformations,
the present intuition-driven studies require a systematic approach to support
their intuitive idea. Arguably, the most systematic approach would be based on
the reliable quantitative structure-selectivity relationship of diverse and
dissimilar chiral scaffolds in an optimal feature space that is universally
applied to reactions. Here, we introduce a predictive workflow for the
extension of the reaction scope of chiral catalysts across name reactions. For
this purpose, whole geometry descriptors were encoded from DFT optimized 3D
structures of multiple catalyst scaffolds, 113 catalysts in 9 clusters. The
molecular descriptors were verified by the statistical comparison of the
enantioselective predictive classification models built from each descriptors
of chiral iodoarenes. More notably, capturing the whole molecular geometry
through one hot encoding of split three-dimensional molecular fingerprints
presented reliable enantioselective predictive regression models for three
different name reactions by recycling the data and metadata obtained across
reactions. The potential use value of this workflow and the advantages of
recyclability, compatibility, and generality proved that the workflow can be
applied for name reactions other than the aforementioned name reactions (out of
samples). Furthermore, for the consensus prediction of ensemble models, this
global descriptor can be compared with sterimol parameters and noncovalent
interaction vectors. This study is one case showing how to overcome the
sparsity of experimental data in organic reactions, especially asymmetric
catalysis.
Related papers
- Learning Chemical Reaction Representation with Reactant-Product Alignment [50.28123475356234]
This paper introduces modelname, a novel chemical reaction representation learning model tailored for a variety of organic-reaction-related tasks.
By integrating atomic correspondence between reactants and products, our model discerns the molecular transformations that occur during the reaction, thereby enhancing the comprehension of the reaction mechanism.
We have designed an adapter structure to incorporate reaction conditions into the chemical reaction representation, allowing the model to handle diverse reaction conditions and adapt to various datasets and downstream tasks, e.g., reaction performance prediction.
arXiv Detail & Related papers (2024-11-26T17:41:44Z) - A Machine Learning and Explainable AI Framework Tailored for Unbalanced Experimental Catalyst Discovery [10.92613600218535]
We introduce a robust machine learning and explainable AI (XAI) framework to accurately classify the catalytic yield of various compositions.
This framework combines a series of ML practices designed to handle the scarcity and imbalance of catalyst data.
We believe that such insights can assist chemists in the development and identification of novel catalysts with superior performance.
arXiv Detail & Related papers (2024-07-10T13:09:53Z) - Explainable Data-driven Modeling of Adsorption Energy in Heterogeneous Catalysis [6.349503549199403]
This study aims to bridge the gap between physics-based studies and data-driven methodologies.
We employ two XAI techniques: Post-hoc XAI analysis and Symbolic Regression.
Our work establishes a robust framework that integrates machine learning techniques with XAI.
arXiv Detail & Related papers (2024-05-30T18:06:14Z) - Beyond Major Product Prediction: Reproducing Reaction Mechanisms with
Machine Learning Models Trained on a Large-Scale Mechanistic Dataset [10.968137261042715]
Mechanistic understanding of organic reactions can facilitate reaction development, impurity prediction, and in principle, reaction discovery.
While several machine learning models have sought to address the task of predicting reaction products, their extension to predicting reaction mechanisms has been impeded by the lack of a corresponding mechanistic dataset.
We construct such a dataset by imputing intermediates between experimentally reported reactants and products using expert reaction templates and train several machine learning models on the resulting dataset of 5,184,184 elementary steps.
arXiv Detail & Related papers (2024-03-07T15:26:23Z) - 3DReact: Geometric deep learning for chemical reactions [35.38031930589095]
We introduce 3DReact, a deep geometric learning model to predict reaction properties from three-dimensional structures of reactants and products.
We show that, compared to existing models for reaction property prediction, 3DReact offers a flexible framework that exploits atom-mapping information.
It performs systematically well across different datasets, atom-mapping regimes, as well as both geometries and extrapolation tasks.
arXiv Detail & Related papers (2023-12-13T17:26:54Z) - Towards out-of-distribution generalizable predictions of chemical
kinetics properties [61.15970601264632]
Out-Of-Distribution (OOD) kinetic property prediction is required to be generalizable.
In this paper, we categorize the OOD kinetic property prediction into three levels (structure, condition, and mechanism)
We create comprehensive datasets to benchmark the state-of-the-art ML approaches for reaction prediction in the OOD setting and the state-of-the-art graph OOD methods in kinetics property prediction problems.
arXiv Detail & Related papers (2023-10-04T20:36:41Z) - Local manifold learning and its link to domain-based physics knowledge [53.15471241298841]
In many reacting flow systems, the thermo-chemical state-space is assumed to evolve close to a low-dimensional manifold (LDM)
We show that PCA applied in local clusters of data (local PCA) is capable of detecting the intrinsic parameterization of the thermo-chemical state-space.
arXiv Detail & Related papers (2022-07-01T09:06:25Z) - Rxn Hypergraph: a Hypergraph Attention Model for Chemical Reaction
Representation [70.97737157902947]
There is currently no universal and widely adopted method for robustly representing chemical reactions.
Here we exploit graph-based representations of molecular structures to develop and test a hypergraph attention neural network approach.
We evaluate this hypergraph representation in three experiments using three independent data sets of chemical reactions.
arXiv Detail & Related papers (2022-01-02T12:33:10Z) - Analytical Modelling of Exoplanet Transit Specroscopy with Dimensional
Analysis and Symbolic Regression [68.8204255655161]
The deep learning revolution has opened the door for deriving such analytical results directly with a computer algorithm fitting to the data.
We successfully demonstrate the use of symbolic regression on synthetic data for the transit radii of generic hot Jupiter exoplanets.
As a preprocessing step, we use dimensional analysis to identify the relevant dimensionless combinations of variables.
arXiv Detail & Related papers (2021-12-22T00:52:56Z) - Discovering Latent Causal Variables via Mechanism Sparsity: A New
Principle for Nonlinear ICA [81.4991350761909]
Independent component analysis (ICA) refers to an ensemble of methods which formalize this goal and provide estimation procedure for practical application.
We show that the latent variables can be recovered up to a permutation if one regularizes the latent mechanisms to be sparse.
arXiv Detail & Related papers (2021-07-21T14:22:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.