Inverse Design of Potential Singlet Fission Molecules using a Transfer
Learning Based Approach
- URL: http://arxiv.org/abs/2003.07666v1
- Date: Tue, 17 Mar 2020 12:37:54 GMT
- Title: Inverse Design of Potential Singlet Fission Molecules using a Transfer
Learning Based Approach
- Authors: Akshay Subramanian (1), Utkarsh Saha (2), Tejasvini Sharma (2), Naveen
K. Tailor (2), Soumitra Satapathi (2) ((1) Department of Metallurgical and
Materials Engineering, Indian Institute of Technology Roorkee, (2) Department
of Physics, Indian Institute of Technology Roorkee)
- Abstract summary: We put forward inverse design of possible singlet fission molecules using a transfer learning based approach.
We make use of a much larger ChEMBL dataset of structurally similar molecules to transfer the learned characteristics to the singlet fission dataset.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Singlet fission has emerged as one of the most exciting phenomena known to
improve the efficiencies of different types of solar cells and has found uses
in diverse optoelectronic applications. The range of available singlet fission
molecules is, however, limited as to undergo singlet fission, molecules have to
satisfy certain energy conditions. Recent advances in material search using
inverse design has enabled the prediction of materials for a wide range of
applications and has emerged as one of the most efficient methods in the
discovery of suitable materials. It is particularly helpful in manipulating
large datasets, uncovering hidden information from the molecular dataset and
generating new structures. However, we seldom encounter large datasets in
structure prediction problems in material science. In our work, we put forward
inverse design of possible singlet fission molecules using a transfer learning
based approach where we make use of a much larger ChEMBL dataset of
structurally similar molecules to transfer the learned characteristics to the
singlet fission dataset.
Related papers
- Physical Consistency Bridges Heterogeneous Data in Molecular Multi-Task Learning [79.75718786477638]
We exploit the specialty of molecular tasks that there are physical laws connecting them, and design consistency training approaches.
We demonstrate that the more accurate energy data can improve the accuracy of structure prediction.
We also find that consistency training can directly leverage force and off-equilibrium structure data to improve structure prediction.
arXiv Detail & Related papers (2024-10-14T03:11:33Z) - Data-Efficient Molecular Generation with Hierarchical Textual Inversion [48.816943690420224]
We introduce Hierarchical textual Inversion for Molecular generation (HI-Mol), a novel data-efficient molecular generation method.
HI-Mol is inspired by the importance of hierarchical information, e.g., both coarse- and fine-grained features, in understanding the molecule distribution.
Compared to the conventional textual inversion method in the image domain using a single-level token embedding, our multi-level token embeddings allow the model to effectively learn the underlying low-shot molecule distribution.
arXiv Detail & Related papers (2024-05-05T08:35:23Z) - A Heat Diffusion Perspective on Geodesic Preserving Dimensionality
Reduction [66.21060114843202]
We propose a more general heat kernel based manifold embedding method that we call heat geodesic embeddings.
Results show that our method outperforms existing state of the art in preserving ground truth manifold distances.
We also showcase our method on single cell RNA-sequencing datasets with both continuum and cluster structure.
arXiv Detail & Related papers (2023-05-30T13:58:50Z) - Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular
Property Prediction [53.06671763877109]
We develop molecular embeddings that encode complex molecular characteristics to improve the performance of few-shot molecular property prediction.
Our approach leverages large amounts of synthetic data, namely the results of molecular docking calculations.
On multiple molecular property prediction benchmarks, training from the embedding space substantially improves Multi-Task, MAML, and Prototypical Network few-shot learning performance.
arXiv Detail & Related papers (2023-02-04T01:32:40Z) - Discovery of structure-property relations for molecules via
hypothesis-driven active learning over the chemical space [0.0]
We introduce a novel approach for the active learning over the chemical spaces based on hypothesis learning.
We construct the hypotheses on the possible relationships between structures and functionalities of interest based on a small subset of data.
This approach combines the elements from the symbolic regression methods such as SISSO and active learning into a single framework.
arXiv Detail & Related papers (2023-01-06T14:22:43Z) - A Molecular Multimodal Foundation Model Associating Molecule Graphs with
Natural Language [63.60376252491507]
We propose a molecular multimodal foundation model which is pretrained from molecular graphs and their semantically related textual data.
We believe that our model would have a broad impact on AI-empowered fields across disciplines such as biology, chemistry, materials, environment, and medicine.
arXiv Detail & Related papers (2022-09-12T00:56:57Z) - Transferring Chemical and Energetic Knowledge Between Molecular Systems
with Machine Learning [5.27145343046974]
We propose a novel methodology for transferring knowledge obtained from simple molecular systems to a more complex one.
We focus on the classification of high and low free-energy states.
Our results show a remarkable AUC of 0.92 for transfer learning from tri-alanine to the deca-alanine system.
arXiv Detail & Related papers (2022-05-06T16:21:00Z) - Augmenting Molecular Deep Generative Models with Topological Data
Analysis Representations [21.237758981760784]
We present a SMILES Variational Auto-Encoder (VAE) augmented with topological data analysis (TDA) representations of molecules.
Our experiments show that this TDA augmentation enables a SMILES VAE to capture the complex relation between 3D geometry and electronic properties.
arXiv Detail & Related papers (2021-06-08T15:49:21Z) - BIGDML: Towards Exact Machine Learning Force Fields for Materials [55.944221055171276]
Machine-learning force fields (MLFF) should be accurate, computationally and data efficient, and applicable to molecules, materials, and interfaces thereof.
Here, we introduce the Bravais-Inspired Gradient-Domain Machine Learning approach and demonstrate its ability to construct reliable force fields using a training set with just 10-200 atoms.
arXiv Detail & Related papers (2021-06-08T10:14:57Z) - TorsionNet: A Reinforcement Learning Approach to Sequential Conformer
Search [17.2131835813425]
We present an efficient sequential conformer search technique based on reinforcement learning under the rigid rotor approximation.
Our experimental results show that torsionNet outperforms the highest scoring chemoinformatics method by 4x on large alkanes, and by several orders of magnitude on the previously unexplored biopolymer lignin.
arXiv Detail & Related papers (2020-06-12T11:03:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.