ChemoVerse: Manifold traversal of latent spaces for novel molecule
discovery
- URL: http://arxiv.org/abs/2009.13946v1
- Date: Tue, 29 Sep 2020 12:11:40 GMT
- Title: ChemoVerse: Manifold traversal of latent spaces for novel molecule
discovery
- Authors: Harshdeep Singh, Nicholas McCarthy, Qurrat Ul Ain, Jeremiah Hayes
- Abstract summary: It is essential to identify molecular structures with the desired chemical properties.
Recent advances in generative models using neural networks and machine learning are being widely used to design virtual libraries of drug-like compounds.
- Score: 0.7742297876120561
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In order to design a more potent and effective chemical entity, it is
essential to identify molecular structures with the desired chemical
properties. Recent advances in generative models using neural networks and
machine learning are being widely used by many emerging startups and
researchers in this domain to design virtual libraries of drug-like compounds.
Although these models can help a scientist to produce novel molecular
structures rapidly, the challenge still exists in the intelligent exploration
of the latent spaces of generative models, thereby reducing the randomness in
the generative procedure. In this work we present a manifold traversal with
heuristic search to explore the latent chemical space. Different heuristics and
scores such as the Tanimoto coefficient, synthetic accessibility, binding
activity, and QED drug-likeness can be incorporated to increase the validity
and proximity for desired molecular properties of the generated molecules. For
evaluating the manifold traversal exploration, we produce the latent chemical
space using various generative models such as grammar variational autoencoders
(with and without attention) as they deal with the randomized generation and
validity of compounds. With this novel traversal method, we are able to find
more unseen compounds and more specific regions to mine in the latent space.
Finally, these components are brought together in a simple platform allowing
users to perform search, visualization and selection of novel generated
compounds.
Related papers
- Navigating Chemical Space with Latent Flows [20.95884505685799]
We propose a new framework, ChemFlow, to traverse chemical space through navigating the latent space learned by molecule generative models through flows.
We validate the efficacy of ChemFlow on molecule manipulation and single- and multi-objective optimization tasks under both supervised and unsupervised molecular discovery settings.
arXiv Detail & Related papers (2024-05-07T03:55:57Z) - Interactive Molecular Discovery with Natural Language [69.89287960545903]
We propose the conversational molecular design, a novel task adopting natural language for describing and editing target molecules.
To better accomplish this task, we design ChatMol, a knowledgeable and versatile generative pre-trained model, enhanced by injecting experimental property information.
arXiv Detail & Related papers (2023-06-21T02:05:48Z) - CHA2: CHemistry Aware Convex Hull Autoencoder Towards Inverse Molecular
Design [2.169755083801688]
It is impossible to explore the entire search space comprehensively to exploit de novo structures with properties of interest.
To address this challenge, reducing the intractable search space into a lower-dimensional latent volume helps examine molecular candidates more feasibly.
We propose using a convex hall surrounding the top molecules in terms of high QEDs to ensnare a tight subspace in the latent representation as an efficient way to reveal novel molecules with high QEDs.
arXiv Detail & Related papers (2023-02-21T21:05:31Z) - Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular
Property Prediction [53.06671763877109]
We develop molecular embeddings that encode complex molecular characteristics to improve the performance of few-shot molecular property prediction.
Our approach leverages large amounts of synthetic data, namely the results of molecular docking calculations.
On multiple molecular property prediction benchmarks, training from the embedding space substantially improves Multi-Task, MAML, and Prototypical Network few-shot learning performance.
arXiv Detail & Related papers (2023-02-04T01:32:40Z) - Domain-Agnostic Molecular Generation with Chemical Feedback [44.063584808910896]
MolGen is a pre-trained molecular language model tailored specifically for molecule generation.
It internalizes structural and grammatical insights through the reconstruction of over 100 million molecular SELFIES.
Our chemical feedback paradigm steers the model away from molecular hallucinations, ensuring alignment between the model's estimated probabilities and real-world chemical preferences.
arXiv Detail & Related papers (2023-01-26T17:52:56Z) - Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z) - Exploring Chemical Space with Score-based Out-of-distribution Generation [57.15855198512551]
We propose a score-based diffusion scheme that incorporates out-of-distribution control in the generative differential equation (SDE)
Since some novel molecules may not meet the basic requirements of real-world drugs, MOOD performs conditional generation by utilizing the gradients from a property predictor.
We experimentally validate that MOOD is able to explore the chemical space beyond the training distribution, generating molecules that outscore ones found with existing methods, and even the top 0.01% of the original training pool.
arXiv Detail & Related papers (2022-06-06T06:17:11Z) - Scalable Fragment-Based 3D Molecular Design with Reinforcement Learning [68.8204255655161]
We introduce a novel framework for scalable 3D design that uses a hierarchical agent to build molecules.
In a variety of experiments, we show that our agent, guided only by energy considerations, can efficiently learn to produce molecules with over 100 atoms.
arXiv Detail & Related papers (2022-02-01T18:54:24Z) - Generative chemistry: drug discovery with deep learning generative
models [0.0]
This paper reviews the latest advances in generative chemistry which relies on generative modeling to expedite the drug discovery process.
The detailed discussions on utilizing cutting-edge generative architectures, including recurrent neural network, variational autoencoder, adversarial autoencoder, and generative adversarial network for compound generation are focused.
arXiv Detail & Related papers (2020-08-20T14:38:21Z) - Learning To Navigate The Synthetically Accessible Chemical Space Using
Reinforcement Learning [75.95376096628135]
We propose a novel forward synthesis framework powered by reinforcement learning (RL) for de novo drug design.
In this setup, the agent learns to navigate through the immense synthetically accessible chemical space.
We describe how the end-to-end training in this study represents an important paradigm in radically expanding the synthesizable chemical space.
arXiv Detail & Related papers (2020-04-26T21:40:03Z) - The Synthesizability of Molecules Proposed by Generative Models [3.032184156362992]
Discovery of functional molecules is an expensive and time-consuming process.
One class of techniques of growing interest for early-stage drug discovery is de novo molecular generation and optimization.
These techniques can suggest novel molecular structures intended to maximize a multi-objective function.
However, the utility of these approaches is stymied by ignorance of synthesizability.
arXiv Detail & Related papers (2020-02-17T15:41:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.