Knowledge-informed Molecular Learning: A Survey on Paradigm Transfer
- URL: http://arxiv.org/abs/2202.10587v2
- Date: Tue, 5 Sep 2023 10:46:44 GMT
- Title: Knowledge-informed Molecular Learning: A Survey on Paradigm Transfer
- Authors: Yin Fang, Zhuo Chen, Xiaohui Fan and Ningyu Zhang
- Abstract summary: Machine learning, notably deep learning, has significantly propelled molecular investigations within the biochemical sphere.
Traditionally, modeling for such research has centered around a handful of paradigms.
To enhance the generation and decipherability of purely data-driven models, scholars have integrated biochemical domain knowledge into these molecular study models.
- Score: 20.893861195128643
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Machine learning, notably deep learning, has significantly propelled
molecular investigations within the biochemical sphere. Traditionally, modeling
for such research has centered around a handful of paradigms. For instance, the
prediction paradigm is frequently deployed for tasks such as molecular property
prediction. To enhance the generation and decipherability of purely data-driven
models, scholars have integrated biochemical domain knowledge into these
molecular study models. This integration has sparked a surge in paradigm
transfer, which is solving one molecular learning task by reformulating it as
another one. With the emergence of Large Language Models, these paradigms have
demonstrated an escalating trend towards harmonized unification. In this work,
we delineate a literature survey focused on knowledge-informed molecular
learning from the perspective of paradigm transfer. We classify the paradigms,
scrutinize their methodologies, and dissect the contribution of domain
knowledge. Moreover, we encapsulate prevailing trends and identify intriguing
avenues for future exploration in molecular learning.
Related papers
- Bridging Text and Molecule: A Survey on Multimodal Frameworks for Molecule [16.641797535842752]
In this paper, we present the first systematic survey on multimodal frameworks for molecules research.
We begin with the development of molecular deep learning and point out the necessity to involve textual modality.
Furthermore, we delves into the utilization of large language models and prompting techniques for molecular tasks and present significant applications in drug discovery.
arXiv Detail & Related papers (2024-03-07T03:03:13Z) - Leveraging Biomolecule and Natural Language through Multi-Modal
Learning: A Survey [75.47055414002571]
The integration of biomolecular modeling with natural language (BL) has emerged as a promising interdisciplinary area at the intersection of artificial intelligence, chemistry and biology.
We provide an analysis of recent advancements achieved through cross modeling of biomolecules and natural language.
arXiv Detail & Related papers (2024-03-03T14:59:47Z) - Scientific Language Modeling: A Quantitative Review of Large Language
Models in Molecular Science [27.874571056109758]
Large language models (LLMs) offer a fresh approach to tackle scientific problems from a natural language processing (NLP) perspective.
We propose a multi-modal benchmark, named ChEBI-20-MM, and perform 1263 experiments to assess the model's compatibility with data modalities and knowledge acquisition.
Our pioneering analysis offers an exploration of the learning mechanism and paves the way for advancing SLM in molecular science.
arXiv Detail & Related papers (2024-02-06T16:12:36Z) - Interactive Molecular Discovery with Natural Language [69.89287960545903]
We propose the conversational molecular design, a novel task adopting natural language for describing and editing target molecules.
To better accomplish this task, we design ChatMol, a knowledgeable and versatile generative pre-trained model, enhanced by injecting experimental property information.
arXiv Detail & Related papers (2023-06-21T02:05:48Z) - Molecule Design by Latent Space Energy-Based Modeling and Gradual
Distribution Shifting [53.44684898432997]
Generation of molecules with desired chemical and biological properties is critical for drug discovery.
We propose a probabilistic generative model to capture the joint distribution of molecules and their properties.
Our method achieves very strong performances on various molecule design tasks.
arXiv Detail & Related papers (2023-06-09T03:04:21Z) - Towards Predicting Equilibrium Distributions for Molecular Systems with
Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems.
DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z) - A Molecular Multimodal Foundation Model Associating Molecule Graphs with
Natural Language [63.60376252491507]
We propose a molecular multimodal foundation model which is pretrained from molecular graphs and their semantically related textual data.
We believe that our model would have a broad impact on AI-empowered fields across disciplines such as biology, chemistry, materials, environment, and medicine.
arXiv Detail & Related papers (2022-09-12T00:56:57Z) - Generative Enriched Sequential Learning (ESL) Approach for Molecular
Design via Augmented Domain Knowledge [1.4410716345002657]
generative machine learning techniques can generate novel chemical structures based on molecular fingerprint representation.
Lack of supervised domain knowledge can mislead the learning procedure to be relatively biased to the prevalent molecules observed in the training data.
We alleviated this drawback by augmenting the training data with domain knowledge, e.g. quantitative estimates of the drug-likeness score (QEDs)
arXiv Detail & Related papers (2022-04-05T20:16:11Z) - Geometric Deep Learning on Molecular Representations [0.0]
Geometric deep learning (GDL) is based on neural network architectures that incorporate and process symmetry information.
This review provides a structured and harmonized overview of molecular GDL, highlighting its applications in drug discovery, chemical synthesis prediction, and quantum chemistry.
Emphasis is placed on the relevance of the learned molecular features and their complementarity to well-established molecular descriptors.
arXiv Detail & Related papers (2021-07-26T09:23:43Z) - Learning Neural Generative Dynamics for Molecular Conformation
Generation [89.03173504444415]
We study how to generate molecule conformations (textiti.e., 3D structures) from a molecular graph.
We propose a novel probabilistic framework to generate valid and diverse conformations given a molecular graph.
arXiv Detail & Related papers (2021-02-20T03:17:58Z) - A Perspective on Deep Learning for Molecular Modeling and Simulations [8.891007063629187]
We focus on the limitations of traditional deep learning models from the perspective of molecular physics.
We summarized several representative applications, ranging from supervised to unsupervised and reinforcement learning.
We outlook promising directions which may help address the existing issues in the current framework of deep molecular modeling.
arXiv Detail & Related papers (2020-04-25T22:58:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.