A Systematic Survey of Chemical Pre-trained Models
- URL: http://arxiv.org/abs/2210.16484v3
- Date: Thu, 27 Apr 2023 03:30:37 GMT
- Title: A Systematic Survey of Chemical Pre-trained Models
- Authors: Jun Xia, Yanqiao Zhu, Yuanqi Du, Stan Z. Li
- Abstract summary: Training Deep Neural Networks (DNNs) from scratch often requires abundant labeled molecules, which are expensive to acquire in the real world.
To alleviate this issue, tremendous efforts have been devoted to Chemical Pre-trained Models (CPMs).
CPMs are pre-trained using large-scale unlabeled molecular databases and then fine-tuned over specific downstream tasks.
- Score: 38.57023440288189
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep learning has achieved remarkable success in learning representations for
molecules, which is crucial for various biochemical applications, ranging from
property prediction to drug design. However, training Deep Neural Networks
(DNNs) from scratch often requires abundant labeled molecules, which are
expensive to acquire in the real world. To alleviate this issue, tremendous
efforts have been devoted to Chemical Pre-trained Models (CPMs), where DNNs
are pre-trained using large-scale unlabeled molecular databases and then
fine-tuned over specific downstream tasks. Despite this progress, there is no
systematic review of this fast-growing field. In this paper, we present the
first survey that summarizes the current progress of CPMs. We first highlight
the limitations of training molecular representation models from scratch to
motivate CPM studies. Next, we systematically review recent advances on this
topic from several key perspectives, including molecular descriptors, encoder
architectures, pre-training strategies, and applications. We also highlight the
challenges and promising avenues for future research, providing a useful
resource for both machine learning and scientific communities.