On Graph Neural Network Ensembles for Large-Scale Molecular Property
Prediction
- URL: http://arxiv.org/abs/2106.15529v1
- Date: Tue, 29 Jun 2021 15:58:34 GMT
- Title: On Graph Neural Network Ensembles for Large-Scale Molecular Property
Prediction
- Authors: Edward Elson Kosasih, Joaquin Cabezas, Xavier Sumba, Piotr Bielak,
Kamil Tagowski, Kelvin Idanwekhai, Benedict Aaron Tjandra, Arian Rokkum
Jamasb
- Abstract summary: The PCQM4M-LSC dataset defines a molecular HOMO-LUMO property prediction task on about 3.8M graphs.
We show our current work-in-progress solution which builds an ensemble of three graph neural networks models.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In order to advance large-scale graph machine learning, the Open Graph
Benchmark Large Scale Challenge (OGB-LSC) was proposed at the KDD Cup 2021. The
PCQM4M-LSC dataset defines a molecular HOMO-LUMO property prediction task on
about 3.8M graphs. In this short paper, we show our current work-in-progress
solution which builds an ensemble of three graph neural networks models based
on GIN, Bayesian Neural Networks and DiffPool. Our approach outperforms the
provided baseline by 7.6%. Moreover, using uncertainty in our ensemble's
prediction, we can identify molecules whose HOMO-LUMO gaps are harder to
predict (with Pearson's correlation of 0.5181). We anticipate that this will
facilitate active learning.
Related papers
- MolGraph-xLSTM: A graph-based dual-level xLSTM framework with multi-head mixture-of-experts for enhanced molecular representation and interpretability [9.858315463084084]
MolGraph-xLSTM is a graph-based xLSTM model that enhances feature extraction and effectively models molecule long-range interactions.
Our approach processes molecular graphs at two scales: atom-level and motif-level.
We validate MolGraph-xLSTM on 10 molecular property prediction datasets, covering both classification and regression tasks.
arXiv Detail & Related papers (2025-01-30T15:47:59Z) - LSEnet: Lorentz Structural Entropy Neural Network for Deep Graph Clustering [59.89626219328127]
Graph clustering is a fundamental problem in machine learning.
Deep learning methods achieve the state-of-the-art results in recent years, but they still cannot work without predefined cluster numbers.
We propose to address this problem from a fresh perspective of graph information theory.
arXiv Detail & Related papers (2024-05-20T05:46:41Z) - On the Scalability of GNNs for Molecular Graphs [7.402389334892391]
Graph Neural Networks (GNNs) are yet to show the benefits of scale due to the lower efficiency of sparse operations, large data requirements, and lack of clarity about the effectiveness of various architectures.
We analyze message-passing networks, graph Transformers, and hybrid architectures on the largest public collection of 2D molecular graphs.
For the first time, we observe that GNNs benefit tremendously from the increasing scale of depth, width, number of molecules, number of labels, and the diversity in the pretraining datasets.
arXiv Detail & Related papers (2024-04-17T17:11:31Z) - Bi-level Contrastive Learning for Knowledge-Enhanced Molecule
Representations [55.42602325017405]
We propose a novel method called GODE, which takes into account the two-level structure of individual molecules.
By pre-training two graph neural networks (GNNs) on different graph structures, combined with contrastive learning, GODE fuses molecular structures with their corresponding knowledge graph substructures.
When fine-tuned across 11 chemical property tasks, our model outperforms existing benchmarks, registering an average ROC-AUC uplift of 13.8% for classification tasks and an average RMSE/MAE enhancement of 35.1% for regression tasks.
arXiv Detail & Related papers (2023-06-02T15:49:45Z) - An ensemble of VisNet, Transformer-M, and pretraining models for
molecular property prediction in OGB Large-Scale Challenge @ NeurIPS 2022 [48.109627319222334]
ViSNet Team achieved the MAE of 0.0723 eV on the test-challenge set, dramatically reducing the error by 39.75% compared with the best method in the last year competition.
arXiv Detail & Related papers (2022-11-23T09:12:17Z) - HiGNN: Hierarchical Informative Graph Neural Networks for Molecular
Property Prediction Equipped with Feature-Wise Attention [5.735627221409312]
We propose a well-designed hierarchical informative graph neural networks framework (termed HiGNN) for predicting molecular property.
Experiments demonstrate that HiGNN achieves state-of-the-art predictive performance on many challenging drug discovery-associated benchmark datasets.
arXiv Detail & Related papers (2022-08-30T05:16:15Z) - MolGraph: a Python package for the implementation of molecular graphs
and graph neural networks with TensorFlow and Keras [51.92255321684027]
MolGraph is a graph neural network (GNN) package for molecular machine learning (ML)
MolGraph implements a chemistry module to accommodate the generation of small molecular graphs, which can be passed to a GNN algorithm to solve a molecular ML problem.
GNNs proved useful for molecular identification and improved interpretability of chromatographic retention time data.
arXiv Detail & Related papers (2022-08-21T18:37:41Z) - Multiscale Spatio-Temporal Graph Neural Networks for 3D Skeleton-Based
Motion Prediction [92.16318571149553]
We propose a multiscale-temporal graph neural network (MST-GNN) to predict the future 3D-based skeleton human poses.
The MST-GNN outperforms state-of-the-art methods in both short and long-term motion prediction.
arXiv Detail & Related papers (2021-08-25T14:05:37Z) - Fast Quantum Property Prediction via Deeper 2D and 3D Graph Networks [41.727588601578155]
We design a deep graph neural network to predict quantum properties by directly learning from 2D molecular graphs.
We also propose a 3D graph neural network to learn from low-coster sets.
arXiv Detail & Related papers (2021-06-16T05:07:28Z) - OGB-LSC: A Large-Scale Challenge for Machine Learning on Graphs [69.23600404232883]
OGB Large-Scale Challenge (OGB-LSC) is a collection of three real-world datasets for advancing the state-of-the-art in large-scale graph ML.
OGB-LSC provides dedicated baseline experiments, scaling up expressive graph ML models to the massive datasets.
arXiv Detail & Related papers (2021-03-17T04:08:03Z) - Distance-aware Molecule Graph Attention Network for Drug-Target Binding
Affinity Prediction [54.93890176891602]
We propose a diStance-aware Molecule graph Attention Network (S-MAN) tailored to drug-target binding affinity prediction.
As a dedicated solution, we first propose a position encoding mechanism to integrate the topological structure and spatial position information into the constructed pocket-ligand graph.
We also propose a novel edge-node hierarchical attentive aggregation structure which has edge-level aggregation and node-level aggregation.
arXiv Detail & Related papers (2020-12-17T17:44:01Z) - Examining COVID-19 Forecasting using Spatio-Temporal Graph Neural
Networks [8.949096210063662]
In this work, we examine a novel forecasting approach for COVID-19 case prediction that uses Graph Neural Networks and mobility data.
We evaluate this approach on the US county level COVID-19 dataset, and demonstrate that the rich spatial and temporal information leveraged by the graph neural network allows the model to learn complex dynamics.
arXiv Detail & Related papers (2020-07-06T23:11:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.