Fugu-MT 論文翻訳(概要): Coordinating Cross-modal Distillation for Molecular Property Prediction

論文の概要: Coordinating Cross-modal Distillation for Molecular Property Prediction

arxiv url: http://arxiv.org/abs/2211.16712v1
Date: Wed, 30 Nov 2022 03:19:34 GMT
ステータス: 翻訳完了
システム内更新日: 2022-12-01 18:08:09.335254
Title: Coordinating Cross-modal Distillation for Molecular Property Prediction
Title（参考訳）: 分子特性予測のためのクロスモーダル蒸留のコーディネート
Authors: Hao Zhang, Nan Zhang, Ruixin Zhang, Lei Shen, Yingyi Zhang, and Meng Liu
Abstract要約: グローバルな分子蒸留と局所的な原子蒸留を含む枠組みを提案する。また、原子と分子情報の調整方法を正当化する理論的な洞察も提供します。提案手法は,他の手法よりも優れた性能を示す。
参考スコア（独自算出の注目度）: 21.811716721135248
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In recent years, molecular graph representation learning (GRL) has drawn much more attention in molecular property prediction (MPP) problems. The existing graph methods have demonstrated that 3D geometric information is significant for better performance in MPP. However, accurate 3D structures are often costly and time-consuming to obtain, limiting the large-scale application of GRL. It is an intuitive solution to train with 3D to 2D knowledge distillation and predict with only 2D inputs. But some challenging problems remain open for 3D to 2D distillation. One is that the 3D view is quite distinct from the 2D view, and the other is that the gradient magnitudes of atoms in distillation are discrepant and unstable due to the variable molecular size. To address these challenging problems, we exclusively propose a distillation framework that contains global molecular distillation and local atom distillation. We also provide a theoretical insight to justify how to coordinate atom and molecular information, which tackles the drawback of variable molecular size for atom information distillation. Experimental results on two popular molecular datasets demonstrate that our proposed model achieves superior performance over other methods. Specifically, on the largest MPP dataset PCQM4Mv2 served as an "ImageNet Large Scale Visual Recognition Challenge" in the field of graph ML, the proposed method achieved a 6.9% improvement compared with the best works. And we obtained fourth place with the MAE of 0.0734 on the test-challenge set for OGB-LSC 2022 Graph Regression Task. We will release the code soon.
Abstract（参考訳）: 近年,分子グラフ表現学習 (GRL) が分子特性予測 (MPP) 問題において注目されている。既存のグラフ手法では、3次元幾何情報がmppの性能向上に重要であることが示されている。しかし、正確な3D構造は高コストで入手に時間がかかることが多く、GRLの大規模適用が制限される。これは、3dから2dの知識蒸留で訓練し、2d入力だけで予測する直感的な解決策である。しかし、3Dから2Dへの蒸留には難しい問題がある。 1つは3次元ビューが2次元ビューとは全く異なることであり、もう1つは、蒸留中の原子の勾配の等級が、分子の大きさの変動により分散し不安定であることである。これらの課題に対処するため,グローバルな分子蒸留と局所的な原子蒸留を含む蒸留フレームワークを提案する。また、原子情報蒸留における可変分子サイズの欠点に対処するため、原子と分子情報のコーディネート方法を正当化する理論的な洞察を与える。 2つのポピュラーな分子データセットにおける実験結果から,提案手法は他の手法よりも優れた性能が得られた。具体的には、最も大きなMPPデータセットであるPCQM4Mv2は、グラフMLの分野で"ImageNet Large Scale Visual Recognition Challenge"として機能し、提案手法はベストワークに比べて6.9%改善された。また,OGB-LSC 2022 Graph Regression TaskにおけるテストチェレンジセットのMAEは0.0734で4位となった。私たちはすぐにコードをリリースします。

論文の概要: Coordinating Cross-modal Distillation for Molecular Property Prediction

関連論文リスト