Kensert Alexander, Desmet Gert, Cabooter Deirdre
Pharmaceutical and Pharmacological Sciences, KU Leuven, Herestraat 49, 3000, Leuven, Belgium.
Chemical Engineering, Vrije Universiteit Brussel, Pleinlaan 2, 1050, Brussel, Belgium.
J Comput Aided Mol Des. 2024 Dec 5;39(1):3. doi: 10.1007/s10822-024-00578-w.
Molecular machine learning (ML) has proven important for tackling various molecular problems, such as predicting molecular properties based on molecular descriptors or fingerprints. Since relatively recently, graph neural network (GNN) algorithms have been implemented for molecular ML, showing comparable or superior performance to descriptor or fingerprint-based approaches. Although various tools and packages exist to apply GNNs in molecular ML, a new GNN package, named MolGraph, was developed in this work with the motivation to create GNN model pipelines highly compatible with the TensorFlow and Keras application programming interface (API). MolGraph also implements a module to accommodate the generation of small molecular graphs, which can be passed to a GNN algorithm to solve a molecular ML problem. To validate the GNNs, benchmarking was conducted using the datasets from MoleculeNet, as well as three chromatographic retention time datasets. The benchmarking results demonstrate that the GNNs performed in line with expectations. Additionally, the GNNs proved useful for molecular identification and improved interpretability of chromatographic retention time data. MolGraph is available at https://github.com/akensert/molgraph . Installation, tutorials and implementation details can be found at https://molgraph.readthedocs.io/en/latest/ .
分子机器学习(ML)已被证明在解决各种分子问题方面很重要,例如基于分子描述符或指纹预测分子性质。直到最近,图神经网络(GNN)算法才被应用于分子ML,其表现与基于描述符或指纹的方法相当或更优。尽管存在各种工具和软件包可将GNN应用于分子ML,但在本研究中开发了一个名为MolGraph的新GNN软件包,目的是创建与TensorFlow和Keras应用程序编程接口(API)高度兼容的GNN模型管道。MolGraph还实现了一个模块来生成小分子图,这些图可传递给GNN算法以解决分子ML问题。为了验证GNN,使用来自MoleculeNet的数据集以及三个色谱保留时间数据集进行了基准测试。基准测试结果表明,GNN的表现符合预期。此外,GNN被证明对分子识别有用,并提高了色谱保留时间数据的可解释性。MolGraph可在https://github.com/akensert/molgraph获取。安装、教程和实现细节可在https://molgraph.readthedocs.io/en/latest/找到。