Guo Renfeng, Zhang Youjia, Liao Yuxuan, Yang Qiong, Xie Ting, Fan Xiaqiong, Lin Zhonglong, Chen Yi, Lu Hongmei, Zhang Zhimin
College of Chemistry and Chemical Engineering, Central South University, 410083, Changsha, China.
School of Computer Science and Technology, Huazhong University of Science and Technology, 430074, Wuhan, China.
Commun Chem. 2023 Jul 4;6(1):139. doi: 10.1038/s42004-023-00939-w.
The collision cross section (CCS) values derived from ion mobility spectrometry can be used to improve the accuracy of compound identification. Here, we have developed the Structure included graph merging with adduct method for CCS prediction (SigmaCCS) based on graph neural networks using 3D conformers as inputs. A model was trained, evaluated, and tested with >5,000 experimental CCS values. It achieved a coefficient of determination of 0.9945 and a median relative error of 1.1751% on the test set. The model-agnostic interpretation method and the visualization of the learned representations were used to investigate the chemical rationality of SigmaCCS. An in-silico database with 282 million CCS values was generated for three different adduct types of 94 million compounds. Its source code is publicly available at https://github.com/zmzhang/SigmaCCS . Altogether, SigmaCCS is an accurate, rational, and off-the-shelf method to directly predict CCS values from molecular structures.
从离子淌度光谱法得出的碰撞截面(CCS)值可用于提高化合物鉴定的准确性。在此,我们基于图神经网络,以三维构象体作为输入,开发了用于CCS预测的包含结构的图合并加合物方法(SigmaCCS)。使用超过5000个实验性CCS值对一个模型进行了训练、评估和测试。在测试集上,它的决定系数达到0.9945,中位相对误差为1.1751%。使用模型无关解释方法和学习表征的可视化来研究SigmaCCS的化学合理性。针对9400万个化合物的三种不同加合物类型,生成了一个包含2.82亿个CCS值的虚拟数据库。其源代码可在https://github.com/zmzhang/SigmaCCS上公开获取。总之,SigmaCCS是一种准确、合理且现成可用的方法,可直接从分子结构预测CCS值。