Lu Qing
Beijing National Laboratory for Molecular Sciences, Institute of Chemistry, Chinese Academy of Sciences Beijing 100190 China
RSC Adv. 2021 Nov 5;11(57):35879-35886. doi: 10.1039/d1ra05752a. eCollection 2021 Nov 4.
Molecular structure recognition is fundamental in computational chemistry. The most common approach is to calculate the root mean square deviation (RMSD) between two sets of molecular coordinates. However, this method does not perform well for large molecules. In this work, a new method is proposed for structure comparison. Blob detection is used for recognizing structural features. Fragmentation of molecules is proposed as the pre-treatment. Mapping between blobs and atoms is developed as the post-treatment. A set of key parameters important for blob detections are determined. The dissimilarity is quantified by calculating the Euclidean metric of the blob vectors. The overall algorithm is found to be accurate to distinguish structural dissimilarity. The method has potential to be combined with other pattern recognition techniques for new chemistry discoveries.
分子结构识别是计算化学的基础。最常见的方法是计算两组分子坐标之间的均方根偏差(RMSD)。然而,这种方法对于大分子效果不佳。在这项工作中,提出了一种用于结构比较的新方法。使用斑点检测来识别结构特征。提出将分子碎片化作为预处理。开发斑点与原子之间的映射作为后处理。确定了一组对斑点检测很重要的关键参数。通过计算斑点向量的欧几里得度量来量化差异。发现整个算法在区分结构差异方面是准确的。该方法有潜力与其他模式识别技术相结合以实现新的化学发现。