Lu Xin-Yu, Wu Hao-Ping, Ma Hao, Li Hui, Li Jia, Liu Yan-Ti, Pan Zheng-Yan, Xie Yi, Wang Lei, Ren Bin, Liu Guo-Kun
State Key Laboratory of Physical Chemistry of Solid Surfaces, Collaborative Innovation Center of Chemistry for Energy Materials (iChEM), College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, P. R. China.
Tan Kah Kee Innovation Laboratory, Xiamen 361005, P. R. China.
Anal Chem. 2024 May 21;96(20):7959-7975. doi: 10.1021/acs.analchem.4c01639. Epub 2024 Apr 25.
Spectrum-structure correlation is playing an increasingly crucial role in spectral analysis and has undergone significant development in recent decades. With the advancement of spectrometers, the high-throughput detection triggers the explosive growth of spectral data, and the research extension from small molecules to biomolecules accompanies massive chemical space. Facing the evolving landscape of spectrum-structure correlation, conventional chemometrics becomes ill-equipped, and deep learning assisted chemometrics rapidly emerges as a flourishing approach with superior ability of extracting latent features and making precise predictions. In this review, the molecular and spectral representations and fundamental knowledge of deep learning are first introduced. We then summarize the development of how deep learning assist to establish the correlation between spectrum and molecular structure in the recent 5 years, by empowering spectral prediction (i.e., forward structure-spectrum correlation) and further enabling library matching and molecular generation (i.e., inverse spectrum-structure correlation). Finally, we highlight the most important open issues persisted with corresponding potential solutions. With the fast development of deep learning, it is expected to see ultimate solution of establishing spectrum-structure correlation soon, which would trigger substantial development of various disciplines.
光谱-结构相关性在光谱分析中发挥着越来越关键的作用,并且在近几十年中取得了显著发展。随着光谱仪的进步,高通量检测引发了光谱数据的爆炸式增长,并且研究范围从小分子扩展到生物分子伴随着巨大的化学空间。面对不断演变的光谱-结构相关性格局,传统化学计量学显得力不从心,而深度学习辅助化学计量学迅速崛起,成为一种蓬勃发展的方法,具有卓越的提取潜在特征和进行精确预测的能力。在这篇综述中,首先介绍了深度学习的分子和光谱表示以及基础知识。然后,我们总结了深度学习在最近5年中如何通过赋能光谱预测(即正向结构-光谱相关性)以及进一步实现库匹配和分子生成(即反向光谱-结构相关性)来协助建立光谱与分子结构之间的相关性。最后,我们强调了仍然存在的最重要的开放性问题以及相应的潜在解决方案。随着深度学习的快速发展,预计很快就能看到建立光谱-结构相关性的最终解决方案,这将引发各个学科的实质性发展。