• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过结合化学见解增强的预训练模型对光谱和结构描述符进行跨模态预测。

Cross-Modal Prediction of Spectral and Structural Descriptors via a Pretrained Model Enhanced with Chemical Insights.

作者信息

Yang Guokun, Jiang Shuang, Luo Yi, Wang Song, Jiang Jun

机构信息

Key Laboratory of Precision and Intelligent Chemistry, School of Chemistry and Materials Science, University of Science and Technology of China, Hefei, Anhui 230026, P. R. China.

出版信息

J Phys Chem Lett. 2024 Aug 29;15(34):8766-8772. doi: 10.1021/acs.jpclett.4c02129. Epub 2024 Aug 20.

DOI:10.1021/acs.jpclett.4c02129
PMID:39163398
Abstract

Proposing and utilizing machine learning descriptors for chemical property prediction and material screening have become a cutting-edge field in artificial intelligence-enabled chemical research. However, a single descriptor typically captures only partial features of a chemical object, resulting in an information deficiency and limiting generalizability. Obtaining a comprehensive set of descriptors is essential but challenging, especially when accessing some microlevel structural and electronic features due to technological limitations. Herein, we exploit multimodal chemical descriptors to construct an encoder-decoder machine learning framework that enables the cross-modal prediction of spectral and structural descriptors. By pretraining the model to endow it with chemical insights, the multimodal data fusion is implemented in a descriptor-encoded hidden layer. The model's capabilities are validated in the system of CO/NO adsorption on Au/Ag surfaces, demonstrating successful reciprocal prediction of infrared spectra, Raman spectra, and internal coordinates. This work provides a proof-of-concept for the feasibility of cross-modal predictions between different chemical features and will significantly reduce the machine learning model's dependence on complete physicochemical parameters and improve its multitarget prediction capabilities.

摘要

提出并利用机器学习描述符进行化学性质预测和材料筛选已成为人工智能辅助化学研究中的一个前沿领域。然而,单个描述符通常只能捕捉化学对象的部分特征,导致信息不足并限制了通用性。获取一组全面的描述符至关重要但具有挑战性,特别是由于技术限制而难以获取一些微观结构和电子特征时。在此,我们利用多模态化学描述符构建了一个编码器 - 解码器机器学习框架,该框架能够对光谱和结构描述符进行跨模态预测。通过对模型进行预训练以赋予其化学见解,多模态数据融合在描述符编码的隐藏层中实现。该模型的能力在CO/NO在Au/Ag表面吸附的系统中得到验证,证明了对红外光谱、拉曼光谱和内部坐标的成功相互预测。这项工作为不同化学特征之间跨模态预测的可行性提供了概念验证,并将显著降低机器学习模型对完整物理化学参数的依赖,提高其多目标预测能力。

相似文献

1
Cross-Modal Prediction of Spectral and Structural Descriptors via a Pretrained Model Enhanced with Chemical Insights.通过结合化学见解增强的预训练模型对光谱和结构描述符进行跨模态预测。
J Phys Chem Lett. 2024 Aug 29;15(34):8766-8772. doi: 10.1021/acs.jpclett.4c02129. Epub 2024 Aug 20.
2
Transfer Learning: Making Retrosynthetic Predictions Based on a Small Chemical Reaction Dataset Scale to a New Level.迁移学习:基于小规模化学反应数据集的逆向合成预测扩展到新的水平。
Molecules. 2020 May 19;25(10):2357. doi: 10.3390/molecules25102357.
3
Harnessing Shannon entropy-based descriptors in machine learning models to enhance the prediction accuracy of molecular properties.在机器学习模型中利用基于香农熵的描述符来提高分子性质的预测准确性。
J Cheminform. 2023 May 21;15(1):54. doi: 10.1186/s13321-023-00712-0.
4
Descriptor-augmented machine learning for enzyme-chemical interaction predictions.用于酶-化学相互作用预测的描述符增强机器学习
Synth Syst Biotechnol. 2024 Feb 28;9(2):259-268. doi: 10.1016/j.synbio.2024.02.006. eCollection 2024 Jun.
5
Predicting human odor perception represented by continuous values from mass spectra of essential oils resembling chemical mixtures.从类似化学混合物的精油的质谱中预测连续值表示的人类气味感知。
PLoS One. 2020 Jun 19;15(6):e0234688. doi: 10.1371/journal.pone.0234688. eCollection 2020.
6
A BERT-based pretraining model for extracting molecular structural information from a SMILES sequence.一种基于BERT的预训练模型,用于从SMILES序列中提取分子结构信息。
J Cheminform. 2024 Jun 19;16(1):71. doi: 10.1186/s13321-024-00848-7.
7
Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。
Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.
8
Machine Learning Using Combined Structural and Chemical Descriptors for Prediction of Methane Adsorption Performance of Metal Organic Frameworks (MOFs).基于结构和化学描述符的机器学习在预测金属有机骨架(MOFs)甲烷吸附性能中的应用。
ACS Comb Sci. 2017 Oct 9;19(10):640-645. doi: 10.1021/acscombsci.7b00056. Epub 2017 Sep 5.
9
An Ensemble Structure and Physicochemical (SPOC) Descriptor for Machine-Learning Prediction of Chemical Reaction and Molecular Properties.用于机器学习预测化学反应和分子性质的集成结构和物理化学(SPOC)描述符。
Chemphyschem. 2022 Jul 19;23(14):e202200255. doi: 10.1002/cphc.202200255. Epub 2022 May 19.
10
Machine learning of spectra-property relationship for imperfect and small chemistry data.光谱-性质关系的机器学习研究:针对不完整和小化学数据集
Proc Natl Acad Sci U S A. 2023 May 16;120(20):e2220789120. doi: 10.1073/pnas.2220789120. Epub 2023 May 8.