• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于化学数据监督学习的无监督流形嵌入以编码分子量子信息。

Unsupervised manifold embedding to encode molecular quantum information for supervised learning of chemical data.

作者信息

Li Tonglei, Huls Nicholas J, Lu Shan, Hou Peng

机构信息

Deparment of Industrial and Molecular Pharmaceutics, Purdue University, West Lafayette, 47907, IN, USA.

出版信息

Commun Chem. 2024 Jun 11;7(1):133. doi: 10.1038/s42004-024-01217-z.

DOI:10.1038/s42004-024-01217-z
PMID:38862828
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11166954/
Abstract

Molecular representation is critical in chemical machine learning. It governs the complexity of model development and the fulfillment of training data to avoid either over- or under-fitting. As electronic structures and associated attributes are the root cause for molecular interactions and their manifested properties, we have sought to examine the local electron information on a molecular manifold to understand and predict molecular interactions. Our efforts led to the development of a lower-dimensional representation of a molecular manifold, Manifold Embedding of Molecular Surface (MEMS), to embody surface electronic quantities. By treating a molecular surface as a manifold and computing its embeddings, the embedded electronic attributes retain the chemical intuition of molecular interactions. MEMS can be further featurized as input for chemical learning. Our solubility prediction with MEMS demonstrated the feasibility of both shallow and deep learning by neural networks, suggesting that MEMS is expressive and robust against dimensionality reduction.

摘要

分子表示在化学机器学习中至关重要。它决定了模型开发的复杂性以及训练数据的完备性,以避免过拟合或欠拟合。由于电子结构及其相关属性是分子相互作用及其表现出的性质的根本原因,我们试图研究分子流形上的局部电子信息,以理解和预测分子相互作用。我们的努力促成了一种分子流形的低维表示——分子表面流形嵌入(MEMS)的发展,以体现表面电子量。通过将分子表面视为一个流形并计算其嵌入,嵌入的电子属性保留了分子相互作用的化学直观。MEMS可以进一步特征化作为化学学习的输入。我们用MEMS进行的溶解度预测证明了神经网络进行浅层和深度学习的可行性,表明MEMS在降维方面具有表现力和鲁棒性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/d4ec0a0cb2c6/42004_2024_1217_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/d79ca8404b15/42004_2024_1217_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/59b5bcbd3b6b/42004_2024_1217_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/28373b2cd19e/42004_2024_1217_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/1617fb27ae4f/42004_2024_1217_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/609a277a7014/42004_2024_1217_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/7df934f8f0d3/42004_2024_1217_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/08fb5ec7a274/42004_2024_1217_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/2dfe9092eff2/42004_2024_1217_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/cf8c5ab3663d/42004_2024_1217_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/d4ec0a0cb2c6/42004_2024_1217_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/d79ca8404b15/42004_2024_1217_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/59b5bcbd3b6b/42004_2024_1217_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/28373b2cd19e/42004_2024_1217_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/1617fb27ae4f/42004_2024_1217_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/609a277a7014/42004_2024_1217_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/7df934f8f0d3/42004_2024_1217_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/08fb5ec7a274/42004_2024_1217_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/2dfe9092eff2/42004_2024_1217_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/cf8c5ab3663d/42004_2024_1217_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d85/11166954/d4ec0a0cb2c6/42004_2024_1217_Fig10_HTML.jpg

相似文献

1
Unsupervised manifold embedding to encode molecular quantum information for supervised learning of chemical data.用于化学数据监督学习的无监督流形嵌入以编码分子量子信息。
Commun Chem. 2024 Jun 11;7(1):133. doi: 10.1038/s42004-024-01217-z.
2
Unsupervised robust discriminative manifold embedding with self-expressiveness.无监督鲁棒判别流形嵌入与自表达能力。
Neural Netw. 2019 May;113:102-115. doi: 10.1016/j.neunet.2018.11.003. Epub 2019 Jan 11.
3
Should We Embed in Chemistry? A Comparison of Unsupervised Transfer Learning with PCA, UMAP, and VAE on Molecular Fingerprints.我们应该融入化学吗?无监督迁移学习与主成分分析(PCA)、均匀流形近似与投影(UMAP)以及变分自编码器(VAE)在分子指纹上的比较。
Pharmaceuticals (Basel). 2021 Aug 2;14(8):758. doi: 10.3390/ph14080758.
4
Toward learning robust contrastive embeddings for binaural sound source localization.迈向用于双耳声源定位的学习鲁棒对比嵌入
Front Neuroinform. 2022 Nov 16;16:942978. doi: 10.3389/fninf.2022.942978. eCollection 2022.
5
Flexible manifold embedding: a framework for semi-supervised and unsupervised dimension reduction.柔性流形嵌入:一种半监督和无监督降维的框架。
IEEE Trans Image Process. 2010 Jul;19(7):1921-32. doi: 10.1109/TIP.2010.2044958. Epub 2010 Mar 8.
6
Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning.无监督和半监督流形学习的秩流嵌入。
IEEE Trans Image Process. 2023;32:2811-2826. doi: 10.1109/TIP.2023.3268868. Epub 2023 May 19.
7
A Study on Dimensionality Reduction and Parameters for Hyperspectral Imagery Based on Manifold Learning.基于流形学习的高光谱图像降维和参数研究
Sensors (Basel). 2024 Mar 25;24(7):2089. doi: 10.3390/s24072089.
8
Manifold adversarial training for supervised and semi-supervised learning.多流对抗训练用于监督学习和半监督学习。
Neural Netw. 2021 Aug;140:282-293. doi: 10.1016/j.neunet.2021.03.031. Epub 2021 Mar 26.
9
Unsupervised feature selection via latent representation learning and manifold regularization.基于潜在表示学习和流形正则化的无监督特征选择。
Neural Netw. 2019 Sep;117:163-178. doi: 10.1016/j.neunet.2019.04.015. Epub 2019 May 22.
10
How to approach machine learning-based prediction of drug/compound-target interactions.如何进行基于机器学习的药物/化合物-靶点相互作用预测。
J Cheminform. 2023 Feb 6;15(1):16. doi: 10.1186/s13321-023-00689-w.

引用本文的文献

1
Deep Learning Prediction of Drug-Induced Liver Toxicity by Manifold Embedding of Quantum Information of Drug Molecules.通过药物分子量子信息的流形嵌入实现药物性肝毒性的深度学习预测
Pharm Res. 2025 Jan;42(1):109-122. doi: 10.1007/s11095-024-03800-4. Epub 2024 Dec 12.

本文引用的文献

1
Exposing the Limitations of Molecular Machine Learning with Activity Cliffs.利用活性悬崖揭示分子机器学习的局限性。
J Chem Inf Model. 2022 Dec 12;62(23):5938-5951. doi: 10.1021/acs.jcim.2c01073. Epub 2022 Dec 1.
2
Informing geometric deep learning with electronic interactions to accelerate quantum chemistry.利用电子相互作用为几何深度学习提供信息,以加速量子化学。
Proc Natl Acad Sci U S A. 2022 Aug 2;119(31):e2205221119. doi: 10.1073/pnas.2205221119. Epub 2022 Jul 28.
3
Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models.
图神经网络能否为药物发现学习更好的分子表示?基于描述符和基于图的模型的比较研究。
J Cheminform. 2021 Feb 17;13(1):12. doi: 10.1186/s13321-020-00479-8.
4
Findings of the Second Challenge to Predict Aqueous Solubility.第二次预测水溶解度挑战的结果。
J Chem Inf Model. 2020 Oct 26;60(10):4791-4803. doi: 10.1021/acs.jcim.0c00701. Epub 2020 Sep 3.
5
A Comprehensive Survey on Graph Neural Networks.图神经网络综述。
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):4-24. doi: 10.1109/TNNLS.2020.2978386. Epub 2021 Jan 4.
6
Deciphering interaction fingerprints from protein molecular surfaces using geometric deep learning.利用几何深度学习破译蛋白质分子表面的相互作用指纹。
Nat Methods. 2020 Feb;17(2):184-192. doi: 10.1038/s41592-019-0666-6. Epub 2019 Dec 9.
7
Analyzing Learned Molecular Representations for Property Prediction.分析用于性质预测的学习分子表示。
J Chem Inf Model. 2019 Aug 26;59(8):3370-3388. doi: 10.1021/acs.jcim.9b00237. Epub 2019 Aug 13.
8
Solubility Challenge Revisited after Ten Years, with Multilab Shake-Flask Data, Using Tight (SD ∼ 0.17 log) and Loose (SD ∼ 0.62 log) Test Sets.十年后重新审视溶解度挑战,采用紧密(SD ∼ 0.17 log)和宽松(SD ∼ 0.62 log)测试集的多实验室摇瓶数据。
J Chem Inf Model. 2019 Jun 24;59(6):3036-3040. doi: 10.1021/acs.jcim.9b00345. Epub 2019 May 9.
9
Intercorrelation Limits in Molecular Descriptor Preselection for QSAR/QSPR.分子描述符预选的 QSAR/QSPR 中的互相关限制。
Mol Inform. 2019 Aug;38(8-9):e1800154. doi: 10.1002/minf.201800154. Epub 2019 Apr 4.
10
MoleculeNet: a benchmark for molecular machine learning.分子网络:分子机器学习的一个基准
Chem Sci. 2017 Oct 31;9(2):513-530. doi: 10.1039/c7sc02664a. eCollection 2018 Jan 14.