• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个快速而准确的量子力学特性计算的开源框架。

An open-source framework for fast-yet-accurate calculation of quantum mechanical features.

机构信息

Data Science and Modelling, Pharmaceutical Sciences, R & D, AstraZeneca, Gothenburg, Sweden.

出版信息

Phys Chem Chem Phys. 2022 May 4;24(17):10599-10610. doi: 10.1039/d2cp01165d.

DOI:10.1039/d2cp01165d
PMID:35446335
Abstract

We present the open-source framework that enables the efficient and robust calculation of quantum mechanical features for atoms and molecules. For a benchmark set of 49 experimental molecular polarizabilities, the predictive power of the presented method competes against second-order perturbation theory in a converged atomic-orbital basis set at a fraction of its computational costs. The calculation of isotropic molecular polarizabilities is robust for a data set of more than 80 000 molecules. We present furthermore a generally applicable van der Waals radius model that is rooted on atomic static polarizabilites. Efficiency tests show that such radii can even be calculated for small- to medium-size proteins where the largest system (SARS-CoV-2 spike protein) has 42 539 atoms. Following the work of Domingo-Alemenara [Domingo-Alemenara , 2019, , 5811], we present computational predictions for retention times for different chromatographic methods and describe how physicochemical features improve the predictive power of machine-learning models that otherwise only rely on two-dimensional features like molecular fingerprints. Additionally, we developed an internal benchmark set of experimental super-critical fluid chromatography retention times. For those methods, improvements of up to 10.6% are obtained when combining molecular fingerprints with physicochemical descriptors. Shapley additive explanation values show furthermore that the physical nature of the applied features can be retained within the final machine-learning models. We generally recommend the framework as a robust, low-cost, and physically motivated featurizer for upcoming state-of-the-art machine-learning studies.

摘要

我们提出了一个开源框架,能够高效、稳健地计算原子和分子的量子力学特性。对于一组 49 个实验分子极化率的基准集,所提出方法的预测能力在收敛原子轨道基组中与二级微扰理论相竞争,但其计算成本仅为其一小部分。各向同性分子极化率的计算对于超过 80000 个分子的数据集是稳健的。我们还提出了一种普遍适用的范德华半径模型,该模型基于原子静态极化率。效率测试表明,对于中小尺寸的蛋白质,甚至可以计算出这样的半径,其中最大的系统(SARS-CoV-2 刺突蛋白)有 42539 个原子。遵循 Domingo-Alemenara 的工作 [Domingo-Alemenara ,2019 , ,5811],我们为不同的色谱方法预测了保留时间,并描述了物理化学特征如何提高仅依赖于二维特征(如分子指纹)的机器学习模型的预测能力。此外,我们还开发了一个实验超临界流体色谱保留时间的内部基准集。对于这些方法,当将分子指纹与物理化学描述符结合使用时,可以将保留时间提高多达 10.6%。Shapley 加法解释值还表明,所应用特征的物理性质可以保留在最终的机器学习模型中。我们一般建议将 框架作为一种稳健、低成本且具有物理意义的特征提取器,用于即将到来的最先进的机器学习研究。

相似文献

1
An open-source framework for fast-yet-accurate calculation of quantum mechanical features.一个快速而准确的量子力学特性计算的开源框架。
Phys Chem Chem Phys. 2022 May 4;24(17):10599-10610. doi: 10.1039/d2cp01165d.
2
Static all-atom energetic mappings of the SARS-Cov-2 spike protein and dynamic stability analysis of "Up" versus "Down" protomer states.SARS-CoV-2 刺突蛋白的静态全原子能量映射和“向上”与“向下”构象状态的动态稳定性分析。
PLoS One. 2020 Nov 10;15(11):e0241168. doi: 10.1371/journal.pone.0241168. eCollection 2020.
3
Comprehensive characterization of the antibody responses to SARS-CoV-2 Spike protein finds additional vaccine-induced epitopes beyond those for mild infection.全面描述了针对 SARS-CoV-2 刺突蛋白的抗体反应,发现了除轻度感染诱导的表位之外的其他疫苗诱导的表位。
Elife. 2022 Jan 24;11:e73490. doi: 10.7554/eLife.73490.
4
Theoretical thermodynamics for large molecules: walking the thin line between accuracy and computational cost.大分子的理论热力学:在准确性和计算成本之间走钢丝。
Acc Chem Res. 2008 Apr;41(4):569-79. doi: 10.1021/ar700208h. Epub 2008 Mar 7.
5
Using Automated Machine Learning to Predict the Mortality of Patients With COVID-19: Prediction Model Development Study.利用自动化机器学习预测 COVID-19 患者的死亡率:预测模型开发研究。
J Med Internet Res. 2021 Feb 26;23(2):e23458. doi: 10.2196/23458.
6
Measurement of SARS-CoV-2 Antibody Titers Improves the Prediction Accuracy of COVID-19 Maximum Severity by Machine Learning in Non-Vaccinated Patients.基于机器学习的非接种患者 SARS-CoV-2 抗体滴度测量可提高 COVID-19 严重程度的预测精度。
Front Immunol. 2022 Jan 21;13:811952. doi: 10.3389/fimmu.2022.811952. eCollection 2022.
7
The Development and Validation of Simplified Machine Learning Algorithms to Predict Prognosis of Hospitalized Patients With COVID-19: Multicenter, Retrospective Study.中文译文:简化机器学习算法预测 COVID-19 住院患者预后的开发和验证:多中心回顾性研究。
J Med Internet Res. 2022 Jan 21;24(1):e31549. doi: 10.2196/31549.
8
Deep Learning Total Energies and Orbital Energies of Large Organic Molecules Using Hybridization of Molecular Fingerprints.使用分子指纹杂交深度学习大型有机分子的总能量和轨道能量。
J Chem Inf Model. 2020 Dec 28;60(12):5971-5983. doi: 10.1021/acs.jcim.0c00687. Epub 2020 Oct 29.
9
Machine learning in computational docking.计算对接中的机器学习。
Artif Intell Med. 2015 Mar;63(3):135-52. doi: 10.1016/j.artmed.2015.02.002. Epub 2015 Feb 16.
10
COVID-19 Mortality Prediction From Deep Learning in a Large Multistate Electronic Health Record and Laboratory Information System Data Set: Algorithm Development and Validation.基于大型多状态电子健康记录和实验室信息系统数据集的深度学习预测 COVID-19 死亡率:算法开发与验证。
J Med Internet Res. 2021 Sep 28;23(9):e30157. doi: 10.2196/30157.

引用本文的文献

1
QupKake: Integrating Machine Learning and Quantum Chemistry for Micro-p Predictions.QupKake:将机器学习与量子化学相结合用于微观预测。
J Chem Theory Comput. 2024 Aug 13;20(15):6946-6956. doi: 10.1021/acs.jctc.4c00328. Epub 2024 Jun 4.
2
POxload: Machine Learning Estimates Drug Loadings of Polymeric Micelles.POxload:用于估算聚合物胶束药物载量的机器学习方法。
Mol Pharm. 2024 Jul 1;21(7):3356-3374. doi: 10.1021/acs.molpharmaceut.4c00086. Epub 2024 May 28.
3
Fast calculation of hydrogen-bond strengths and free energy of hydration of small molecules.
快速计算小分子的氢键强度和水合自由能。
Sci Rep. 2023 Mar 13;13(1):4143. doi: 10.1038/s41598-023-30089-x.