• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开源机器学习在计算化学中的应用。

Open-Source Machine Learning in Computational Chemistry.

机构信息

Institute of Technology, Resource and Energy-Efficient Engineering (TREE), University of Applied Sciences Bonn-Rhein-Sieg, 53757 Sankt Augustin, Germany.

Department of Electrical Engineering, Mechanical Engineering and Technical Journalism, University of Applied Sciences Bonn-Rhein-Sieg, 53757 Sankt Augustin, Germany.

出版信息

J Chem Inf Model. 2023 Aug 14;63(15):4505-4532. doi: 10.1021/acs.jcim.3c00643. Epub 2023 Jul 19.

DOI:10.1021/acs.jcim.3c00643
PMID:37466636
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10430767/
Abstract

The field of computational chemistry has seen a significant increase in the integration of machine learning concepts and algorithms. In this Perspective, we surveyed 179 open-source software projects, with corresponding peer-reviewed papers published within the last 5 years, to better understand the topics within the field being investigated by machine learning approaches. For each project, we provide a short description, the link to the code, the accompanying license type, and whether the training data and resulting models are made publicly available. Based on those deposited in GitHub repositories, the most popular employed Python libraries are identified. We hope that this survey will serve as a resource to learn about machine learning or specific architectures thereof by identifying accessible codes with accompanying papers on a topic basis. To this end, we also include computational chemistry open-source software for generating training data and fundamental Python libraries for machine learning. Based on our observations and considering the three pillars of collaborative machine learning work, open data, open source (code), and open models, we provide some suggestions to the community.

摘要

计算化学领域已经看到机器学习概念和算法的集成显著增加。在本观点中,我们调查了 179 个开源软件项目,这些项目都有相应的同行评审论文在过去 5 年内发表,以更好地了解机器学习方法所研究的领域内的主题。对于每个项目,我们提供简短描述、代码链接、伴随的许可证类型,以及是否公开提供训练数据和生成的模型。基于那些存储在 GitHub 存储库中的项目,确定了最流行的使用 Python 库。我们希望通过基于主题识别具有伴随论文的可访问代码,将这个调查作为了解机器学习或其特定架构的资源。为此,我们还提供了用于生成训练数据的计算化学开源软件和用于机器学习的基本 Python 库。基于我们的观察,并考虑协作机器学习工作的三个支柱,即开放数据、开源(代码)和开放模型,我们向社区提供了一些建议。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/861a/10430767/d5a914a44000/ci3c00643_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/861a/10430767/5791c910e84e/ci3c00643_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/861a/10430767/286dcaab3955/ci3c00643_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/861a/10430767/d5a914a44000/ci3c00643_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/861a/10430767/5791c910e84e/ci3c00643_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/861a/10430767/286dcaab3955/ci3c00643_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/861a/10430767/d5a914a44000/ci3c00643_0003.jpg

相似文献

1
Open-Source Machine Learning in Computational Chemistry.开源机器学习在计算化学中的应用。
J Chem Inf Model. 2023 Aug 14;63(15):4505-4532. doi: 10.1021/acs.jcim.3c00643. Epub 2023 Jul 19.
2
ASAS-NANP symposium: mathematical modeling in animal nutrition-Making sense of big data and machine learning: how open-source code can advance training of animal scientists.ASAS-NANP 研讨会:动物营养中的数学建模——从大数据和机器学习中得出意义:开源代码如何促进动物科学家的培训。
J Anim Sci. 2023 Jan 3;101. doi: 10.1093/jas/skad317.
3
Ichor: A Python library for computational chemistry data management and machine learning force field development.Ichor:一个用于计算化学数据管理和机器学习力场开发的Python库。
J Comput Chem. 2024 Dec 15;45(32):2912-2928. doi: 10.1002/jcc.27477. Epub 2024 Aug 31.
4
PHOTONAI-A Python API for rapid machine learning model development.PHOTONAI-用于快速机器学习模型开发的 Python API。
PLoS One. 2021 Jul 21;16(7):e0254062. doi: 10.1371/journal.pone.0254062. eCollection 2021.
5
IDEPI: rapid prediction of HIV-1 antibody epitopes and other phenotypic features from sequence data using a flexible machine learning platform.IDEPI:使用灵活的机器学习平台从序列数据快速预测HIV-1抗体表位及其他表型特征。
PLoS Comput Biol. 2014 Sep 25;10(9):e1003842. doi: 10.1371/journal.pcbi.1003842. eCollection 2014 Sep.
6
ShinyLearner: A containerized benchmarking tool for machine-learning classification of tabular data.ShinyLearner:一个用于表格数据机器学习分类的容器化基准测试工具。
Gigascience. 2020 Apr 1;9(4). doi: 10.1093/gigascience/giaa026.
7
MMV_Im2Im: an open-source microscopy machine vision toolbox for image-to-image transformation.MMV_Im2Im:一个用于图像到图像转换的开源显微镜机器视觉工具箱。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giad120.
8
Semisupervised Machine Learning for Sensitive Open Modification Spectral Library Searching.基于半监督学习的敏感开放修饰谱库检索方法。
J Proteome Res. 2023 Feb 3;22(2):585-593. doi: 10.1021/acs.jproteome.2c00616. Epub 2023 Jan 23.
9
Deep learning algorithms applied to computational chemistry.应用于计算化学的深度学习算法。
Mol Divers. 2024 Aug;28(4):2375-2410. doi: 10.1007/s11030-023-10771-y. Epub 2023 Dec 27.
10
RatInABox, a toolkit for modelling locomotion and neuronal activity in continuous environments.盒子里的老鼠,一个用于模拟连续环境中运动和神经元活动的工具包。
Elife. 2024 Feb 9;13:e85274. doi: 10.7554/eLife.85274.

引用本文的文献

1
Enhancing diagnostic precision for thyroid C-TIRADS category 4 nodules: a hybrid deep learning and machine learning model integrating grayscale and elastographic ultrasound features.提高甲状腺C-TIRADS 4类结节的诊断准确性:一种整合灰阶和弹性成像超声特征的深度学习与机器学习混合模型
Quant Imaging Med Surg. 2025 Sep 1;15(9):7951-7963. doi: 10.21037/qims-2025-594. Epub 2025 Aug 11.
2
How to crack a SMILES: automatic crosschecked chemical structure resolution across multiple services using MoleculeResolver.如何破解SMILES:使用分子解析器跨多个服务自动交叉核对化学结构解析
J Cheminform. 2025 Aug 4;17(1):117. doi: 10.1186/s13321-025-01064-7.
3

本文引用的文献

1
Supervised learning of a chemistry functional with damped dispersion.带阻尼弥散的化学函数的有监督学习。
Nat Comput Sci. 2023 Jan;3(1):48-58. doi: 10.1038/s43588-022-00371-5. Epub 2022 Dec 23.
2
Exploring the Advantages of Quantum Generative Adversarial Networks in Generative Chemistry.探索量子生成对抗网络在生成化学中的优势。
J Chem Inf Model. 2023 Jun 12;63(11):3307-3318. doi: 10.1021/acs.jcim.3c00562. Epub 2023 May 12.
3
Basis for Accurate Protein p Prediction with Machine Learning.基于机器学习的蛋白质 p 值准确预测。
Data Checking of Asymmetric Catalysis Literature Using a Graph Neural Network Approach.
使用图神经网络方法对不对称催化文献进行数据检查
Molecules. 2025 Jan 16;30(2):355. doi: 10.3390/molecules30020355.
4
The Biology and Biochemistry of Kynurenic Acid, a Potential Nutraceutical with Multiple Biological Effects.犬尿酸的生物学和生物化学:一种具有多种生物学效应的潜在营养保健品。
Int J Mol Sci. 2024 Aug 21;25(16):9082. doi: 10.3390/ijms25169082.
5
RBPs: an RNA editor's choice.RNA结合蛋白:RNA编辑者的选择
Front Mol Biosci. 2024 Aug 6;11:1454241. doi: 10.3389/fmolb.2024.1454241. eCollection 2024.
6
Democratizing cheminformatics: interpretable chemical grouping using an automated KNIME workflow.化学信息学的民主化:使用自动化的KNIME工作流程进行可解释的化学分组
J Cheminform. 2024 Aug 16;16(1):101. doi: 10.1186/s13321-024-00894-1.
7
Machine Learning Prediction of Quantum Yields and Wavelengths of Aggregation-Induced Emission Molecules.聚集诱导发光分子量子产率和波长的机器学习预测
Materials (Basel). 2024 Apr 4;17(7):1664. doi: 10.3390/ma17071664.
J Chem Inf Model. 2023 May 22;63(10):2936-2947. doi: 10.1021/acs.jcim.3c00254. Epub 2023 May 5.
4
Exploring chemical compound space with quantum-based machine learning.利用基于量子的机器学习探索化合物空间。
Nat Rev Chem. 2020 Jul;4(7):347-358. doi: 10.1038/s41570-020-0189-9. Epub 2020 Jun 12.
5
Reorganization Energy Predictions with Graph Neural Networks Informed by Low-Cost Conformers.基于低成本构象体信息的图神经网络的重新组织能量预测。
J Phys Chem A. 2023 Apr 20;127(15):3484-3489. doi: 10.1021/acs.jpca.2c09030. Epub 2023 Apr 5.
6
HAC-Net: A Hybrid Attention-Based Convolutional Neural Network for Highly Accurate Protein-Ligand Binding Affinity Prediction.HAC-Net:一种基于混合注意力的卷积神经网络,用于高精度蛋白质-配体结合亲和力预测。
J Chem Inf Model. 2023 Apr 10;63(7):1947-1960. doi: 10.1021/acs.jcim.3c00251. Epub 2023 Mar 29.
7
MolScribe: Robust Molecular Structure Recognition with Image-to-Graph Generation.MolScribe:通过图像到图形生成实现强大的分子结构识别。
J Chem Inf Model. 2023 Apr 10;63(7):1925-1934. doi: 10.1021/acs.jcim.2c01480. Epub 2023 Mar 27.
8
AiZynthTrain: Robust, Reproducible, and Extensible Pipelines for Training Synthesis Prediction Models.AiZynthTrain:用于训练合成预测模型的强大、可重现且可扩展的管道。
J Chem Inf Model. 2023 Apr 10;63(7):1841-1846. doi: 10.1021/acs.jcim.2c01486. Epub 2023 Mar 23.
9
G2GT: Retrosynthesis Prediction with Graph-to-Graph Attention Neural Network and Self-Training.G2GT:基于图到图注意力神经网络和自训练的逆合成预测
J Chem Inf Model. 2023 Apr 10;63(7):1894-1905. doi: 10.1021/acs.jcim.2c01302. Epub 2023 Mar 22.
10
Determining Lennard-Jones Parameters Using Multiscale Target Data through Presampling-Enhanced, Surrogate-Assisted Global Optimization.通过预采样增强、代理辅助全局优化使用多尺度目标数据确定 Lennard-Jones 参数。
J Chem Inf Model. 2023 Apr 10;63(7):1872-1881. doi: 10.1021/acs.jcim.2c01231. Epub 2023 Mar 21.