• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大小无关紧要:基于数十种分子预测物理或生化性质。

Size Doesn't Matter: Predicting Physico- or Biochemical Properties Based on Dozens of Molecules.

机构信息

Department of Chemistry, Lomonosov Moscow State University, Leninskie Gory 1, Building 3, Moscow 119991, Russia.

Science Data Software, LLC, 14909 Forest Landing Circle, Rockville, Maryland 20850, United States.

出版信息

J Phys Chem Lett. 2021 Sep 30;12(38):9213-9219. doi: 10.1021/acs.jpclett.1c02477. Epub 2021 Sep 16.

DOI:10.1021/acs.jpclett.1c02477
PMID:34529429
Abstract

The use of machine learning in chemistry has become a common practice. At the same time, despite the success of modern machine learning methods, the lack of data limits their use. Using a transfer learning methodology can help solve this problem. This methodology assumes that a model built on a sufficient amount of data captures general features of the chemical compound structure on which it was trained and that the further reuse of these features on a data set with a lack of data will greatly improve the quality of the new model. In this paper, we develop this approach for small organic molecules, implementing transfer learning with graph convolutional neural networks. The paper shows a significant improvement in the performance of the models for target properties with a lack of data. The effects of the data set composition on the model's quality and the applicability domain of the resulting models are also considered.

摘要

机器学习在化学中的应用已经成为一种常见做法。同时,尽管现代机器学习方法取得了成功,但数据的缺乏限制了它们的使用。使用迁移学习方法可以帮助解决这个问题。该方法假设,在足够数量的数据上构建的模型捕获了在其上进行训练的化学化合物结构的一般特征,并且在缺乏数据的数据集上进一步重用这些特征将极大地提高新模型的质量。在本文中,我们针对小分子开发了这种方法,实现了基于图卷积神经网络的迁移学习。该论文表明,在缺乏数据的情况下,目标属性的模型性能有了显著提高。还考虑了数据集组成对模型质量和所得模型适用域的影响。

相似文献

1
Size Doesn't Matter: Predicting Physico- or Biochemical Properties Based on Dozens of Molecules.大小无关紧要:基于数十种分子预测物理或生化性质。
J Phys Chem Lett. 2021 Sep 30;12(38):9213-9219. doi: 10.1021/acs.jpclett.1c02477. Epub 2021 Sep 16.
2
Modeling Physico-Chemical ADMET Endpoints with Multitask Graph Convolutional Networks.运用多任务图卷积网络模拟理化 ADMET 终点。
Molecules. 2019 Dec 21;25(1):44. doi: 10.3390/molecules25010044.
3
Graph Convolutional Neural Networks as "General-Purpose" Property Predictors: The Universality and Limits of Applicability.图卷积神经网络作为“通用”属性预测器:适用性的普遍性和局限性。
J Chem Inf Model. 2020 Jan 27;60(1):22-28. doi: 10.1021/acs.jcim.9b00587. Epub 2020 Jan 3.
4
Dual graph convolutional neural network for predicting chemical networks.双图卷积神经网络用于预测化学网络。
BMC Bioinformatics. 2020 Apr 23;21(Suppl 3):94. doi: 10.1186/s12859-020-3378-0.
5
Predicting Energetics Materials' Crystalline Density from Chemical Structure by Machine Learning.通过机器学习从化学结构预测能质材料的结晶密度。
J Chem Inf Model. 2021 May 24;61(5):2147-2158. doi: 10.1021/acs.jcim.0c01318. Epub 2021 Apr 26.
6
Explainable Machine Learning Framework for Image Classification Problems: Case Study on Glioma Cancer Prediction.用于图像分类问题的可解释机器学习框架:脑胶质瘤癌症预测案例研究
J Imaging. 2020 May 28;6(6):37. doi: 10.3390/jimaging6060037.
7
A deep dive into understanding tumor foci classification using multiparametric MRI based on convolutional neural network.基于卷积神经网络,深入探究利用多参数磁共振成像进行肿瘤病灶分类。
Med Phys. 2020 Sep;47(9):4077-4086. doi: 10.1002/mp.14255. Epub 2020 Jun 12.
8
Hybrid Transfer Learning for Classification of Uterine Cervix Images for Cervical Cancer Screening.基于混合迁移学习的宫颈癌筛查子宫颈图像分类
J Digit Imaging. 2020 Jun;33(3):619-631. doi: 10.1007/s10278-019-00269-1.
9
A transfer learning model with multi-source domains for biomedical event trigger extraction.一种用于生物医学事件触发词提取的多源域迁移学习模型。
BMC Genomics. 2021 Jan 7;22(1):31. doi: 10.1186/s12864-020-07315-1.
10
Predicting Materials Properties with Little Data Using Shotgun Transfer Learning.利用散弹枪迁移学习以少量数据预测材料属性
ACS Cent Sci. 2019 Oct 23;5(10):1717-1730. doi: 10.1021/acscentsci.9b00804. Epub 2019 Sep 30.

引用本文的文献

1
Formulation Graphs for Mapping Structure-Composition of Battery Electrolytes to Device Performance.用于将电池电解质的结构-组成映射到器件性能的配方图。
J Chem Inf Model. 2023 Nov 27;63(22):6998-7010. doi: 10.1021/acs.jcim.3c01030. Epub 2023 Nov 10.
2
A transfer learning approach for reaction discovery in small data situations using generative model.一种使用生成模型在小数据情况下进行反应发现的迁移学习方法。
iScience. 2022 Jun 22;25(7):104661. doi: 10.1016/j.isci.2022.104661. eCollection 2022 Jul 15.