Using Jupyter Notebooks for re-training machine learning models.

Author information

Smajić Aljoša, Grandits Melanie, Ecker Gerhard F

Affiliation

Department of Pharmaceutical Sciences, University of Vienna, Vienna, Austria.

Publication information

J Cheminform. 2022 Aug 13;14(1):54. doi: 10.1186/s13321-022-00635-2.

Abstract

Machine learning (ML) models require an extensive, user-driven selection of molecular descriptors to learn from chemical structures and predict actives and inactives with high reliability. In addition, privacy concerns often restrict access to sufficient data, leading to models that cover a narrow chemical space. Therefore, we propose a framework of re-trainable models that can be transferred from one local instance to another and that allow a less extensive descriptor selection. The models are shared via a Jupyter Notebook, which keeps most of the tunable parameters pre-defined and so allows a broader chemical space to be evaluated and implemented. This enables the models to be updated in a decentralized, facile, and fast manner. The method was evaluated with six transporter datasets (BCRP, BSEP, OATP1B1, OATP1B3, MRP3, P-gp), which revealed the general applicability of this approach.
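
The re-training idea described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not name the learning algorithm, so a toy nearest-centroid classifier stands in for the shared model, and the two-value descriptor vectors, the labels, and the function names `train`, `retrain`, and `predict` are hypothetical. The only point it tries to show is that a recipient pools local data with the data the model was built on and refits with nothing left to tune.

```python
from statistics import fmean

# Hypothetical sketch: a toy nearest-centroid classifier stands in for the
# shared model. It has no tunable parameters, which mimics a notebook that
# ships with its settings pre-defined.

def train(rows, labels):
    """Return one centroid (mean descriptor vector) per class label."""
    centroids = {}
    for cls in set(labels):
        members = [r for r, y in zip(rows, labels) if y == cls]
        centroids[cls] = [fmean(col) for col in zip(*members)]
    return centroids

def retrain(base_rows, base_labels, local_rows, local_labels):
    """Pool the original data with local data and refit from scratch."""
    rows = list(base_rows) + list(local_rows)
    labels = list(base_labels) + list(local_labels)
    return train(rows, labels)

def predict(centroids, row):
    """Assign the class whose centroid is closest (squared Euclidean)."""
    def sq_dist(centroid):
        return sum((a - b) ** 2 for a, b in zip(row, centroid))
    return min(centroids, key=lambda cls: sq_dist(centroids[cls]))

# Made-up descriptor vectors; 1 = active, 0 = inactive.
base_rows = [(1.0, 1.0), (1.2, 0.8), (0.0, 0.0), (0.2, 0.2)]
base_labels = [1, 1, 0, 0]

# Made-up local compounds that extend the region covered by actives.
local_rows = [(0.9, 0.1), (1.1, 0.3)]
local_labels = [1, 1]

query = (0.7, 0.3)
base_model = train(base_rows, base_labels)
updated_model = retrain(base_rows, base_labels, local_rows, local_labels)

print(predict(base_model, query))     # 0: closer to the inactive centroid
print(predict(updated_model, query))  # 1: the active centroid has shifted
```

In this toy setting the query compound is classified as inactive by the base model and as active once the local actives are pooled in, which is the kind of broadened coverage the abstract refers to. A real notebook would replace the centroid model with the actual learner and descriptors.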

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c435/9375336/cb2f581cf033/13321_2022_635_Fig1_HTML.jpg
