• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过机器学习加速配方设计:生成高通量洗发水配方数据集

Accelerating Formulation Design via Machine Learning: Generating a High-throughput Shampoo Formulations Dataset.

作者信息

Chitre Aniket, Querimit Robert C M, Rihm Simon D, Karan Dogancan, Zhu Benchuan, Wang Ke, Wang Long, Hippalgaonkar Kedar, Lapkin Alexei A

机构信息

Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge, CB3 0AS, UK.

Cambridge Centre for Advanced Research and Education in Singapore, CARES Ltd. 1 CREATE Way, CREATE Tower #05-05, Singapore, 138602, Singapore.

出版信息

Sci Data. 2024 Jul 3;11(1):728. doi: 10.1038/s41597-024-03573-w.

DOI:10.1038/s41597-024-03573-w
PMID:38961122
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11222379/
Abstract

Liquid formulations are ubiquitous yet have lengthy product development cycles owing to the complex physical interactions between ingredients making it difficult to tune formulations to customer-defined property targets. Interpolative ML models can accelerate liquid formulations design but are typically trained on limited sets of ingredients and without any structural information, which limits their out-of-training predictive capacity. To address this challenge, we selected eighteen formulation ingredients covering a diverse chemical space to prepare an open experimental dataset for training ML models for rinse-off formulations development. The resulting design space has an over 50-fold increase in dimensionality compared to our previous work. Here, we present a dataset of 812 formulations, including 294 stable samples, which cover the entire design space, with phase stability, turbidity, and high-fidelity rheology measurements generated on our semi-automated, ML-driven liquid formulations workflow. Our dataset has the unique attribute of sample-specific uncertainty measurements to train predictive surrogate models.

摘要

液体制剂无处不在,但由于成分之间复杂的物理相互作用,其产品开发周期漫长,难以将制剂调整到客户定义的性能目标。插值机器学习模型可以加速液体制剂设计,但通常是在有限的成分集上进行训练,且没有任何结构信息,这限制了它们在训练之外的预测能力。为应对这一挑战,我们选择了涵盖不同化学空间的18种制剂成分,以制备一个开放的实验数据集,用于训练用于冲洗型制剂开发的机器学习模型。与我们之前的工作相比,由此产生的设计空间维度增加了50多倍。在这里,我们展示了一个包含812种制剂的数据集,其中包括294个稳定样品,这些样品覆盖了整个设计空间,并通过我们的半自动、机器学习驱动的液体制剂工作流程生成了相稳定性、浊度和高保真流变学测量结果。我们的数据集具有样本特定不确定性测量的独特属性,可用于训练预测替代模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/77385a911fd8/41597_2024_3573_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/42fbffa883c2/41597_2024_3573_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/4cf4ca5deeec/41597_2024_3573_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/d1e89bdd002c/41597_2024_3573_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/9866fe4445fa/41597_2024_3573_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/23c23192928e/41597_2024_3573_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/6cabc3d00f4a/41597_2024_3573_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/8ba0df8aecf7/41597_2024_3573_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/77385a911fd8/41597_2024_3573_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/42fbffa883c2/41597_2024_3573_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/4cf4ca5deeec/41597_2024_3573_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/d1e89bdd002c/41597_2024_3573_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/9866fe4445fa/41597_2024_3573_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/23c23192928e/41597_2024_3573_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/6cabc3d00f4a/41597_2024_3573_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/8ba0df8aecf7/41597_2024_3573_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4098/11222379/77385a911fd8/41597_2024_3573_Fig8_HTML.jpg

相似文献

1
Accelerating Formulation Design via Machine Learning: Generating a High-throughput Shampoo Formulations Dataset.通过机器学习加速配方设计:生成高通量洗发水配方数据集
Sci Data. 2024 Jul 3;11(1):728. doi: 10.1038/s41597-024-03573-w.
2
High throughput workflow for coacervate formation and characterization in shampoo systems.
J Cosmet Sci. 2007 Jul-Aug;58(4):375-83.
3
Data-driven development of an oral lipid-based nanoparticle formulation of a hydrophobic drug.基于数据驱动的方法开发疏水性药物的口服脂质纳米粒制剂。
Drug Deliv Transl Res. 2024 Jul;14(7):1872-1887. doi: 10.1007/s13346-023-01491-9. Epub 2023 Dec 29.
4
High-throughput screening and stability optimization of anti-streptavidin IgG1 and IgG2 formulations.抗链霉抗生物素蛋白IgG1和IgG2制剂的高通量筛选及稳定性优化
J Biomol Screen. 2014 Oct;19(9):1290-301. doi: 10.1177/1087057114542431. Epub 2014 Jul 14.
5
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
6
Optimizing interactions between soluble silk fibroin and capryl glucoside for design of a natural and high-performance co-surfactant system.优化丝素蛋白可溶性与辛基葡萄糖苷之间的相互作用,设计一种天然且高性能的协同表面活性剂体系。
Int J Cosmet Sci. 2021 Feb;43(1):68-77. doi: 10.1111/ics.12676. Epub 2020 Dec 30.
7
Hair Washing Formulations from Todaro Gel: The Potential for Making Hair Shampoo.来自托达罗凝胶的洗发配方:制造洗发水的潜力。
Adv Pharmacol Pharm Sci. 2020 Aug 29;2020:8835120. doi: 10.1155/2020/8835120. eCollection 2020.
8
Combining machine learning and high-throughput experimentation to discover photocatalytically active organic molecules.结合机器学习与高通量实验以发现具有光催化活性的有机分子。
Chem Sci. 2021 Jun 21;12(32):10742-10754. doi: 10.1039/d1sc02150h. eCollection 2021 Aug 18.
9
Predicting the Performance of Functional Materials Composed of Polymeric Multicomponent Systems Using Artificial Intelligence-Formulations of Cleansing Foams as an Example.以清洁泡沫配方为例,利用人工智能预测由聚合物多组分体系组成的功能材料的性能。
Polymers (Basel). 2023 Oct 25;15(21):4216. doi: 10.3390/polym15214216.
10
The graphical design space for the primary drying phase of freeze Drying: Factors affecting the dried product layer resistance.冷冻干燥初级干燥阶段的图形设计空间:影响干燥产品层阻力的因素。
Int J Pharm. 2023 Jan 5;630:122417. doi: 10.1016/j.ijpharm.2022.122417. Epub 2022 Nov 19.

本文引用的文献

1
Autonomous, multiproperty-driven molecular discovery: From predictions to measurements and back.自主的、多属性驱动的分子发现:从预测到测量再回归。
Science. 2023 Dec 22;382(6677):eadi1407. doi: 10.1126/science.adi1407.
2
FAIR for AI: An interdisciplinary and international community building perspective.公平对待人工智能:跨学科和国际化的社区建设视角。
Sci Data. 2023 Jul 26;10(1):487. doi: 10.1038/s41597-023-02298-6.
3
Addressing the sustainability challenges for polymers in liquid formulations.应对液体制剂中聚合物的可持续性挑战。
Chem Sci. 2023 May 31;14(25):6820-6825. doi: 10.1039/d3sc90086j. eCollection 2023 Jun 28.
4
Machine learning models to accelerate the design of polymeric long-acting injectables.机器学习模型加速聚合物长效注射剂的设计。
Nat Commun. 2023 Jan 10;14(1):35. doi: 10.1038/s41467-022-35343-w.
5
Automation isn't automatic.自动化并非自然而然就能实现的。
Chem Sci. 2021 Oct 27;12(47):15473-15490. doi: 10.1039/d1sc04588a. eCollection 2021 Dec 8.
6
Automated High-Throughput System Combining Small-Scale Synthesis with Bioassays and Reaction Screening.自动化高通量系统结合小规模合成与生物测定和反应筛选。
SLAS Technol. 2021 Dec;26(6):555-571. doi: 10.1177/24726303211047839. Epub 2021 Oct 26.
7
Molecular Dynamics Simulations and Experimental Results Provide Insight into Clinical Performance Differences between Sandimmune® and Neoral® Lipid-Based Formulations.分子动力学模拟和实验结果深入了解了 Sandimmune®和 Neoral®两种基于脂质的制剂在临床性能上的差异。
Pharm Res. 2021 Sep;38(9):1531-1547. doi: 10.1007/s11095-021-03099-5. Epub 2021 Sep 24.
8
Design of Biopharmaceutical Formulations Accelerated by Machine Learning.基于机器学习的生物制剂配方设计。
Mol Pharm. 2021 Oct 4;18(10):3843-3853. doi: 10.1021/acs.molpharmaceut.1c00469. Epub 2021 Sep 14.
9
Bias free multiobjective active learning for materials design and discovery.无偏多目标主动学习在材料设计与发现中的应用。
Nat Commun. 2021 Apr 19;12(1):2312. doi: 10.1038/s41467-021-22437-0.
10
Automated solubility screening platform using computer vision.使用计算机视觉的自动化溶解度筛选平台。
iScience. 2021 Feb 12;24(3):102176. doi: 10.1016/j.isci.2021.102176. eCollection 2021 Mar 19.