• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用无结构门捷列夫编码对电池化合物进行分类。

Classification of battery compounds using structure-free Mendeleev encodings.

作者信息

Zhuang Zixin, Barnard Amanda S

机构信息

School of Computing, Australian National University, 145 Science Road, Acton, 2601, ACT, Australia.

出版信息

J Cheminform. 2024 Apr 26;16(1):47. doi: 10.1186/s13321-024-00836-x.

DOI:10.1186/s13321-024-00836-x
PMID:38671512
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11055346/
Abstract

Machine learning is a valuable tool that can accelerate the discovery and design of materials occupying combinatorial chemical spaces. However, the prerequisite need for vast amounts of training data can be prohibitive when significant resources are needed to characterize or simulate candidate structures. Recent results have shown that structure-free encoding of complex materials, based entirely on chemical compositions, can overcome this impediment and perform well in unsupervised learning tasks. In this study, we extend this exploration to supervised classification, and show how structure-free encoding can accurately predict classes of material compounds for battery applications without time consuming measurement of bonding networks, lattices or densities. SCIENTIFIC CONTRIBUTION: The comprehensive evaluation of structure-free encodings of complex materials in classification tasks, including binary and multi-class separation, inclusive of three classifiers based on different logic function, is measured four metrics and learning curves. The encoding is applied to two data sets from computational and experimental sources, and the outcomes visualised using 5 approaches to confirms the suitability and superiority of Mendeleev encoding. These methods are general and accessible using source software, to provide simple, intuitive and interpretable materials informatics outcomes to accelerate materials design.

摘要

机器学习是一种有价值的工具,可加速对占据组合化学空间的材料的发现和设计。然而,当需要大量资源来表征或模拟候选结构时,对大量训练数据的前提需求可能会令人望而却步。最近的结果表明,完全基于化学成分的复杂材料的无结构编码可以克服这一障碍,并在无监督学习任务中表现良好。在本研究中,我们将这一探索扩展到监督分类,并展示无结构编码如何能够准确预测用于电池应用的材料化合物类别,而无需耗时测量键合网络、晶格或密度。科学贡献:在分类任务中对复杂材料的无结构编码进行全面评估,包括二元和多类分离,涵盖基于不同逻辑函数的三个分类器,通过四个指标和学习曲线进行衡量。该编码应用于来自计算和实验来源的两个数据集,并使用5种方法对结果进行可视化,以确认门捷列夫编码的适用性和优越性。这些方法具有通用性,可通过源软件获取,以提供简单、直观且可解释的材料信息学结果,从而加速材料设计。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/f65b1644fe42/13321_2024_836_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/b4eb656116f4/13321_2024_836_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/d27b336d1668/13321_2024_836_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/120532ab8b2b/13321_2024_836_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/3d1552a3ad22/13321_2024_836_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/a6f50dcc3cbf/13321_2024_836_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/bfd66b7cddbe/13321_2024_836_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/5840b84600a1/13321_2024_836_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/8c6d7dc5f2ae/13321_2024_836_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/ad52c598ddcd/13321_2024_836_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/f65b1644fe42/13321_2024_836_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/b4eb656116f4/13321_2024_836_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/d27b336d1668/13321_2024_836_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/120532ab8b2b/13321_2024_836_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/3d1552a3ad22/13321_2024_836_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/a6f50dcc3cbf/13321_2024_836_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/bfd66b7cddbe/13321_2024_836_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/5840b84600a1/13321_2024_836_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/8c6d7dc5f2ae/13321_2024_836_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/ad52c598ddcd/13321_2024_836_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/135b/11055346/f65b1644fe42/13321_2024_836_Fig10_HTML.jpg

相似文献

1
Classification of battery compounds using structure-free Mendeleev encodings.使用无结构门捷列夫编码对电池化合物进行分类。
J Cheminform. 2024 Apr 26;16(1):47. doi: 10.1186/s13321-024-00836-x.
2
Interpretable molecular encodings and representations for machine learning tasks.用于机器学习任务的可解释分子编码和表示。
Comput Struct Biotechnol J. 2024 May 24;23:2326-2336. doi: 10.1016/j.csbj.2024.05.035. eCollection 2024 Dec.
3
GeneralizedDTA: combining pre-training and multi-task learning to predict drug-target binding affinity for unknown drug discovery.通用 DTA:结合预训练和多任务学习,预测未知药物发现的药物-靶标结合亲和力。
BMC Bioinformatics. 2022 Sep 7;23(1):367. doi: 10.1186/s12859-022-04905-6.
4
Comprehensive study of semi-supervised learning for DNA methylation-based supervised classification of central nervous system tumors.基于 DNA 甲基化的中枢神经系统肿瘤有监督分类的半监督学习综合研究。
BMC Bioinformatics. 2022 Jun 8;23(1):223. doi: 10.1186/s12859-022-04764-1.
5
Learning chemistry: exploring the suitability of machine learning for the task of structure-based chemical ontology classification.学习化学:探索机器学习在基于结构的化学本体分类任务中的适用性。
J Cheminform. 2021 Mar 16;13(1):23. doi: 10.1186/s13321-021-00500-8.
6
Robust identification of molecular phenotypes using semi-supervised learning.基于半监督学习的稳健分子表型识别。
BMC Bioinformatics. 2019 May 28;20(1):273. doi: 10.1186/s12859-019-2885-3.
7
Mapping the Porous and Chemical Structure-Function Relationships of Trace CHI Capture by Metal-Organic Frameworks using Machine Learning.利用机器学习绘制金属有机框架捕获痕量CHI的多孔结构和化学结构-功能关系
ACS Appl Mater Interfaces. 2022 Oct 19;14(41):47209-47221. doi: 10.1021/acsami.2c10861. Epub 2022 Oct 5.
8
Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction.比较多种机器学习算法和指标进行雌激素受体结合预测。
Mol Pharm. 2018 Oct 1;15(10):4361-4370. doi: 10.1021/acs.molpharmaceut.8b00546. Epub 2018 Aug 28.
9
Objective Supervised Machine Learning-Based Classification and Inference of Biological Neuronal Networks.基于监督机器学习的生物神经网络分类与推断。
Molecules. 2022 Sep 23;27(19):6256. doi: 10.3390/molecules27196256.
10
Planning Implications Related to Sterilization-Sensitive Science Investigations Associated with Mars Sample Return (MSR).与火星样本返回(MSR)相关的对灭菌敏感的科学研究的规划意义。
Astrobiology. 2022 Jun;22(S1):S112-S164. doi: 10.1089/AST.2021.0113. Epub 2022 May 19.

引用本文的文献

1
Solutions for Lithium Battery Materials Data Issues in Machine Learning: Overview and Future Outlook.机器学习中锂电池材料数据问题的解决方案:概述与未来展望
Adv Sci (Weinh). 2024 Dec;11(48):e2410065. doi: 10.1002/advs.202410065. Epub 2024 Nov 18.

本文引用的文献

1
Explainable discovery of disease biomarkers: The case of ovarian cancer to illustrate the best practice in machine learning and Shapley analysis.可解释的疾病生物标志物发现:以卵巢癌为例说明机器学习和 Shapley 分析的最佳实践。
J Biomed Inform. 2023 May;141:104365. doi: 10.1016/j.jbi.2023.104365. Epub 2023 Apr 14.
2
Classification of medicines and materials in hospital inventory management: a multi-criteria analysis.医院库存管理中的药品和材料分类:多标准分析。
BMC Med Inform Decis Mak. 2022 Dec 8;22(1):325. doi: 10.1186/s12911-022-02069-0.
3
Catalysis of Alloys: Classification, Principles, and Design for a Variety of Materials and Reactions.
合金催化:各类材料与反应的分类、原理及设计
Chem Rev. 2023 May 10;123(9):5859-5947. doi: 10.1021/acs.chemrev.2c00356. Epub 2022 Sep 28.
4
Nanoparticle classification, physicochemical properties, characterization, and applications: a comprehensive review for biologists.纳米颗粒分类、物理化学特性、表征及应用:生物学综合评述。
J Nanobiotechnology. 2022 Jun 7;20(1):262. doi: 10.1186/s12951-022-01477-8.
5
Highly accurate machine learning prediction of crystal point groups for ternary materials from chemical formula.基于化学式对三元材料晶体学点群进行高精度机器学习预测。
Sci Rep. 2022 Jan 28;12(1):1577. doi: 10.1038/s41598-022-05642-9.
6
Combining Machine Learning and Computational Chemistry for Predictive Insights Into Chemical Systems.结合机器学习和计算化学,对化学系统进行预测性洞察。
Chem Rev. 2021 Aug 25;121(16):9816-9872. doi: 10.1021/acs.chemrev.1c00107. Epub 2021 Jul 7.
7
Impact of atomistic or crystallographic descriptors for classification of gold nanoparticles.
Nanoscale. 2021 Jul 15;13(27):11887-11898. doi: 10.1039/d1nr02258j.
8
Big data and machine learning for materials science.用于材料科学的大数据与机器学习
Discov Mater. 2021;1(1):12. doi: 10.1007/s43939-021-00012-0. Epub 2021 Apr 19.
9
Machine Learning: Algorithms, Real-World Applications and Research Directions.机器学习:算法、实际应用与研究方向。
SN Comput Sci. 2021;2(3):160. doi: 10.1007/s42979-021-00592-x. Epub 2021 Mar 22.
10
Unsupervised structure classes vs. supervised property classes of silicon quantum dots using neural networks.使用神经网络对硅量子点进行无监督结构分类与有监督性质分类。
Nanoscale Horiz. 2021 Mar 1;6(3):277-282. doi: 10.1039/d0nh00637h. Epub 2021 Feb 2.