• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过主动学习获取用于金属有机框架中高效性质预测的信息性训练数据

Informative Training Data for Efficient Property Prediction in Metal-Organic Frameworks by Active Learning.

作者信息

Jose Ashna, Devijver Emilie, Jakse Noel, Poloni Roberta

机构信息

SIMaP, Grenoble-INP, CNRS, University of Grenoble Alpes, Grenoble 38042, France.

LiG, Grenoble-INP, CNRS, University of Grenoble Alpes, Grenoble 38042, France.

出版信息

J Am Chem Soc. 2024 Mar 6;146(9):6134-6144. doi: 10.1021/jacs.3c13687. Epub 2024 Feb 25.

DOI:10.1021/jacs.3c13687
PMID:38404041
Abstract

In recent data-driven approaches to material discovery, scenarios where target quantities are expensive to compute and measure are often overlooked. In such cases, it becomes imperative to construct a training set that includes the most diverse, representative, and informative samples. Here, a novel regression tree-based active learning algorithm is employed for such a purpose. It is applied to predict the band gap and adsorption properties of metal-organic frameworks (MOFs), a novel class of materials that results from the virtually infinite combinations of their building units. Simpler and low dimensional descriptors, such as those based on stoichiometric and geometric properties, are used to compute the feature space for this model owing to their ability to better represent MOFs in the low data regime. The partitions given by a regression tree constructed on the labeled part of the data set are used to select new samples to be added to the training set, thereby limiting its size while maximizing the prediction quality. Tests on the QMOF, hMOF, and dMOF data sets reveal that our method constructs small training data sets to learn regression models that predict the target properties more efficiently than existing active learning approaches, and with lower variance. Specifically, our active learning approach is highly beneficial when labels are unevenly distributed in the descriptor space and when the label distribution is imbalanced, which is often the case for real world data. The regions defined by the tree help in revealing patterns in the data, thereby offering a unique tool to efficiently analyze complex structure-property relationships in materials and accelerate materials discovery.

摘要

在近期基于数据驱动的材料发现方法中,目标量计算和测量成本高昂的情况常常被忽视。在这种情况下,构建一个包含最多样化、最具代表性和信息量最大的样本的训练集就变得势在必行。在此,一种基于回归树的新型主动学习算法被用于此目的。它被应用于预测金属有机框架(MOF)的带隙和吸附特性,MOF是一类新型材料,由其构建单元的几乎无限组合产生。由于能够在低数据量情况下更好地表示MOF,更简单和低维的描述符,如基于化学计量和几何性质的描述符,被用于计算该模型的特征空间。在数据集的标记部分构建的回归树给出的划分用于选择要添加到训练集中的新样本,从而在限制其大小的同时最大化预测质量。对QMOF、hMOF和dMOF数据集的测试表明,我们的方法构建了小的训练数据集来学习回归模型,该模型比现有的主动学习方法更有效地预测目标特性,且方差更低。具体而言,当标签在描述符空间中分布不均匀以及标签分布不平衡时,我们的主动学习方法非常有益,而这在现实世界数据中经常出现。树所定义的区域有助于揭示数据中的模式,从而提供了一个独特的工具来有效分析材料中复杂的结构-性质关系并加速材料发现。

相似文献

1
Informative Training Data for Efficient Property Prediction in Metal-Organic Frameworks by Active Learning.通过主动学习获取用于金属有机框架中高效性质预测的信息性训练数据
J Am Chem Soc. 2024 Mar 6;146(9):6134-6144. doi: 10.1021/jacs.3c13687. Epub 2024 Feb 25.
2
Machine Learning Using Combined Structural and Chemical Descriptors for Prediction of Methane Adsorption Performance of Metal Organic Frameworks (MOFs).基于结构和化学描述符的机器学习在预测金属有机骨架(MOFs)甲烷吸附性能中的应用。
ACS Comb Sci. 2017 Oct 9;19(10):640-645. doi: 10.1021/acscombsci.7b00056. Epub 2017 Sep 5.
3
Accelerating Discovery of Metal-Organic Frameworks for Methane Adsorption with Hierarchical Screening and Deep Learning.通过分级筛选和深度学习加速用于甲烷吸附的金属有机框架的发现
ACS Appl Mater Interfaces. 2020 Nov 25;12(47):52797-52807. doi: 10.1021/acsami.0c16516. Epub 2020 Nov 11.
4
Leveraging Machine Learning for Metal-Organic Frameworks: A Perspective.利用机器学习研究金属有机框架:一种观点。
Langmuir. 2023 Nov 14;39(45):15849-15863. doi: 10.1021/acs.langmuir.3c01964. Epub 2023 Nov 3.
5
Prediction of O/N Selectivity in Metal-Organic Frameworks via High-Throughput Computational Screening and Machine Learning.通过高通量计算筛选和机器学习预测金属有机框架中的O/N选择性
ACS Appl Mater Interfaces. 2022 Jan 12;14(1):736-749. doi: 10.1021/acsami.1c18521. Epub 2021 Dec 20.
6
The Importance of Highly Connected Building Units in Reticular Chemistry: Thoughtful Design of Metal-Organic Frameworks.高度连接的建筑单元在网状化学中的重要性:金属-有机框架的深思熟虑的设计。
Acc Chem Res. 2021 Sep 7;54(17):3298-3312. doi: 10.1021/acs.accounts.1c00214. Epub 2021 Jul 6.
7
Accelerating In Silico Discovery of Metal-Organic Frameworks for Ethane/Ethylene and Propane/Propylene Separation: A Synergistic Approach Integrating Molecular Simulation, Machine Learning, and Active Learning.加速用于乙烷/乙烯和丙烷/丙烯分离的金属有机框架的计算机辅助发现:一种整合分子模拟、机器学习和主动学习的协同方法。
ACS Appl Mater Interfaces. 2024 Feb 14;16(6):6971-6987. doi: 10.1021/acsami.3c14505. Epub 2024 Jan 30.
8
ML-Tree: a tree-structure-based approach to multilabel learning.ML-Tree:一种基于树结构的多标签学习方法。
IEEE Trans Neural Netw Learn Syst. 2015 Mar;26(3):430-43. doi: 10.1109/TNNLS.2014.2315296. Epub 2014 Apr 29.
9
Machine Learning Prediction on Properties of Nanoporous Materials Utilizing Pore Geometry Barcodes.利用孔隙几何条码对纳米多孔材料性能的机器学习预测。
J Chem Inf Model. 2019 Nov 25;59(11):4636-4644. doi: 10.1021/acs.jcim.9b00623. Epub 2019 Nov 12.
10
Enhancing Structure-Property Relationships in Porous Materials through Transfer Learning and Cross-Material Few-Shot Learning.通过迁移学习和跨材料少样本学习增强多孔材料中的结构-性能关系
ACS Appl Mater Interfaces. 2023 Dec 6;15(48):56375-56385. doi: 10.1021/acsami.3c10323. Epub 2023 Nov 20.

引用本文的文献

1
MOSAEC-DB: a comprehensive database of experimental metal-organic frameworks with verified chemical accuracy suitable for molecular simulations.MOSAEC-DB:一个具有经过验证的化学准确性、适用于分子模拟的实验性金属有机框架的综合数据库。
Chem Sci. 2025 Jan 31;16(9):4085-4100. doi: 10.1039/d4sc07438f. eCollection 2025 Feb 26.
2
From Data to Discovery: Recent Trends of Machine Learning in Metal-Organic Frameworks.从数据到发现:金属有机框架中机器学习的最新趋势
JACS Au. 2024 Sep 12;4(10):3727-3743. doi: 10.1021/jacsau.4c00618. eCollection 2024 Oct 28.