• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DigiMOF:通过文本挖掘生成的金属有机框架合成信息数据库。

DigiMOF: A Database of Metal-Organic Framework Synthesis Information Generated via Text Mining.

作者信息

Glasby Lawson T, Gubsch Kristian, Bence Rosalee, Oktavian Rama, Isoko Kesler, Moosavi Seyed Mohamad, Cordiner Joan L, Cole Jason C, Moghadam Peyman Z

机构信息

Department of Chemical and Biological Engineering, The University of Sheffield, Sheffield S1 3JD, U.K.

Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, Ontario M5S 3E5, Canada.

出版信息

Chem Mater. 2023 May 18;35(11):4510-4524. doi: 10.1021/acs.chemmater.3c00788. eCollection 2023 Jun 13.

DOI:10.1021/acs.chemmater.3c00788
PMID:37332681
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10269341/
Abstract

The vastness of materials space, particularly that which is concerned with metal-organic frameworks (MOFs), creates the critical problem of performing efficient identification of promising materials for specific applications. Although high-throughput computational approaches, including the use of machine learning, have been useful in rapid screening and rational design of MOFs, they tend to neglect descriptors related to their synthesis. One way to improve the efficiency of MOF discovery is to data-mine published MOF papers to extract the materials informatics knowledge contained within journal articles. Here, by adapting the chemistry-aware natural language processing tool, ChemDataExtractor (CDE), we generated an open-source database of MOFs focused on their synthetic properties: the DigiMOF database. Using the CDE web scraping package alongside the Cambridge Structural Database (CSD) MOF subset, we automatically downloaded 43,281 unique MOF journal articles, extracted 15,501 unique MOF materials, and text-mined over 52,680 associated properties including the synthesis method, solvent, organic linker, metal precursor, and topology. Additionally, we developed an alternative data extraction technique to obtain and transform the chemical names assigned to each CSD entry in order to determine linker types for each structure in the CSD MOF subset. This data enabled us to match MOFs to a list of known linkers provided by Tokyo Chemical Industry UK Ltd. (TCI) and analyze the cost of these important chemicals. This centralized, structured database reveals the MOF synthetic data embedded within thousands of MOF publications and contains further topology, metal type, accessible surface area, largest cavity diameter, pore limiting diameter, open metal sites, and density calculations for all 3D MOFs in the CSD MOF subset. The DigiMOF database and associated software are publicly available for other researchers to rapidly search for MOFs with specific properties, conduct further analysis of alternative MOF production pathways, and create additional parsers to search for additional desirable properties.

摘要

材料空间的广阔性,尤其是与金属有机框架(MOF)相关的部分,带来了一个关键问题,即如何高效识别适用于特定应用的有前景的材料。尽管包括机器学习在内的高通量计算方法在MOF的快速筛选和合理设计中很有用,但它们往往忽略了与其合成相关的描述符。提高MOF发现效率的一种方法是对已发表的MOF论文进行数据挖掘,以提取期刊文章中包含的材料信息学知识。在这里,通过改编化学感知自然语言处理工具ChemDataExtractor(CDE),我们生成了一个专注于MOF合成性质的开源数据库:DigiMOF数据库。使用CDE网络爬虫包以及剑桥结构数据库(CSD)的MOF子集,我们自动下载了43281篇独特的MOF期刊文章,提取了15501种独特的MOF材料,并对超过52680个相关性质进行了文本挖掘,包括合成方法、溶剂、有机连接体、金属前驱体和拓扑结构。此外,我们开发了一种替代数据提取技术,以获取并转换分配给每个CSD条目的化学名称,从而确定CSD MOF子集中每个结构的连接体类型。这些数据使我们能够将MOF与英国东京化学工业有限公司(TCI)提供的已知连接体列表进行匹配,并分析这些重要化学品的成本。这个集中的、结构化的数据库揭示了数千篇MOF出版物中嵌入的MOF合成数据,并包含了CSD MOF子集中所有3D MOF的进一步拓扑结构、金属类型、可及表面积、最大空腔直径、孔隙限制直径、开放金属位点和密度计算。DigiMOF数据库及相关软件可供其他研究人员公开使用,以便快速搜索具有特定性质的MOF,对替代MOF生产途径进行进一步分析,并创建额外的解析器以搜索其他所需性质。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/388e68483884/cm3c00788_0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/75c672e93026/cm3c00788_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/b919eb3db315/cm3c00788_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/d6bc960d11da/cm3c00788_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/3b2bd1fe5fb6/cm3c00788_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/0c6a0b656dac/cm3c00788_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/551999d4eb9c/cm3c00788_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/3010a24d02b3/cm3c00788_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/701263856876/cm3c00788_0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/794ada1ca5e7/cm3c00788_0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/388e68483884/cm3c00788_0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/75c672e93026/cm3c00788_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/b919eb3db315/cm3c00788_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/d6bc960d11da/cm3c00788_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/3b2bd1fe5fb6/cm3c00788_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/0c6a0b656dac/cm3c00788_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/551999d4eb9c/cm3c00788_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/3010a24d02b3/cm3c00788_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/701263856876/cm3c00788_0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/794ada1ca5e7/cm3c00788_0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1f4/10269341/388e68483884/cm3c00788_0011.jpg

相似文献

1
DigiMOF: A Database of Metal-Organic Framework Synthesis Information Generated via Text Mining.DigiMOF:通过文本挖掘生成的金属有机框架合成信息数据库。
Chem Mater. 2023 May 18;35(11):4510-4524. doi: 10.1021/acs.chemmater.3c00788. eCollection 2023 Jun 13.
2
[Application of gas chromatography separation based on metal-organic framework material as stationary phase].基于金属有机骨架材料作为固定相的气相色谱分离应用
Se Pu. 2021 Jan;39(1):57-68. doi: 10.3724/SP.J.1123.2020.06028.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Targeted classification of metal-organic frameworks in the Cambridge structural database (CSD).剑桥结构数据库(CSD)中金属有机框架的靶向分类
Chem Sci. 2020 Jun 17;11(32):8373-8387. doi: 10.1039/d0sc01297a. eCollection 2020 Aug 21.
5
Physiochemical characterization of metal organic framework materials: A mini review.金属有机骨架材料的物理化学表征:一篇综述短文
Heliyon. 2023 Dec 16;10(1):e23840. doi: 10.1016/j.heliyon.2023.e23840. eCollection 2024 Jan 15.
6
Stepwise Synthesis of Metal-Organic Frameworks.逐步合成金属有机骨架。
Acc Chem Res. 2017 Apr 18;50(4):857-865. doi: 10.1021/acs.accounts.6b00457. Epub 2017 Mar 28.
7
Topological Characterization of Metal-Organic Frameworks: A Perspective.金属有机框架材料的拓扑表征:综述
Chem Mater. 2024 Jul 22;36(19):9013-9030. doi: 10.1021/acs.chemmater.4c00762. eCollection 2024 Oct 8.
8
MOF Synthesis Prediction Enabled by Automatic Data Mining and Machine Learning.基于自动数据挖掘和机器学习的 MOF 合成预测。
Angew Chem Int Ed Engl. 2022 May 2;61(19):e202200242. doi: 10.1002/anie.202200242. Epub 2022 Mar 10.
9
Postsynthetic Tuning of Metal-Organic Frameworks for Targeted Applications.后合成调谐金属-有机骨架用于靶向应用。
Acc Chem Res. 2017 Apr 18;50(4):805-813. doi: 10.1021/acs.accounts.6b00577. Epub 2017 Feb 8.
10
Charge Transport in Zirconium-Based Metal-Organic Frameworks.锆基金属有机框架中的电荷输运。
Acc Chem Res. 2020 Jun 16;53(6):1187-1195. doi: 10.1021/acs.accounts.0c00106. Epub 2020 May 13.

引用本文的文献

1
Metal-Organic Frameworks for Precision Catalysis.用于精准催化的金属有机框架
Precis Chem. 2025 Jun 12;3(8):478-479. doi: 10.1021/prechem.5c00052. eCollection 2025 Aug 25.
2
Artificial Intelligence Paradigms for Next-Generation Metal-Organic Framework Research.面向下一代金属有机框架研究的人工智能范式
J Am Chem Soc. 2025 Jul 9;147(27):23367-23380. doi: 10.1021/jacs.5c08214. Epub 2025 Jun 24.
3
Biomedical Metal-Organic Framework Materials: Perspectives and Challenges.生物医学金属有机框架材料:前景与挑战

本文引用的文献

1
Computational Characterization of Zr-Oxide MOFs for Adsorption Applications.用于吸附应用的 Zr-氧化物 MOFs 的计算特性研究。
ACS Appl Mater Interfaces. 2022 Dec 28;14(51):56938-56947. doi: 10.1021/acsami.2c13391. Epub 2022 Dec 14.
2
Data-driven design of molecular nanomagnets.基于数据驱动的分子纳米磁体设计。
Nat Commun. 2022 Dec 9;13(1):7626. doi: 10.1038/s41467-022-35336-9.
3
BatteryDataExtractor: battery-aware text-mining software embedded with BERT models.电池数据提取器:嵌入BERT模型的电池感知文本挖掘软件。
Adv Funct Mater. 2023 Nov 21;34(43). doi: 10.1002/adfm.202308589. eCollection 2024 Oct.
4
From Data to Discovery: Recent Trends of Machine Learning in Metal-Organic Frameworks.从数据到发现:金属有机框架中机器学习的最新趋势
JACS Au. 2024 Sep 12;4(10):3727-3743. doi: 10.1021/jacsau.4c00618. eCollection 2024 Oct 28.
5
Topological Characterization of Metal-Organic Frameworks: A Perspective.金属有机框架材料的拓扑表征:综述
Chem Mater. 2024 Jul 22;36(19):9013-9030. doi: 10.1021/acs.chemmater.4c00762. eCollection 2024 Oct 8.
6
Crystal Structure Landscape of Diarylethene-Based Crystalline Solids: A Comprehensive CSD Analysis.基于二芳基乙烯的晶体固体的晶体结构全景:一项全面的剑桥晶体结构数据库分析
Cryst Growth Des. 2024 Jul 23;24(15):6284-6291. doi: 10.1021/acs.cgd.4c00556. eCollection 2024 Aug 7.
7
Deep learning-based recommendation system for metal-organic frameworks (MOFs).基于深度学习的金属有机框架(MOF)推荐系统
Digit Discov. 2024 Jun 10;3(7):1410-1420. doi: 10.1039/d4dd00116h. eCollection 2024 Jul 10.
8
ChatMOF: an artificial intelligence system for predicting and generating metal-organic frameworks using large language models.ChatMOF:一种使用大语言模型预测和生成金属有机框架的人工智能系统。
Nat Commun. 2024 Jun 3;15(1):4705. doi: 10.1038/s41467-024-48998-4.
9
Expanding the Horizons of Machine Learning in Nanomaterials to Chiral Nanostructures.将机器学习在纳米材料领域的应用拓展至手性纳米结构
Adv Mater. 2024 May;36(18):e2308912. doi: 10.1002/adma.202308912. Epub 2024 Feb 3.
10
Accelerated Discovery of Metal-Organic Frameworks for CO Capture by Artificial Intelligence.通过人工智能加速发现用于捕获二氧化碳的金属有机框架材料。
Ind Eng Chem Res. 2023 Dec 25;63(1):37-48. doi: 10.1021/acs.iecr.3c03817. eCollection 2024 Jan 10.
Chem Sci. 2022 Sep 23;13(39):11487-11495. doi: 10.1039/d2sc04322j. eCollection 2022 Oct 12.
4
Mining Insights on Metal-Organic Framework Synthesis from Scientific Literature Texts.从科学文献文本中挖掘金属有机框架合成的见解
J Chem Inf Model. 2022 Mar 14;62(5):1190-1198. doi: 10.1021/acs.jcim.1c01297. Epub 2022 Feb 23.
5
MOF Synthesis Prediction Enabled by Automatic Data Mining and Machine Learning.基于自动数据挖掘和机器学习的 MOF 合成预测。
Angew Chem Int Ed Engl. 2022 May 2;61(19):e202200242. doi: 10.1002/anie.202200242. Epub 2022 Mar 10.
6
The development of a comprehensive toolbox based on multi-level, high-throughput screening of MOFs for CO/N separations.基于多级高通量筛选金属有机框架用于CO/N分离的综合工具箱的开发。
Chem Sci. 2021 Aug 11;12(36):12068-12081. doi: 10.1039/d1sc01588e. eCollection 2021 Sep 22.
7
Programmable Logic in Metal-Organic Frameworks for Catalysis.用于催化的金属有机框架中的可编程逻辑
Adv Mater. 2021 Nov;33(46):e2007442. doi: 10.1002/adma.202007442. Epub 2021 May 28.
8
Metal-Organic Frameworks for Drug Delivery: A Design Perspective.金属有机框架用于药物传递:设计视角。
ACS Appl Mater Interfaces. 2021 Feb 17;13(6):7004-7020. doi: 10.1021/acsami.1c01089. Epub 2021 Feb 7.
9
Targeted classification of metal-organic frameworks in the Cambridge structural database (CSD).剑桥结构数据库(CSD)中金属有机框架的靶向分类
Chem Sci. 2020 Jun 17;11(32):8373-8387. doi: 10.1039/d0sc01297a. eCollection 2020 Aug 21.
10
Wiz: A Web-Based Tool for Interactive Visualization of Big Data.Wiz:一个用于大数据交互式可视化的基于网络的工具。
Patterns (N Y). 2020 Sep 23;1(8):100107. doi: 10.1016/j.patter.2020.100107. eCollection 2020 Nov 13.