• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

超越化学结构:面向下一代分子数据库的经验教训与指导原则

Beyond chemical structures: lessons and guiding principles for the next generation of molecular databases.

作者信息

Sommer Timo, Clarke Cian, García-Melchor Max

机构信息

School of Chemistry, CRANN and AMBER Research Centres, Trinity College Dublin, College Green Dublin 2 Ireland

Center for Cooperative Research on Alternative Energy (CIC EnergiGUNE), Basque Research and Technology Alliance (BRTA), Alava Technology Park Albert Einstein 48 01510 Vitoria-Gasteiz Spain.

出版信息

Chem Sci. 2024 Nov 28;16(3):1002-1016. doi: 10.1039/d4sc04064c. eCollection 2025 Jan 15.

DOI:10.1039/d4sc04064c
PMID:39660292
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11626465/
Abstract

Databases of molecules and materials are indispensable for advancing chemical research, especially when enriched with electronic structure information from quantum chemistry methods like density functional theory. In this perspective, we review and analyze the current landscape of materials and molecular databases containing quantum chemical data. Our analysis reveals that the materials community has significantly benefited from data platforms such as the Materials Project, which seamlessly integrate chemical structures, electronic structure data, and open-source software. Conversely, quantum chemical data for molecular systems remains largely fragmented across individual datasets, lacking the comprehensive framework of a unified database. We distilled insights from these existing data resources into seven guiding principles termed QUANTUM, which build upon the foundational FAIR principles of data sharing (Findable, Accessible, Interoperable, and Reusable). These principles are aimed at advancing the development of molecular databases into robust, integrated data platforms. We conclude with an outlook on both short- and long-term objectives, guided by these QUANTUM principles, to foster future advancements in molecular quantum databases and enhance their utility for the research community.

摘要

分子和材料数据库对于推动化学研究不可或缺,特别是当这些数据库富含来自量子化学方法(如密度泛函理论)的电子结构信息时。从这个角度来看,我们回顾并分析了包含量子化学数据的材料和分子数据库的当前格局。我们的分析表明,材料领域从诸如材料项目等数据平台中受益匪浅,这些平台无缝集成了化学结构、电子结构数据和开源软件。相反,分子系统的量子化学数据在很大程度上仍分散在各个数据集中,缺乏统一数据库的综合框架。我们从这些现有数据资源中提炼出七条指导原则,称为QUANTUM,它们建立在数据共享的基础FAIR原则(可查找、可访问、可互操作和可重用)之上。这些原则旨在推动分子数据库发展成为强大的集成数据平台。我们以短期和长期目标的展望作为结尾,在这些QUANTUM原则的指导下,促进分子量子数据库的未来发展,并提高它们对研究界的实用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/e5519339df6d/d4sc04064c-p3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/2f8e4692a279/d4sc04064c-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/7e141b781aa3/d4sc04064c-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/3353c219b3cb/d4sc04064c-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/6389c4ad066b/d4sc04064c-p1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/5f6dc803589c/d4sc04064c-p2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/e5519339df6d/d4sc04064c-p3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/2f8e4692a279/d4sc04064c-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/7e141b781aa3/d4sc04064c-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/3353c219b3cb/d4sc04064c-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/6389c4ad066b/d4sc04064c-p1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/5f6dc803589c/d4sc04064c-p2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cf0/11734157/e5519339df6d/d4sc04064c-p3.jpg

相似文献

1
Beyond chemical structures: lessons and guiding principles for the next generation of molecular databases.超越化学结构:面向下一代分子数据库的经验教训与指导原则
Chem Sci. 2024 Nov 28;16(3):1002-1016. doi: 10.1039/d4sc04064c. eCollection 2025 Jan 15.
2
A Data Transformation Methodology to Create Findable, Accessible, Interoperable, and Reusable Health Data: Software Design, Development, and Evaluation Study.一种创建可发现、可访问、可互操作和可重用健康数据的数据转换方法:软件设计、开发和评估研究。
J Med Internet Res. 2023 Mar 8;25:e42822. doi: 10.2196/42822.
3
Initiatives, Concepts, and Implementation Practices of FAIR (Findable, Accessible, Interoperable, and Reusable) Data Principles in Health Data Stewardship Practice: Protocol for a Scoping Review.健康数据管理实践中FAIR(可查找、可访问、可互操作和可重用)数据原则的倡议、概念及实施实践:一项范围综述方案
JMIR Res Protoc. 2021 Feb 2;10(2):e22505. doi: 10.2196/22505.
4
Publicly available imaging datasets for age-related macular degeneration: Evaluation according to the Findable, Accessible, Interoperable, Reusable (FAIR) principles.公开可用的年龄相关性黄斑变性成像数据集:根据可查找、可访问、可互操作、可重用(FAIR)原则进行评估。
Exp Eye Res. 2025 Jun;255:110342. doi: 10.1016/j.exer.2025.110342. Epub 2025 Mar 13.
5
Making Metadata Machine-Readable as the First Step to Providing Findable, Accessible, Interoperable, and Reusable Population Health Data: Framework Development and Implementation Study.将元数据转化为机器可读形式作为提供可查找、可访问、可互操作和可重用的人群健康数据的第一步:框架开发与实施研究
Online J Public Health Inform. 2024 Aug 1;16:e56237. doi: 10.2196/56237.
6
Developing a framework for open and FAIR data management practices for next generation risk- and benefit assessment of fish and seafood.为鱼类和海鲜的下一代风险和益处评估制定开放且符合FAIR原则的数据管理实践框架。
EFSA J. 2022 Dec 14;20(Suppl 2):e200917. doi: 10.2903/j.efsa.2022.e200917. eCollection 2022 Dec.
7
CALIFRAME: a proposed method of calibrating reporting guidelines with FAIR principles to foster reproducibility of AI research in medicine.CALIFRAME:一种提议的将报告指南与FAIR原则校准以促进医学人工智能研究可重复性的方法。
JAMIA Open. 2024 Oct 18;7(4):ooae105. doi: 10.1093/jamiaopen/ooae105. eCollection 2024 Dec.
8
Initiatives, Concepts, and Implementation Practices of the Findable, Accessible, Interoperable, and Reusable Data Principles in Health Data Stewardship: Scoping Review.健康数据治理中可发现性、可访问性、互操作性和可重用性数据原则的举措、概念和实施实践:范围综述。
J Med Internet Res. 2023 Aug 28;25:e45013. doi: 10.2196/45013.
9
Exchange of Clinical and Omics Data According to FAIR Principles: A Review of Open Source Solutions.根据 FAIR 原则交换临床和组学数据:开源解决方案综述。
Methods Inf Med. 2020 Jun;59(S 01):e13-e20. doi: 10.1055/s-0040-1712968. Epub 2020 Jul 3.
10
FAIR4Health: Findable, Accessible, Interoperable and Reusable data to foster Health Research.FAIR4Health:可查找、可访问、可互操作且可重复使用的数据,以促进健康研究。
Open Res Eur. 2022 May 31;2:34. doi: 10.12688/openreseurope.14349.2. eCollection 2022.

引用本文的文献

1
Computational tools for the prediction of site- and regioselectivity of organic reactions.用于预测有机反应位点和区域选择性的计算工具。
Chem Sci. 2025 Mar 4;16(13):5383-5412. doi: 10.1039/d5sc00541h. eCollection 2025 Mar 26.

本文引用的文献

1
AABBA Graph Kernel: Atom-Atom, Bond-Bond, and Bond-Atom Autocorrelations for Machine Learning.AABBA图核:用于机器学习的原子-原子、键-键和键-原子自相关
J Chem Inf Model. 2024 Dec 9;64(23):8756-8769. doi: 10.1021/acs.jcim.4c01583. Epub 2024 Nov 24.
2
High-throughput Density Functional Perturbation Theory and Machine Learning Predictions of Infrared, Piezoelectric and Dielectric Responses.高通量密度泛函微扰理论及红外、压电和介电响应的机器学习预测
NPJ Comput Mater. 2020;6(1). doi: 10.1038/s41524-020-0337-2.
3
Directional multiobjective optimization of metal complexes at the billion-system scale.
十亿系统规模下金属配合物的定向多目标优化
Nat Comput Sci. 2024 Apr;4(4):263-273. doi: 10.1038/s43588-024-00616-5. Epub 2024 Mar 29.
4
Effectiveness of molecular fingerprints for exploring the chemical space of natural products.分子指纹图谱在探索天然产物化学空间方面的有效性。
J Cheminform. 2024 Mar 25;16(1):35. doi: 10.1186/s13321-024-00830-3.
5
Learning properties of ordered and disordered materials from multi-fidelity data.从多保真度数据中学习有序和无序材料的特性。
Nat Comput Sci. 2021 Jan;1(1):46-53. doi: 10.1038/s43588-020-00002-x. Epub 2021 Jan 14.
6
A universal graph deep learning interatomic potential for the periodic table.一种用于元素周期表的通用图深度学习原子间势能。
Nat Comput Sci. 2022 Nov;2(11):718-728. doi: 10.1038/s43588-022-00349-3. Epub 2022 Nov 28.
7
High-throughput ab initio design of atomic interfaces using InterMatch.使用InterMatch进行原子界面的高通量从头设计。
Nat Commun. 2023 Dec 1;14(1):7921. doi: 10.1038/s41467-023-43496-5.
8
ReaLigands: A Ligand Library Cultivated from Experiment and Intended for Molecular Computational Catalyst Design.ReaLigands:一个从实验中培育出来的配体库,旨在用于分子计算催化剂设计。
J Chem Inf Model. 2023 Dec 11;63(23):7412-7422. doi: 10.1021/acs.jcim.3c01310. Epub 2023 Nov 21.
9
The ChEMBL Database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods.2023 年的 ChEMBL 数据库:一个涵盖多种生物活性数据类型和时间段的药物发现平台。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1180-D1192. doi: 10.1093/nar/gkad1004.
10
Data-Driven Discovery of Organic Electronic Materials Enabled by Hybrid Top-Down/Bottom-Up Design.通过混合自上而下/自下而上设计实现的数据驱动的有机电子材料发现
Adv Mater. 2024 Jan;36(2):e2305602. doi: 10.1002/adma.202305602. Epub 2023 Nov 23.