• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PubChemQC B3LYP/6-31G*//PM6 数据集:使用 B3LYP/6-31G* 计算得到的 8600 万个分子的电子结构。

PubChemQC B3LYP/6-31G*//PM6 Data Set: The Electronic Structures of 86 Million Molecules Using B3LYP/6-31G* Calculations.

机构信息

RIKEN Cluster for Pioneering Research, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan.

Software Technology and Artificial Intelligence Research Laboratory, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba 275-0016, Japan.

出版信息

J Chem Inf Model. 2023 Sep 25;63(18):5734-5754. doi: 10.1021/acs.jcim.3c00899. Epub 2023 Sep 7.

DOI:10.1021/acs.jcim.3c00899
PMID:37677147
Abstract

The presented "PubChemQC B3LYP/6-31G*//PM6" data set is composed of the electronic properties of 85,938,443 molecules, encompassing a broad spectrum of molecules from essential compounds to biomolecules with a molecular weight up to 1000. These molecules account for 94.0% of the original PubChem Compound catalog as of August 29, 2016. The electronic properties, including orbitals, orbital energies, total energies, dipole moments, and other pertinent properties, were computed by using the B3LYP/6-31G* and PM6 methods. The data set, available in three formats, namely, GAMESS quantum chemistry program files, selected JSON output files, and a PostgreSQL database, provides researchers with the ability to query molecular properties. It is further subdivided into five subdata sets for more specific data. The first two subsets encompass molecules with carbon, hydrogen, oxygen, and nitrogen with molecular weights under 300 and 500, respectively. The third and fourth subsets incorporate molecules with carbon, hydrogen, nitrogen, oxygen, phosphorus, sulfur, fluorine, and chlorine, with molecular weights under 300 and 500, respectively. The fifth subset comprises molecules with carbon, hydrogen, nitrogen, oxygen, phosphorus, sulfur, fluorine, chlorine, sodium, potassium, magnesium, and calcium, with a molecular weight of under 500. The coefficients of determination for the highest occupied molecular orbital-lowest unoccupied molecular orbital energy gap range from 0.892 (for CHON500) to 0.803 (for the whole data set). These comprehensive results pave the way for applications in drug discovery and materials science, among others. The data sets can be accessed under the Creative Commons Attribution 4.0 International license at the following web address: https://nakatamaho.riken.jp/pubchemqc.riken.jp/b3lyp_pm6_datasets.html.

摘要

所呈现的“PubChemQC B3LYP/6-31G*//PM6”数据集由 85938443 个分子的电子性质组成,涵盖了从基本化合物到生物分子的广泛分子范围,分子量高达 1000。这些分子占原始 PubChem 化合物目录的 94.0%,截至 2016 年 8 月 29 日。电子性质,包括轨道、轨道能量、总能量、偶极矩和其他相关性质,是使用 B3LYP/6-31G*和 PM6 方法计算的。该数据集以三种格式提供,即 GAMESS 量子化学程序文件、选定的 JSON 输出文件和 PostgreSQL 数据库,为研究人员提供了查询分子性质的能力。它进一步细分为五个子数据集,以提供更具体的数据。前两个子集分别包含分子量小于 300 和 500 的含碳、氢、氧和氮的分子。第三和第四个子集分别包含分子量小于 300 和 500 的含碳、氢、氮、氧、磷、硫、氟和氯的分子。第五个子集包含分子量小于 500 的含碳、氢、氮、氧、磷、硫、氟、氯、钠、钾、镁和钙的分子。最高占据分子轨道-最低未占据分子轨道能量间隙的决定系数范围从 0.892(对于 CHON500)到 0.803(对于整个数据集)。这些综合结果为药物发现和材料科学等领域的应用铺平了道路。这些数据集可以在以下网址下以知识共享署名 4.0 国际许可证获得:https://nakatamaho.riken.jp/pubchemqc.riken.jp/b3lyp_pm6_datasets.html。

相似文献

1
PubChemQC B3LYP/6-31G*//PM6 Data Set: The Electronic Structures of 86 Million Molecules Using B3LYP/6-31G* Calculations.PubChemQC B3LYP/6-31G*//PM6 数据集:使用 B3LYP/6-31G* 计算得到的 8600 万个分子的电子结构。
J Chem Inf Model. 2023 Sep 25;63(18):5734-5754. doi: 10.1021/acs.jcim.3c00899. Epub 2023 Sep 7.
2
PubChemQC PM6: Data Sets of 221 Million Molecules with Optimized Molecular Geometries and Electronic Properties.PubChemQC PM6:具有优化分子几何和电子性质的 2.21 亿个分子数据集。
J Chem Inf Model. 2020 Dec 28;60(12):5891-5899. doi: 10.1021/acs.jcim.0c00740. Epub 2020 Oct 26.
3
PubChemQC Project: A Large-Scale First-Principles Electronic Structure Database for Data-Driven Chemistry.PubChemQC 项目:用于数据驱动化学的大规模第一性原理电子结构数据库。
J Chem Inf Model. 2017 Jun 26;57(6):1300-1308. doi: 10.1021/acs.jcim.7b00083. Epub 2017 May 19.
4
First principal studies of spectroscopic (IR and Raman, UV-visible), molecular structure, linear and nonlinear optical properties of L-arginine p-nitrobenzoate monohydrate (LANB): A new non-centrosymmetric material.对一水合对硝基苯甲酸L-精氨酸盐(LANB)的光谱(红外和拉曼光谱、紫外-可见光谱)、分子结构、线性和非线性光学性质的第一性原理研究:一种新型非中心对称材料。
Spectrochim Acta A Mol Biomol Spectrosc. 2015 Aug 5;147:84-92. doi: 10.1016/j.saa.2015.02.111. Epub 2015 Mar 16.
5
ENHANCEMENT OF THE RESPONSE OF THE HYDROGEN FLAME IONIZATION DETECTOR TO COMPOUNDS CONTAINING HALOGENS AND PHOSPHORUS.提高氢火焰离子化检测器对含卤素和磷化合物的响应
Nature. 1964 Mar 21;201:1204-5. doi: 10.1038/2011204a0.
6
Atomic properties of selected biomolecules: quantum topological atom types of hydrogen, oxygen, nitrogen and sulfur occurring in natural amino acids and their derivatives.选定生物分子的原子性质:天然氨基酸及其衍生物中氢、氧、氮和硫的量子拓扑原子类型。
Chemistry. 2003 Mar 3;9(5):1207-16. doi: 10.1002/chem.200390138.
7
Parameterization of DFTB3/3OB for Sulfur and Phosphorus for Chemical and Biological Applications.用于化学和生物学应用的硫和磷的DFTB3/3OB参数化
J Chem Theory Comput. 2014 Apr 8;10(4):1518-1537. doi: 10.1021/ct401002w. Epub 2014 Mar 12.
8
Electron Donor and Acceptor Influence on the Nonlinear Optical Response of Diacetylene-Functionalized Organic Materials (DFOMs): Density Functional Theory Calculations.电子给体和受体对二乙炔基功能化有机材料(DFOMs)非线性光学响应的影响:密度泛函理论计算。
Molecules. 2019 Jun 2;24(11):2096. doi: 10.3390/molecules24112096.
9
A big data approach to the ultra-fast prediction of DFT-calculated bond energies.一种大数据方法,可实现对 DFT 计算键能的超快速预测。
J Cheminform. 2013 Jul 12;5:34. doi: 10.1186/1758-2946-5-34. eCollection 2013.
10
The performance of selected semi-empirical and DFT methods in studying C₆₀ fullerene derivatives.所选半经验方法和密度泛函理论方法在研究C₆₀富勒烯衍生物中的性能。
Nanotechnology. 2015 Nov 13;26(45):455702. doi: 10.1088/0957-4484/26/45/455702. Epub 2015 Oct 16.

引用本文的文献

1
Quantum chemical properties of chlorinated polycyclic aromatic hydrocarbons for delta machine learning.用于δ机器学习的氯化多环芳烃的量子化学性质
Sci Data. 2025 Jun 21;12(1):1059. doi: 10.1038/s41597-025-05383-0.
2
The QCML dataset, Quantum chemistry reference data from 33.5M DFT and 14.7B semi-empirical calculations.QCML数据集,来自3350万个密度泛函理论(DFT)计算和147亿个半经验计算的量子化学参考数据。
Sci Data. 2025 Mar 8;12(1):406. doi: 10.1038/s41597-025-04720-7.
3
Spectrophotometric and computational characterization of charge transfer complex of selumetinib with 2,3-dichloro-5,6-dicyano-1,4-benzoquinone and its utilization in developing an innovative green and high throughput microwell assay for analysis of bulk form and pharmaceutical formulation.
司美替尼与2,3-二氯-5,6-二氰基-1,4-苯醌电荷转移络合物的分光光度法和计算表征及其在开发用于分析原料药和药物制剂的创新绿色高通量微孔测定法中的应用。
BMC Chem. 2025 Jan 29;19(1):27. doi: 10.1186/s13065-024-01353-6.
4
Beyond chemical structures: lessons and guiding principles for the next generation of molecular databases.超越化学结构:面向下一代分子数据库的经验教训与指导原则
Chem Sci. 2024 Nov 28;16(3):1002-1016. doi: 10.1039/d4sc04064c. eCollection 2025 Jan 15.
5
Synthesis, spectral analysis, and DFT studies of the novel pyrano[3,2-] quinoline-based 1,3,4-thiadiazole for enhanced solar cell performance.用于提高太阳能电池性能的新型吡喃并[3,2 -]喹啉基1,3,4 -噻二唑的合成、光谱分析及密度泛函理论研究
Heliyon. 2024 Oct 17;10(20):e39468. doi: 10.1016/j.heliyon.2024.e39468. eCollection 2024 Oct 30.
6
QM9star, two Million DFT-computed Equilibrium Structures for Ions and Radicals with Atomic Information.QM9星,两百万个通过密度泛函理论计算得到的带有原子信息的离子和自由基平衡结构。
Sci Data. 2024 Oct 21;11(1):1158. doi: 10.1038/s41597-024-03933-6.
7
Quantum Topological Atomic Properties of 44K molecules.44K分子的量子拓扑原子性质
Sci Data. 2024 Aug 29;11(1):945. doi: 10.1038/s41597-024-03723-0.
8
Quantum Chemistry Dataset with Ground- and Excited-state Properties of 450 Kilo Molecules.包含45万个分子基态和激发态性质的量子化学数据集。
Sci Data. 2024 Aug 29;11(1):948. doi: 10.1038/s41597-024-03788-x.
9
Current Challenges in Monitoring Low Contaminant Levels of Per- and Polyfluoroalkyl Substances in Water Matrices in the Field.现场监测水基质中低浓度全氟和多氟烷基物质的当前挑战
Toxics. 2024 Aug 20;12(8):610. doi: 10.3390/toxics12080610.