• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于δ机器学习的氯化多环芳烃的量子化学性质

Quantum chemical properties of chlorinated polycyclic aromatic hydrocarbons for delta machine learning.

作者信息

Frolov Dmitry, Ibraev Ilya, Sedov Igor

机构信息

Sirius University of Science and Technology, 1 Olympic Ave, 354340, Sirius, Russia.

Chemical Institute, Kazan Federal University, Kremlevskaya 18, 420008, Kazan, Russia.

出版信息

Sci Data. 2025 Jun 21;12(1):1059. doi: 10.1038/s41597-025-05383-0.

DOI:10.1038/s41597-025-05383-0
PMID:40544156
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12182570/
Abstract

Promising Δ-machine learning approaches aim to correct the values of molecular properties calculated using computationally inexpensive approaches to the accuracy of precise methods. Training such models requires data obtained at different levels of quantum chemical theory. While several large and chemically diverse datasets have been published, studies in many areas require specialized datasets for structurally related molecules. Chlorinated polycyclic aromatic hydrocarbons (Cl-PAHs), the products of incomplete combustion of organic substances and materials, are hazardous pollutants with carcinogenic and mutagenic activity. Quantum chemistry methods are important to understand their formation mechanisms and properties. We describe a dataset, PACHQA, containing geometries, physical properties, wavefunctions, and electron densities for 3551 molecules including 3417 Cl-PAHs with up to 6 rings and different number of chlorine atoms, and 134 parent hydrocarbons. Most of them were not included in any published quantum chemical datasets. The calculations were performed at three different levels of theory. The dataset may be useful to develop and validate computational, machine learning, or experimental approaches, and study the structure-property relationships for Cl-PAHs.

摘要

有前景的Δ机器学习方法旨在将使用计算成本较低的方法计算出的分子性质值校正到精确方法的精度。训练此类模型需要不同量子化学理论水平下获得的数据。虽然已经发表了几个大型且化学性质多样的数据集,但许多领域的研究需要针对结构相关分子的专门数据集。氯化多环芳烃(Cl-PAHs)是有机物质和材料不完全燃烧的产物,是具有致癌和致突变活性的有害污染物。量子化学方法对于理解它们的形成机制和性质很重要。我们描述了一个数据集PACHQA,其中包含3551个分子的几何结构、物理性质、波函数和电子密度,包括3417个最多含6个环且氯原子数量不同的Cl-PAHs以及134个母体烃。其中大多数未包含在任何已发表的量子化学数据集中。计算是在三种不同的理论水平上进行的。该数据集可能有助于开发和验证计算、机器学习或实验方法,并研究Cl-PAHs的结构-性质关系。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/2cfdc8fc3161/41597_2025_5383_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/b23c808944a2/41597_2025_5383_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/99eb456dd6d4/41597_2025_5383_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/2dd757f3494f/41597_2025_5383_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/408c8988d3b6/41597_2025_5383_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/997750d3cb57/41597_2025_5383_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/3cae4a497182/41597_2025_5383_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/1bf26afe4d98/41597_2025_5383_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/5d3f175aeab2/41597_2025_5383_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/2cfdc8fc3161/41597_2025_5383_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/b23c808944a2/41597_2025_5383_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/99eb456dd6d4/41597_2025_5383_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/2dd757f3494f/41597_2025_5383_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/408c8988d3b6/41597_2025_5383_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/997750d3cb57/41597_2025_5383_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/3cae4a497182/41597_2025_5383_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/1bf26afe4d98/41597_2025_5383_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/5d3f175aeab2/41597_2025_5383_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4813/12182570/2cfdc8fc3161/41597_2025_5383_Fig9_HTML.jpg

相似文献

1
Quantum chemical properties of chlorinated polycyclic aromatic hydrocarbons for delta machine learning.用于δ机器学习的氯化多环芳烃的量子化学性质
Sci Data. 2025 Jun 21;12(1):1059. doi: 10.1038/s41597-025-05383-0.
2
Drugs for preventing postoperative nausea and vomiting in adults after general anaesthesia: a network meta-analysis.成人全身麻醉后预防术后恶心呕吐的药物:网状Meta分析
Cochrane Database Syst Rev. 2020 Oct 19;10(10):CD012859. doi: 10.1002/14651858.CD012859.pub2.
3
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
Photoemission spectroscopy of organic molecules using plane wave/pseudopotential density functional theory and machine learning: A comprehensive and predictive computational protocol for isolated molecules, molecular aggregates, and organic thin films.使用平面波/赝势密度泛函理论和机器学习的有机分子光发射光谱:一种针对孤立分子、分子聚集体和有机薄膜的全面且具有预测性的计算方案。
J Chem Phys. 2025 Jun 28;162(24). doi: 10.1063/5.0272583.
6
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
7
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
9
Dietary interventions for recurrent abdominal pain in childhood.儿童复发性腹痛的饮食干预措施
Cochrane Database Syst Rev. 2017 Mar 23;3(3):CD010972. doi: 10.1002/14651858.CD010972.pub2.
10
Interventions for central serous chorioretinopathy: a network meta-analysis.中心性浆液性脉络膜视网膜病变的干预措施:一项网状Meta分析
Cochrane Database Syst Rev. 2025 Jun 16;6(6):CD011841. doi: 10.1002/14651858.CD011841.pub3.

本文引用的文献

1
Insights into the Formation of Chlorinated Polycyclic Aromatic Hydrocarbons Related to Chlorine in Salt-Tolerant Rice: Profiles in Market Samples, Effects of Saline Cultivation, and Household Cooking.耐盐水稻中与氯有关的氯化多环芳烃形成的研究进展:市场样品中的特征、盐田种植的影响以及家庭烹饪。
J Agric Food Chem. 2024 Nov 6;72(44):24833-24846. doi: 10.1021/acs.jafc.4c06295. Epub 2024 Oct 23.
2
Predictive crystallography at scale: mapping, validating, and learning from 1000 crystal energy landscapes.大规模预测晶体学:绘制、验证并从1000个晶体能量景观中学习。
Faraday Discuss. 2025 Jan 14;256(0):434-458. doi: 10.1039/d4fd00105b.
3
Dataset for quantum-mechanical exploration of conformers and solvent effects in large drug-like molecules.
用于探索大型类药物分子构象和溶剂效应的量子力学数据集。
Sci Data. 2024 Jul 7;11(1):742. doi: 10.1038/s41597-024-03521-8.
4
Multitask methods for predicting molecular properties from heterogeneous data.从异构数据预测分子性质的多任务方法。
J Chem Phys. 2024 Jul 7;161(1). doi: 10.1063/5.0201681.
5
COMPAS-3: a dataset of -condensed polybenzenoid hydrocarbons.COMPAS - 3:稠合多苯并芳烃数据集。
Phys Chem Chem Phys. 2024 May 29;26(21):15344-15357. doi: 10.1039/d4cp01027b.
6
COMPAS-2: a dataset of cata-condensed hetero-polycyclic aromatic systems.COMPAS-2:一个包含缩合杂多环芳烃体系的数据集。
Sci Data. 2024 Jan 19;11(1):97. doi: 10.1038/s41597-024-02927-8.
7
Δ machine learning for reaction property prediction.用于反应性质预测的机器学习
Chem Sci. 2023 Jul 19;14(46):13392-13401. doi: 10.1039/d3sc02408c. eCollection 2023 Nov 29.
8
Multifidelity Machine Learning for Molecular Excitation Energies.用于分子激发能的多保真机器学习
J Chem Theory Comput. 2023 Nov 14;19(21):7658-7670. doi: 10.1021/acs.jctc.3c00882. Epub 2023 Oct 20.
9
PubChemQC B3LYP/6-31G*//PM6 Data Set: The Electronic Structures of 86 Million Molecules Using B3LYP/6-31G* Calculations.PubChemQC B3LYP/6-31G*//PM6 数据集:使用 B3LYP/6-31G* 计算得到的 8600 万个分子的电子结构。
J Chem Inf Model. 2023 Sep 25;63(18):5734-5754. doi: 10.1021/acs.jcim.3c00899. Epub 2023 Sep 7.
10
Kernel based quantum machine learning at record rate: Many-body distribution functionals as compact representations.基于核的量子机器学习创纪录速度:多体分布泛函作为紧凑表示。
J Chem Phys. 2023 Jul 21;159(3). doi: 10.1063/5.0152215.