• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度神经网络和分子电子密度对物质进行健康危害分类。

Classification of substances by health hazard using deep neural networks and molecular electron densities.

作者信息

Singh Satnam, Zeh Gina, Freiherr Jessica, Bauer Thilo, Türkmen Isik, Grasskamp Andreas T

机构信息

Department of Sensory Analytics and Technologies, Fraunhofer Institute for Process Engineering and Packaging IVV, Giggenhauser Str. 35, 85354, Freising, Germany.

Department of Psychiatry and Psychotherapy, Friedrich-Alexander-Universität Erlangen-Nürnberg, Schwabachanlage 6, 91054, Erlangen, Germany.

出版信息

J Cheminform. 2024 Apr 16;16(1):45. doi: 10.1186/s13321-024-00835-y.

DOI:10.1186/s13321-024-00835-y
PMID:38627862
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11302296/
Abstract

In this paper we present a method that allows leveraging 3D electron density information to train a deep neural network pipeline to segment regions of high, medium and low electronegativity and classify substances as health hazardous or non-hazardous. We show that this can be used for use-cases such as cosmetics and food products. For this purpose, we first generate 3D electron density cubes using semiempirical molecular calculations for a custom European Chemicals Agency (ECHA) subset consisting of substances labelled as hazardous and non-hazardous for cosmetic usage. Together with their 3-class electronegativity maps we train a modified 3D-UNet with electron density cubes to segment reactive sites in molecules and classify substances with an accuracy of 78.1%. We perform the same process on a custom food dataset (CompFood) consisting of hazardous and non-hazardous substances compiled from European Food Safety Authority (EFSA) OpenFoodTox, Food and Drug Administration (FDA) Generally Recognized as Safe (GRAS) and FooDB datasets to achieve a classification accuracy of 64.1%. Our results show that 3D electron densities and particularly masked electron densities, calculated by taking a product of original electron densities and regions of high and low electronegativity can be used to classify molecules for different use-cases and thus serve not only to guide safe-by-design product development but also aid in regulatory decisions. SCIENTIFIC CONTRIBUTION: We aim to contribute to the diverse 3D molecular representations used for training machine learning algorithms by showing that a deep learning network can be trained on 3D electron density representation of molecules. This approach has previously not been used to train machine learning models and it allows utilization of the true spatial domain of the molecule for prediction of properties such as their suitability for usage in cosmetics and food products and in future, to other molecular properties. The data and code used for training is accessible at https://github.com/s-singh-ivv/eDen-Substances .

摘要

在本文中,我们提出了一种方法,该方法能够利用三维电子密度信息来训练深度神经网络管道,以分割高、中、低电负性区域,并将物质分类为对健康有害或无害。我们表明,这可用于化妆品和食品等用例。为此,我们首先使用半经验分子计算为一个定制的欧洲化学品管理局(ECHA)子集生成三维电子密度立方体,该子集由标记为化妆品使用有害和无害的物质组成。连同它们的三类电负性图,我们使用电子密度立方体训练一个改进的三维U-Net,以分割分子中的反应位点,并对物质进行分类,准确率达到78.1%。我们对一个定制的食品数据集(CompFood)执行相同的过程,该数据集由从欧洲食品安全局(EFSA)的OpenFoodTox、美国食品药品监督管理局(FDA)的一般认为安全(GRAS)和FooDB数据集中汇编的有害和无害物质组成,以实现64.1%的分类准确率。我们的结果表明,三维电子密度,特别是通过将原始电子密度与高、低电负性区域相乘计算得到的掩码电子密度,可用于对不同用例的分子进行分类,从而不仅有助于指导按设计安全的产品开发,还有助于监管决策。科学贡献:我们旨在通过表明可以在分子的三维电子密度表示上训练深度学习网络,为用于训练机器学习算法的各种三维分子表示做出贡献。这种方法以前未用于训练机器学习模型,它允许利用分子的真实空间域来预测其性质,例如它们在化妆品和食品中的适用性,以及未来的其他分子性质。用于训练的数据和代码可在https://github.com/s-singh-ivv/eDen-Substances上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/bc379b60e8f0/13321_2024_835_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/da0d8877a18d/13321_2024_835_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/c820c9bf24e1/13321_2024_835_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/32487d7d5017/13321_2024_835_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/c1d17432a247/13321_2024_835_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/85083ba042c7/13321_2024_835_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/bc379b60e8f0/13321_2024_835_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/da0d8877a18d/13321_2024_835_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/c820c9bf24e1/13321_2024_835_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/32487d7d5017/13321_2024_835_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/c1d17432a247/13321_2024_835_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/85083ba042c7/13321_2024_835_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c10a/11302296/bc379b60e8f0/13321_2024_835_Fig6_HTML.jpg

相似文献

1
Classification of substances by health hazard using deep neural networks and molecular electron densities.利用深度神经网络和分子电子密度对物质进行健康危害分类。
J Cheminform. 2024 Apr 16;16(1):45. doi: 10.1186/s13321-024-00835-y.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
COSMOS next generation - A public knowledge base leveraging chemical and biological data to support the regulatory assessment of chemicals.COSMOS下一代——一个利用化学和生物数据支持化学品监管评估的公共知识库。
Comput Toxicol. 2021 Aug;19:100175. doi: 10.1016/j.comtox.2021.100175.
4
Deep Learning for Validating and Estimating Resolution of Cryo-Electron Microscopy Density Maps .深度学习在验证和估计冷冻电子显微镜密度图分辨率中的应用
Molecules. 2019 Mar 26;24(6):1181. doi: 10.3390/molecules24061181.
5
Volumetric macromolecule identification in cryo-electron tomograms using capsule networks.基于胶囊网络的冷冻电子断层图像中大分子的体积识别。
BMC Bioinformatics. 2022 Aug 30;23(1):360. doi: 10.1186/s12859-022-04901-w.
6
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
7
QPoweredCompound2DeNovoDrugPropMax - a novel programmatic tool incorporating deep learning and methods for automated in silico bio-activity discovery for any compound of interest.QPoweredCompound2DeNovoDrugPropMax——一种新颖的编程工具,融合深度学习和方法,可对任何感兴趣的化合物进行自动化的计算机虚拟生物活性发现。
J Biomol Struct Dyn. 2023 Mar;41(5):1790-1797. doi: 10.1080/07391102.2021.2024450. Epub 2022 Jan 10.
8
ChampKit: A framework for rapid evaluation of deep neural networks for patch-based histopathology classification.ChampKit:一种基于补丁的组织病理学分类的深度神经网络快速评估框架。
Comput Methods Programs Biomed. 2023 Sep;239:107631. doi: 10.1016/j.cmpb.2023.107631. Epub 2023 May 30.
9
Blood Stain Classification with Hyperspectral Imaging and Deep Neural Networks.高光谱成像与深度神经网络在血痕分类中的应用。
Sensors (Basel). 2020 Nov 21;20(22):6666. doi: 10.3390/s20226666.
10
Employing neural density functionals to generate potential energy surfaces.利用神经密度泛函生成势能面。
J Mol Model. 2024 Feb 10;30(3):65. doi: 10.1007/s00894-024-05834-2.

引用本文的文献

1
LAGNet: better electron density prediction for LCAO-based data and drug-like substances.LAGNet:基于LCAO的数据和类药物物质的更好电子密度预测。
J Cheminform. 2025 Apr 29;17(1):65. doi: 10.1186/s13321-025-01010-7.

本文引用的文献

1
DCAMCP: A deep learning model based on capsule network and attention mechanism for molecular carcinogenicity prediction.DCAMCP:一种基于胶囊网络和注意力机制的深度学习模型,用于分子致癌性预测。
J Cell Mol Med. 2023 Oct;27(20):3117-3126. doi: 10.1111/jcmm.17889. Epub 2023 Jul 31.
2
OWSum: algorithmic odor prediction and insight into structure-odor relationships.OWSum:算法气味预测与结构-气味关系洞察
J Cheminform. 2023 May 7;15(1):51. doi: 10.1186/s13321-023-00722-y.
3
Predicting Chemical Carcinogens Using a Hybrid Neural Network Deep Learning Method.
使用混合神经网络深度学习方法预测化学致癌物。
Sensors (Basel). 2022 Oct 26;22(21):8185. doi: 10.3390/s22218185.
4
SELFIES and the future of molecular string representations.自拍与分子串表示法的未来。
Patterns (N Y). 2022 Oct 14;3(10):100588. doi: 10.1016/j.patter.2022.100588.
5
A pocket-based 3D molecule generative model fueled by experimental electron density.基于口袋的 3D 分子生成模型,由实验电子密度提供燃料。
Sci Rep. 2022 Sep 6;12(1):15100. doi: 10.1038/s41598-022-19363-6.
6
Inverse design of 3d molecular structures with conditional generative neural networks.用条件生成神经网络进行 3D 分子结构的反向设计。
Nat Commun. 2022 Feb 21;13(1):973. doi: 10.1038/s41467-022-28526-y.
7
Learning Electron Densities in the Condensed Phase.凝聚相中的电子密度学习
J Chem Theory Comput. 2021 Nov 9;17(11):7203-7214. doi: 10.1021/acs.jctc.1c00576. Epub 2021 Oct 20.
8
3D-Scaffold: A Deep Learning Framework to Generate 3D Coordinates of Drug-like Molecules with Desired Scaffolds.3D 支架:一个深度学习框架,用于生成具有所需支架的类药物分子的 3D 坐标。
J Phys Chem B. 2021 Nov 11;125(44):12166-12176. doi: 10.1021/acs.jpcb.1c06437. Epub 2021 Oct 18.
9
From Electronegativity towards Reactivity-Searching for a Measure of Atomic Reactivity.从电负性到反应活性——探寻原子反应活性的一种度量方法
Molecules. 2021 Jun 16;26(12):3680. doi: 10.3390/molecules26123680.
10
Generalised Dice Overlap as a Deep Learning Loss Function for Highly Unbalanced Segmentations.广义骰子重叠作为高度不平衡分割的深度学习损失函数
Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2017). 2017;2017:240-248. doi: 10.1007/978-3-319-67558-9_28. Epub 2017 Sep 9.