一种基于深度学习的方法用于识别植物源天然化合物的药用用途。

A Deep Learning-Based Approach for Identifying the Medicinal Uses of Plant-Derived Natural Compounds.

作者信息

Yoo Sunyong, Yang Hyung Chae, Lee Seongyeong, Shin Jaewook, Min Seyoung, Lee Eunjoo, Song Minkeun, Lee Doheon

机构信息

School of Electronics and Computer Engineering, Chonnam National University, Gwangju, South Korea.

Department of Otorhinolaryngology-Head and Neck Surgery, Chonnam National University Medical School and Chonnam National University Hospital, Gwangju, South Korea.

出版信息

Front Pharmacol. 2020 Nov 30;11:584875. doi: 10.3389/fphar.2020.584875. eCollection 2020.

DOI:10.3389/fphar.2020.584875

PMID:33519445

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7845697/

Abstract

Medicinal plants and their extracts have been used as important sources for drug discovery. In particular, plant-derived natural compounds, including phytochemicals, antioxidants, vitamins, and minerals, are gaining attention as they promote health and prevent disease. Although several methods have been developed to confirm the biological activities of natural compounds, there is still considerable room to reduce time and cost. To overcome these limitations, several methods have been proposed for conducting large-scale analysis, but they are still limited in terms of dealing with incomplete and heterogeneous natural compound data. Here, we propose a deep learning-based approach to identify the medicinal uses of natural compounds by exploiting massive and heterogeneous drug and natural compound data. The rationale behind this approach is that deep learning can effectively utilize heterogeneous features to alleviate incomplete information. Based on latent knowledge, molecular interactions, and chemical property features, we generated 686 dimensional features for 4,507 natural compounds and 2,882 approved and investigational drugs. The deep learning model was trained using the generated features and verified drug indication information. When the features of natural compounds were applied as input to the trained model, potential efficacies were successfully predicted with high accuracy, sensitivity, and specificity.

摘要

药用植物及其提取物一直是药物发现的重要来源。特别是，植物衍生的天然化合物，包括植物化学物质、抗氧化剂、维生素和矿物质，因其促进健康和预防疾病而受到关注。尽管已经开发了几种方法来确认天然化合物的生物活性，但在减少时间和成本方面仍有很大空间。为了克服这些限制，已经提出了几种进行大规模分析的方法，但在处理不完整和异质的天然化合物数据方面仍然存在局限性。在此，我们提出一种基于深度学习的方法，通过利用大量异质的药物和天然化合物数据来识别天然化合物的药用用途。这种方法背后的基本原理是深度学习可以有效地利用异质特征来缓解不完整信息。基于潜在知识、分子相互作用和化学性质特征，我们为4507种天然化合物以及2882种已批准和正在研究的药物生成了686维特征。使用生成的特征训练深度学习模型，并验证药物适应症信息。当将天然化合物的特征作为输入应用于训练好的模型时，成功地以高精度、高灵敏度和高特异性预测了潜在疗效。

相似文献

A Deep Learning-Based Approach for Identifying the Medicinal Uses of Plant-Derived Natural Compounds.一种基于深度学习的方法用于识别植物源天然化合物的药用用途。

Front Pharmacol. 2020 Nov 30;11:584875. doi: 10.3389/fphar.2020.584875. eCollection 2020.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Deep Learning-Assisted Repurposing of Plant Compounds for Treating Vascular Calcification: An In Silico Study with Experimental Validation.深度学习辅助植物化合物再利用治疗血管钙化：具有实验验证的计算研究。

Oxid Med Cell Longev. 2022 Jan 5;2022:4378413. doi: 10.1155/2022/4378413. eCollection 2022.

Phenotype-oriented network analysis for discovering pharmacological effects of natural compounds.基于表型的网络分析发现天然化合物的药理作用。

Sci Rep. 2018 Aug 3;8(1):11667. doi: 10.1038/s41598-018-30138-w.

In silico drug metabolism and pharmacokinetic profiles of natural products from medicinal plants in the Congo basin.刚果盆地药用植物天然产物的计算机模拟药物代谢和药代动力学概况

In Silico Pharmacol. 2013 Aug 8;1:12. doi: 10.1186/2193-9616-1-12. eCollection 2013.

Natural products as starting points for future anti-malarial therapies: going back to our roots?天然产物作为未来抗疟疗法的起点：回归本源？

Malar J. 2011 Mar 15;10 Suppl 1(Suppl 1):S3. doi: 10.1186/1475-2875-10-S1-S3.

Antileishmanial Activities of Medicinal Herbs and Phytochemicals In Vitro and In Vivo: An Update for the Years 2015 to 2021.抗利什曼原虫草药和植物化学物质的体外和体内抗利什曼原虫活性：2015 至 2021 年的更新。

Molecules. 2022 Nov 4;27(21):7579. doi: 10.3390/molecules27217579.

Novel in silico screening system for plant defense activators using deep learning-based prediction of reactive oxygen species accumulation.基于深度学习预测活性氧积累的植物防御激活剂新型计算机筛选系统

Plant Methods. 2023 Dec 8;19(1):142. doi: 10.1186/s13007-023-01118-7.

Antioxidative and therapeutic potential of selected Australian plants: A review.抗氧化和治疗潜力的选定的澳大利亚植物: 一个审查。

J Ethnopharmacol. 2021 Mar 25;268:113580. doi: 10.1016/j.jep.2020.113580. Epub 2020 Nov 13.

QPoweredCompound2DeNovoDrugPropMax - a novel programmatic tool incorporating deep learning and methods for automated in silico bio-activity discovery for any compound of interest.QPoweredCompound2DeNovoDrugPropMax——一种新颖的编程工具，融合深度学习和方法，可对任何感兴趣的化合物进行自动化的计算机虚拟生物活性发现。

J Biomol Struct Dyn. 2023 Mar;41(5):1790-1797. doi: 10.1080/07391102.2021.2024450. Epub 2022 Jan 10.

引用本文的文献

Grouped semantic-feature relation extraction from texts to represent medicinal-plant property knowledge on social media.从文本中提取分组语义特征关系以表示社交媒体上的药用植物特性知识。

Front Artif Intell. 2025 Aug 8;8:1579357. doi: 10.3389/frai.2025.1579357. eCollection 2025.

Predicting herb-disease associations using network-based measures in human protein interactome.基于人类蛋白质互作网络的度量指标预测草药-疾病关联

BMC Complement Med Ther. 2024 Jun 6;24(Suppl 2):218. doi: 10.1186/s12906-024-04503-4.

Phytochemicals in Pancreatic Cancer Treatment: A Machine Learning Study.胰腺癌治疗中的植物化学物质：一项机器学习研究。

ACS Omega. 2023 Dec 27;9(1):413-421. doi: 10.1021/acsomega.3c05861. eCollection 2024 Jan 9.

Systems pharmacology approaches in herbal medicine research: a brief review.系统药理学方法在草药研究中的应用：简要综述。

BMB Rep. 2022 Sep;55(9):417-428. doi: 10.5483/BMBRep.2022.55.9.102.

Natural product drug discovery in the artificial intelligence era.人工智能时代的天然产物药物发现

Chem Sci. 2021 Dec 13;13(6):1526-1546. doi: 10.1039/d1sc04471k. eCollection 2022 Feb 9.

Graph Neural Networks as a Potential Tool in Improving Virtual Screening Programs.图神经网络作为改进虚拟筛选程序的潜在工具。

Front Chem. 2022 Jan 20;9:787194. doi: 10.3389/fchem.2021.787194. eCollection 2021.

Natural Compounds for Preventing Ear, Nose, and Throat-Related Oral Infections.预防耳鼻喉相关口腔感染的天然化合物

Plants (Basel). 2021 Sep 6;10(9):1847. doi: 10.3390/plants10091847.

EGFR and ERK activation resists flavonoid quercetin-induced anticancer activities in human cervical cancer cells .表皮生长因子受体（EGFR）和细胞外信号调节激酶（ERK）的激活可抵抗黄酮类化合物槲皮素对人宫颈癌细胞的抗癌活性。

Oncol Lett. 2021 Nov;22(5):754. doi: 10.3892/ol.2021.13015. Epub 2021 Aug 31.

本文引用的文献

Machine learning approaches for elucidating the biological effects of natural products.机器学习方法在阐明天然产物的生物学效应中的应用。

Nat Prod Rep. 2021 Mar 4;38(2):346-361. doi: 10.1039/d0np00043d.

Cheminformatics in Natural Product-based Drug Discovery.天然产物药物发现中的 cheminformatics。

Mol Inform. 2020 Dec;39(12):e2000171. doi: 10.1002/minf.202000171. Epub 2020 Sep 6.

5-Caffeoylquinic Acid Ameliorates Cognitive Decline and Reduces Aβ Deposition by Modulating Aβ Clearance Pathways in APP/PS2 Transgenic Mice.5-咖啡酰奎宁酸通过调节 APP/PS2 转基因小鼠的 Aβ 清除途径改善认知下降并减少 Aβ 沉积。

Nutrients. 2020 Feb 14;12(2):494. doi: 10.3390/nu12020494.

Benefits and Risks of Clopidogrel vs. Aspirin Monotherapy after Recent Ischemic Stroke: A Systematic Review and Meta-Analysis.近期缺血性脑卒中后氯吡格雷单药与阿司匹林单药治疗的获益与风险：系统评价和荟萃分析。

Cardiovasc Ther. 2019 Dec 1;2019:1607181. doi: 10.1155/2019/1607181. eCollection 2019.

BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT：一种用于生物医学文本挖掘的预训练生物医学语言表示模型。

Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.

Tangeretin Inhibits Oxidative Stress and Inflammation via Upregulating Nrf-2 Signaling Pathway in Collagen-Induced Arthritic Rats.蜜橘素通过上调 Nrf-2 信号通路抑制胶原诱导性关节炎大鼠的氧化应激和炎症。

Pharmacology. 2019;104(3-4):187-195. doi: 10.1159/000501163. Epub 2019 Jul 25.

A Meta-Analysis of Resveratrol Protects against Myocardial Ischemia/Reperfusion Injury: Evidence from Small Animal Studies and Insight into Molecular Mechanisms.白藜芦醇对心肌缺血/再灌注损伤的保护作用的 Meta 分析：来自小动物研究的证据及对分子机制的深入了解。

Oxid Med Cell Longev. 2019 Apr 28;2019:5793867. doi: 10.1155/2019/5793867. eCollection 2019.

Informatics and Computational Methods in Natural Product Drug Discovery: A Review and Perspectives.天然产物药物发现中的信息学与计算方法：综述与展望

Front Genet. 2019 Apr 30;10:368. doi: 10.3389/fgene.2019.00368. eCollection 2019.

Discovering Health Benefits of Phytochemicals with Integrated Analysis of the Molecular Network, Chemical Properties and Ethnopharmacological Evidence.综合分析分子网络、化学性质和民族药理学证据发现植物化学物质的健康益处。

Nutrients. 2018 Aug 8;10(8):1042. doi: 10.3390/nu10081042.

Phenotype-oriented network analysis for discovering pharmacological effects of natural compounds.基于表型的网络分析发现天然化合物的药理作用。

Sci Rep. 2018 Aug 3;8(1):11667. doi: 10.1038/s41598-018-30138-w.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种基于深度学习的方法用于识别植物源天然化合物的药用用途。

A Deep Learning-Based Approach for Identifying the Medicinal Uses of Plant-Derived Natural Compounds.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献