• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组范围内的基因-癌症关联研究,用于预测肿瘤学中的新型治疗靶点。

Genome-wide investigation of gene-cancer associations for the prediction of novel therapeutic targets in oncology.

机构信息

Department of Genetics, University of Cambridge, Cambridge, UK.

STORM Therapeutics Ltd, Cambridge, UK.

出版信息

Sci Rep. 2020 Jul 1;10(1):10787. doi: 10.1038/s41598-020-67846-1.

DOI:10.1038/s41598-020-67846-1
PMID:32612205
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7330039/
Abstract

A major cause of failed drug discovery programs is suboptimal target selection, resulting in the development of drug candidates that are potent inhibitors, but ineffective at treating the disease. In the genomics era, the availability of large biomedical datasets with genome-wide readouts has the potential to transform target selection and validation. In this study we investigate how computational intelligence methods can be applied to predict novel therapeutic targets in oncology. We compared different machine learning classifiers applied to the task of drug target classification for nine different human cancer types. For each cancer type, a set of "known" target genes was obtained and equally-sized sets of "non-targets" were sampled multiple times from the human protein-coding genes. Models were trained on mutation, gene expression (TCGA), and gene essentiality (DepMap) data. In addition, we generated a numerical embedding of the interaction network of protein-coding genes using deep network representation learning and included the results in the modeling. We assessed feature importance using a random forests classifier and performed feature selection based on measuring permutation importance against a null distribution. Our best models achieved good generalization performance based on the AUROC metric. With the best model for each cancer type, we ran predictions on more than 15,000 protein-coding genes to identify potential novel targets. Our results indicate that this approach may be useful to inform early stages of the drug discovery pipeline.

摘要

药物研发项目失败的一个主要原因是目标选择不当,导致开发出的候选药物虽然是有效的抑制剂,但对治疗疾病无效。在基因组学时代,具有全基因组读数的大型生物医学数据集的可用性有可能改变目标选择和验证。在这项研究中,我们研究了计算智能方法如何应用于预测肿瘤学中的新型治疗靶标。我们比较了不同的机器学习分类器在九种不同人类癌症类型的药物靶标分类任务中的应用。对于每种癌症类型,我们获得了一组“已知”的靶基因,并从人类蛋白编码基因中多次随机抽取大小相等的“非靶”基因集。模型是基于突变、基因表达(TCGA)和基因必需性(DepMap)数据进行训练的。此外,我们使用深度网络表示学习生成了蛋白编码基因互作网络的数值嵌入,并将结果包含在建模中。我们使用随机森林分类器评估特征重要性,并根据对空分布的置换重要性进行特征选择。我们的最佳模型基于 AUROC 指标实现了良好的泛化性能。对于每种癌症类型的最佳模型,我们对超过 15000 个蛋白编码基因进行了预测,以确定潜在的新靶标。我们的结果表明,这种方法可能有助于为药物发现管道的早期阶段提供信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eedf/7330039/1d57d9f399de/41598_2020_67846_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eedf/7330039/8dfb354217fe/41598_2020_67846_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eedf/7330039/e9053caa4bca/41598_2020_67846_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eedf/7330039/793116758bae/41598_2020_67846_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eedf/7330039/1d57d9f399de/41598_2020_67846_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eedf/7330039/8dfb354217fe/41598_2020_67846_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eedf/7330039/e9053caa4bca/41598_2020_67846_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eedf/7330039/793116758bae/41598_2020_67846_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eedf/7330039/1d57d9f399de/41598_2020_67846_Fig4_HTML.jpg

相似文献

1
Genome-wide investigation of gene-cancer associations for the prediction of novel therapeutic targets in oncology.全基因组范围内的基因-癌症关联研究,用于预测肿瘤学中的新型治疗靶点。
Sci Rep. 2020 Jul 1;10(1):10787. doi: 10.1038/s41598-020-67846-1.
2
Machine learning prediction of oncology drug targets based on protein and network properties.基于蛋白质和网络特性的肿瘤药物靶点的机器学习预测。
BMC Bioinformatics. 2020 Mar 14;21(1):104. doi: 10.1186/s12859-020-3442-9.
3
Application of network link prediction in drug discovery.网络链接预测在药物发现中的应用。
BMC Bioinformatics. 2021 Apr 12;22(1):187. doi: 10.1186/s12859-021-04082-y.
4
In silico prediction of novel therapeutic targets using gene-disease association data.利用基因-疾病关联数据进行新型治疗靶点的计算机预测。
J Transl Med. 2017 Aug 29;15(1):182. doi: 10.1186/s12967-017-1285-6.
5
A Machine Learning Approach for Drug-target Interaction Prediction using Wrapper Feature Selection and Class Balancing.基于包装特征选择和类别平衡的药物-靶标相互作用预测的机器学习方法。
Mol Inform. 2020 May;39(5):e1900062. doi: 10.1002/minf.201900062. Epub 2020 Feb 11.
6
Data Integration Using Advances in Machine Learning in Drug Discovery and Molecular Biology.利用机器学习进展进行药物发现和分子生物学中的数据整合
Methods Mol Biol. 2021;2190:167-184. doi: 10.1007/978-1-0716-0826-5_7.
7
Multi-view self-attention for interpretable drug-target interaction prediction.多视角自注意力模型可实现药物-靶标相互作用的可解释预测。
J Biomed Inform. 2020 Oct;110:103547. doi: 10.1016/j.jbi.2020.103547. Epub 2020 Aug 27.
8
SCNrank: spectral clustering for network-based ranking to reveal potential drug targets and its application in pancreatic ductal adenocarcinoma.SCNrank:基于网络的排序的谱聚类揭示潜在的药物靶点及其在胰腺导管腺癌中的应用。
BMC Med Genomics. 2020 Apr 3;13(Suppl 5):50. doi: 10.1186/s12920-020-0681-6.
9
Empowering the discovery of novel target-disease associations via machine learning approaches in the open targets platform.通过开放靶点平台中的机器学习方法赋予发现新的靶-疾病关联的能力。
BMC Bioinformatics. 2022 Jun 16;23(1):232. doi: 10.1186/s12859-022-04753-4.
10
Signature microRNAs and long noncoding RNAs in laryngeal cancer recurrence identified using a competing endogenous RNA network.基于竞争性内源 RNA 网络的喉癌复发相关特征 microRNAs 和长链非编码 RNA。
Mol Med Rep. 2019 Jun;19(6):4806-4818. doi: 10.3892/mmr.2019.10143. Epub 2019 Apr 10.

引用本文的文献

1
PINNED: identifying characteristics of druggable human proteins using an interpretable neural network.PINNED:使用可解释神经网络识别可成药人类蛋白质的特征
J Cheminform. 2023 Jul 19;15(1):64. doi: 10.1186/s13321-023-00735-7.
2
OncoRTT: Predicting novel oncology-related therapeutic targets using BERT embeddings and omics features.OncoRTT:使用BERT嵌入和组学特征预测新型肿瘤相关治疗靶点。
Front Genet. 2023 Apr 6;14:1139626. doi: 10.3389/fgene.2023.1139626. eCollection 2023.
3
Omics Data and Data Representations for Deep Learning-Based Predictive Modeling.

本文引用的文献

1
Rethinking drug design in the artificial intelligence era.人工智能时代的药物设计再思考。
Nat Rev Drug Discov. 2020 May;19(5):353-364. doi: 10.1038/s41573-019-0050-3. Epub 2019 Dec 4.
2
Use of big data in drug development for precision medicine: an update.大数据在精准医学药物研发中的应用:最新进展
Expert Rev Precis Med Drug Dev. 2019;4(3):189-200. doi: 10.1080/23808993.2019.1617632. Epub 2019 May 20.
3
Trends in clinical success rates and therapeutic focus.临床成功率及治疗重点的趋势
基于组学数据和数据表示的深度学习预测模型
Int J Mol Sci. 2022 Oct 14;23(20):12272. doi: 10.3390/ijms232012272.
4
dCas9-VPR-mediated transcriptional activation of functionally equivalent genes for gene therapy.dCas9-VPR 介导的功能等效基因的转录激活用于基因治疗。
Nat Protoc. 2022 Mar;17(3):781-818. doi: 10.1038/s41596-021-00666-3. Epub 2022 Feb 7.
5
The promise of automated machine learning for the genetic analysis of complex traits.自动化机器学习在复杂性状的遗传分析中的应用前景。
Hum Genet. 2022 Sep;141(9):1529-1544. doi: 10.1007/s00439-021-02393-x. Epub 2021 Oct 28.
Nat Rev Drug Discov. 2019 Jul;18(7):495-496. doi: 10.1038/d41573-019-00074-z.
4
Applications of machine learning in drug discovery and development.机器学习在药物发现和开发中的应用。
Nat Rev Drug Discov. 2019 Jun;18(6):463-477. doi: 10.1038/s41573-019-0024-5.
5
ZFP57 suppress proliferation of breast cancer cells through down-regulation of MEST-mediated Wnt/β-catenin signalling pathway.ZFP57 通过下调 MEST 介导的 Wnt/β-连环蛋白信号通路抑制乳腺癌细胞增殖。
Cell Death Dis. 2019 Feb 20;10(3):169. doi: 10.1038/s41419-019-1335-5.
6
Epithelial insulin receptor expression-prognostic relevance in colorectal cancer.上皮胰岛素受体表达与结直肠癌预后的相关性
Oncotarget. 2018 Dec 25;9(101):37497-37508. doi: 10.18632/oncotarget.26490.
7
HBV infection suppresses the expression of inflammatory macrophage miR‑210.HBV 感染抑制炎症巨噬细胞 miR-210 的表达。
Mol Med Rep. 2019 Mar;19(3):1833-1839. doi: 10.3892/mmr.2018.9795. Epub 2018 Dec 24.
8
Expression of Insulinoma-Associated Protein 1 (INSM1) and Orthopedia Homeobox (OTP) in Tumors with Neuroendocrine Differentiation at Rare Sites.罕见部位神经内分泌分化肿瘤中胰岛素瘤相关蛋白 1(INSM1)和同源盒蛋白(OTP)的表达。
Endocr Pathol. 2019 Mar;30(1):35-42. doi: 10.1007/s12022-018-9559-y.
9
The BioGRID interaction database: 2019 update.生物相互作用数据库(BioGRID):2019 年更新版。
Nucleic Acids Res. 2019 Jan 8;47(D1):D529-D541. doi: 10.1093/nar/gky1079.
10
Open Targets Platform: new developments and updates two years on.开放靶点平台:两年的新发展和更新。
Nucleic Acids Res. 2019 Jan 8;47(D1):D1056-D1065. doi: 10.1093/nar/gky1133.