Prediction of breast cancer proteins involved in immunotherapy, metastasis, and RNA-binding using molecular descriptors and artificial neural networks.

Affiliations

Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Mariscal Sucre Avenue, Quito, 170129, Ecuador.

RNASA-IMEDIR, Computer Science Faculty, University of Coruna, Coruna, 15071, Spain.

Publication information

Sci Rep. 2020 May 22;10(1):8515. doi: 10.1038/s41598-020-65584-y.

Abstract

Breast cancer (BC) is a heterogeneous disease involving genomic alterations, deregulated protein expression, altered signaling pathways, hormone disruption, ethnicity and environmental determinants. Because of this complexity, predicting the proteins involved in BC is an active topic in drug design. This work proposes an accurate classifier for BC proteins, built using six sets of protein sequence descriptors and 13 machine-learning methods. After univariate feature selection on a mix of five descriptor families, the best classifier was obtained with the multilayer perceptron method (an artificial neural network) and 300 features. The model achieved an area under the receiver operating characteristic curve (AUROC) of 0.980 ± 0.0037 and an accuracy of 0.936 ± 0.0056 (3-fold cross-validation). When the model was used to score 4,504 cancer-associated proteins, the top-ranked cancer immunotherapy proteins related to BC were RPS27, SUPT4H1, CLPSL2, POLR2K, RPL38, AKT3, CDK3, RPS20, RASL11A and UBTD1; the top-ranked metastasis driver proteins related to BC were S100A9, DDA1, TXN, PRNP, RPS27, S100A14, S100A7, MAPK1, AGR3 and NDUFA13; and the top-ranked RNA-binding proteins related to BC were S100A9, TXN, RPS27L, RPS27, RPS27A, RPL38, MRPL54, PPAN, RPS20 and CSRP1. The model predicts several BC-related proteins that should be studied in depth to find new biomarkers and better therapeutic targets. Scripts can be downloaded at https://github.com/muntisa/neural-networks-for-breast-cancer-proteins.
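
As a reading aid for the reported metric, the sketch below shows one way an AUROC value of the form "mean ± spread over 3 folds" can be computed. It is not taken from the authors' scripts. The function names `auroc` and `summarize_folds`, the toy predictions, and the choice of the sample standard deviation as the "±" term are all assumptions made for illustration; the abstract does not state which spread measure was used.

```python
from statistics import mean, stdev


def auroc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive example
    receives a higher score than a randomly chosen negative one
    (ties count as half a win).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def summarize_folds(fold_results):
    """Return (mean, sample standard deviation) of the per-fold AUROC values."""
    values = [auroc(labels, scores) for labels, scores in fold_results]
    return mean(values), stdev(values)


# Hypothetical held-out predictions for three folds (made-up numbers,
# NOT data from the paper): each entry is (true labels, predicted scores).
folds = [
    ([1, 1, 0, 0], [0.9, 0.8, 0.3, 0.1]),  # every positive outranks every negative
    ([1, 0, 1, 0], [0.7, 0.6, 0.4, 0.2]),  # one positive/negative pair is misordered
    ([1, 1, 0, 0], [0.9, 0.5, 0.5, 0.1]),  # one positive/negative pair is tied
]

if __name__ == "__main__":
    m, s = summarize_folds(folds)
    print(f"AUROC = {m:.3f} ± {s:.3f} (3 folds)")
```

The pairwise formulation is quadratic in the number of examples, which is fine for a toy illustration; a library routine based on sorting would be the usual choice for real datasets.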

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/637a/7244564/8d01c501ab48/41598_2020_65584_Fig1_HTML.jpg
