DeeP4med：用于 P4 医学的深度学习，以预测多种人类组织中的正常和癌症转录组。

DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues.

机构信息

Department of Medical Biotechnology, School of Advanced Technologies, Shahrekord University of Medical Sciences, Shahrekord, Iran.

Laboratory of Systems Biology and Bioinformatics (LBB), University of Tehran, Tehran, Iran.

出版信息

BMC Bioinformatics. 2023 Jul 4;24(1):275. doi: 10.1186/s12859-023-05400-2.

DOI:10.1186/s12859-023-05400-2

PMID:37403016

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10320882/

Abstract

BACKGROUND

P4 medicine (predict, prevent, personalize, and participate) is a new approach to diagnosing and predicting diseases on a patient-by-patient basis. For the prevention and treatment of diseases, prediction plays a fundamental role. One of the intelligent strategies is the design of deep learning models that can predict the state of the disease using gene expression data.

RESULTS

We create an autoencoder deep learning model called DeeP4med, including a Classifier and a Transferor that predicts cancer's gene expression (mRNA) matrix from its matched normal sample and vice versa. The range of the F1 score of the model, depending on tissue type in the Classifier, is from 0.935 to 0.999 and in Transferor from 0.944 to 0.999. The accuracy of DeeP4med for tissue and disease classification was 0.986 and 0.992, respectively, which performed better compared to seven classic machine learning models (Support Vector Classifier, Logistic Regression, Linear Discriminant Analysis, Naive Bayes, Decision Tree, Random Forest, K Nearest Neighbors).

CONCLUSIONS

Based on the idea of DeeP4med, by having the gene expression matrix of a normal tissue, we can predict its tumor gene expression matrix and, in this way, find effective genes in transforming a normal tissue into a tumor tissue. Results of Differentially Expressed Genes (DEGs) and enrichment analysis on the predicted matrices for 13 types of cancer showed a good correlation with the literature and biological databases. This led that by using the gene expression matrix, to train the model with features of each person in a normal and cancer state, this model could predict diagnosis based on gene expression data from healthy tissue and be used to identify possible therapeutic interventions for those patients.

摘要

背景

P4 医学（预测、预防、个体化和参与）是一种新的方法，可以对患者进行个体化的疾病诊断和预测。对于疾病的预防和治疗，预测起着根本性的作用。其中一种智能策略是设计深度学习模型，可以使用基因表达数据预测疾病的状态。

结果

我们创建了一个称为 DeeP4med 的自动编码器深度学习模型，包括一个分类器和一个转换器，用于从匹配的正常样本预测癌症的基因表达（mRNA）矩阵，反之亦然。该模型的 F1 分数范围，取决于分类器中的组织类型，从 0.935 到 0.999，在转换器中从 0.944 到 0.999。DeeP4med 对组织和疾病分类的准确性分别为 0.986 和 0.992，与七种经典机器学习模型（支持向量分类器、逻辑回归、线性判别分析、朴素贝叶斯、决策树、随机森林、K 最近邻）相比表现更好。

结论

基于 DeeP4med 的思想，通过获得正常组织的基因表达矩阵，我们可以预测其肿瘤基因表达矩阵，从而找到将正常组织转化为肿瘤组织的有效基因。对 13 种癌症的预测矩阵进行差异表达基因（DEGs）和富集分析的结果与文献和生物数据库有很好的相关性。这使得通过使用基因表达矩阵，用正常和癌症状态下每个人的特征来训练模型，该模型可以根据健康组织的基因表达数据预测诊断，并用于识别这些患者可能的治疗干预措施。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75b0/10320882/12f63d6fe8bb/12859_2023_5400_Fig1_HTML.jpg

相似文献

DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues.

BMC Bioinformatics. 2023 Jul 4;24(1):275. doi: 10.1186/s12859-023-05400-2.

Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods.

BMC Bioinformatics. 2022 Oct 1;23(1):410. doi: 10.1186/s12859-022-04965-8.

Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease.

Comput Intell Neurosci. 2023 Mar 14;2023:9266889. doi: 10.1155/2023/9266889. eCollection 2023.

Machine learning model to predict the efficacy of antiseizure medications in patients with familial genetic generalized epilepsy.

Epilepsy Res. 2022 Mar;181:106888. doi: 10.1016/j.eplepsyres.2022.106888. Epub 2022 Feb 11.

A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.

Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.

A Linear Regression and Deep Learning Approach for Detecting Reliable Genetic Alterations in Cancer Using DNA Methylation and Gene Expression Data.

Genes (Basel). 2020 Aug 12;11(8):931. doi: 10.3390/genes11080931.

PlncRNA-HDeep: plant long noncoding RNA prediction using hybrid deep learning based on two encoding styles.

BMC Bioinformatics. 2021 May 12;22(Suppl 3):242. doi: 10.1186/s12859-020-03870-2.

Application of supervised machine learning algorithms for classification and prediction of type-2 diabetes disease status in Afar regional state, Northeastern Ethiopia 2021.

Sci Rep. 2023 May 13;13(1):7779. doi: 10.1038/s41598-023-34906-1.

Utilizing machine learning algorithms to predict subject genetic mutation class from in silico models of neuronal networks.

BMC Med Inform Decis Mak. 2022 Nov 9;22(1):290. doi: 10.1186/s12911-022-02038-7.

DEGnext: classification of differentially expressed genes from RNA-seq data using a convolutional neural network with transfer learning.

BMC Bioinformatics. 2022 Jan 6;23(1):17. doi: 10.1186/s12859-021-04527-4.

本文引用的文献

[Bioinformatics Analysis of Core Genes and Key Pathways in Myelodysplastic Syndrome].

Zhongguo Shi Yan Xue Ye Xue Za Zhi. 2022 Jun;30(3):804-812. doi: 10.19746/j.cnki.issn.1009-2137.2022.03.023.

The evolution, evolvability and engineering of gene regulatory DNA.

Nature. 2022 Mar;603(7901):455-463. doi: 10.1038/s41586-022-04506-6. Epub 2022 Mar 9.

Analysis of a large prostate cancer family identifies novel and recurrent gene fusion events providing evidence for inherited predisposition.

Prostate. 2022 Apr;82(5):540-550. doi: 10.1002/pros.24300. Epub 2022 Jan 7.

[Bioinformatics-based identification of the key genes associated with prostate cancer].

Zhonghua Nan Ke Xue. 2021 Jun;27(6):489-498.

Effective gene expression prediction from sequence by integrating long-range interactions.

Nat Methods. 2021 Oct;18(10):1196-1203. doi: 10.1038/s41592-021-01252-x. Epub 2021 Oct 4.

Ras/ERK and PI3K/AKT signaling differentially regulate oncogenic ERG mediated transcription in prostate cells.

PLoS Genet. 2021 Jul 27;17(7):e1009708. doi: 10.1371/journal.pgen.1009708. eCollection 2021 Jul.

FOXP1 and NDRG1 act differentially as downstream effectors of RAD9-mediated prostate cancer cell functions.

Cell Signal. 2021 Oct;86:110091. doi: 10.1016/j.cellsig.2021.110091. Epub 2021 Jul 21.

Deep learning predicts gene expression as an intermediate data modality to identify susceptibility patterns in Mycobacterium tuberculosis infected Diversity Outbred mice.

EBioMedicine. 2021 May;67:103388. doi: 10.1016/j.ebiom.2021.103388. Epub 2021 May 14.

A Novel Method to Predict Drug-Target Interactions Based on Large-Scale Graph Representation Learning.

Cancers (Basel). 2021 Apr 27;13(9):2111. doi: 10.3390/cancers13092111.

Appyters: Turning Jupyter Notebooks into data-driven web apps.

Patterns (N Y). 2021 Mar 4;2(3):100213. doi: 10.1016/j.patter.2021.100213. eCollection 2021 Mar 12.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

DeeP4med：用于 P4 医学的深度学习，以预测多种人类组织中的正常和癌症转录组。

DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献