Department of Hematology, The First Affiliated Hospital of Wannan Medical College, Wuhu, Anhui 241001, China.
Primary and Secondary Health Care Department, Lahore, Pakistan.
Dis Markers. 2022 Oct 7;2022:7826393. doi: 10.1155/2022/7826393. eCollection 2022.
The epithelial mesenchymal transition (EMT) gene has been shown to be significantly associated with the prognosis of solid tumors; however, there is a lack of models for the EMT gene to predict the prognosis of AML patients.
First, we downloaded clinical data and raw transcriptome sequencing data from the TCGA database of acute myeloid leukemia (AML) patients. All currently confirmed EMT-related genes were obtained from the dbEMT 2.0 database, and 30% of the TCGA data were randomly selected as the test set. Univariate Cox regression analysis, random forest, and lasso regression were used to optimize the number of genes for model construction, and multivariate Cox regression was used for model construction. Area under the ROC curve was used to assess the efficacy of the model application, and the internal validation set was used to assess the stability of the model.
A total of 173 AML samples were downloaded, and a total of 1184 EMT-related genes were downloaded. The results of univariate batch Cox regression analysis suggested that 212 genes were associated with patient prognosis, random forest and lasso regression yielded 18 and 8 prognosis-related EMT genes, respectively, and the results of multifactorial COX regression model suggested that 5 genes, CBR1, HS3ST3B1, LIMA1, MIR573, and PTP4A3, were considered as independent risk factors affecting patient prognosis. The model ROC results suggested that the area under the curve was 0.868 and the internal validation results showed that the area under the curve was 0.815.
During this study, we constructed a signature model of five EMT-related genes to predict overall survival in patients with AML; it will provide a useful tool for clinical decision making.
上皮间质转化(EMT)基因已被证明与实体瘤的预后显著相关;然而,目前缺乏 EMT 基因模型来预测 AML 患者的预后。
首先,我们从 TCGA 数据库中下载了急性髓系白血病(AML)患者的临床数据和原始转录组测序数据。从 dbEMT 2.0 数据库中获得所有目前确认的 EMT 相关基因,将 TCGA 数据的 30%随机选择为测试集。采用单因素 Cox 回归分析、随机森林和套索回归优化模型构建的基因数量,采用多因素 Cox 回归进行模型构建。使用 ROC 曲线下面积评估模型应用的效果,使用内部验证集评估模型的稳定性。
共下载了 173 个 AML 样本,共下载了 1184 个 EMT 相关基因。单因素批 Cox 回归分析结果提示 212 个基因与患者预后相关,随机森林和套索回归分别得出 18 个和 8 个与预后相关的 EMT 基因,多因素 COX 回归模型结果提示 5 个基因 CBR1、HS3ST3B1、LIMA1、MIR573 和 PTP4A3 被认为是影响患者预后的独立危险因素。模型 ROC 结果提示曲线下面积为 0.868,内部验证结果表明曲线下面积为 0.815。
在本研究中,我们构建了一个由五个 EMT 相关基因组成的特征模型,用于预测 AML 患者的总生存率;它将为临床决策提供有用的工具。