School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China.
Department of Automation, Shanghai Jiao Tong University, Shanghai, China.
Dis Markers. 2017;2017:5745724. doi: 10.1155/2017/5745724. Epub 2017 Aug 29.
Lymph node (LN) metastasis was an independent risk factor for stomach cancer recurrence, and the presence of LN metastasis has great influence on the overall survival of stomach cancer patients. Thus, accurate prediction of the presence of lymph node metastasis can provide guarantee of credible prognosis evaluation of stomach cancer patients. Recently, increasing evidence demonstrated that the aberrant DNA methylation first appears before symptoms of the disease become clinically apparent.
Selecting key biomarkers for LN metastasis presence prediction for stomach cancer using clinical DNA methylation based on a machine learning method.
To reduce the overfitting risk of prediction task, we applied a three-step feature selection method according to the property of DNA methylation data.
The feature selection procedure extracted several cancer-related and lymph node metastasis-related genes, such as TP73, PDX1, FUT8, HOXD1, NMT1, and SEMA3E. The prediction performance was evaluated on the public DNA methylation dataset. The results showed that the three-step feature procedure can largely improve the prediction performance and implied the reliability of the biomarkers selected.
With the selected biomarkers, the prediction method can achieve higher accuracy in detecting LN metastasis and the results also proved the reliability of the selected biomarkers indirectly.
淋巴结(LN)转移是胃癌复发的独立危险因素,LN 转移的存在对胃癌患者的总生存有很大影响。因此,准确预测淋巴结转移的存在可以为胃癌患者提供可靠的预后评估保障。最近,越来越多的证据表明,异常的 DNA 甲基化首先出现在疾病出现临床症状之前。
基于机器学习方法,从临床 DNA 甲基化数据中选择用于预测胃癌 LN 转移存在的关键生物标志物。
为了降低预测任务的过拟合风险,我们根据 DNA 甲基化数据的特性应用了三步特征选择方法。
特征选择过程提取了几个与癌症和淋巴结转移相关的基因,如 TP73、PDX1、FUT8、HOXD1、NMT1 和 SEMA3E。在公共 DNA 甲基化数据集上评估了预测性能。结果表明,三步特征选择过程可以显著提高预测性能,并暗示了所选择生物标志物的可靠性。
利用这些选定的生物标志物,预测方法可以实现更高的 LN 转移检测准确性,结果也间接证明了所选生物标志物的可靠性。