School of Electric Engineering and Automation, Hefei University of Technology, Hefei, Anhui, 230009, China.
Mol Inform. 2020 Mar;39(3):e1900028. doi: 10.1002/minf.201900028. Epub 2019 Sep 6.
Accurate outcome prediction is crucial for precision medicine and personalized treatment of cancer. Researchers have found that multi-dimensional cancer omics studies outperform each data type (mRNA, microRNA, methylation or somatic copy number alteration) study in human disease research. Existing methods leveraging multiple level of molecular data often suffer from various limitations, e. g., heterogeneity, poor robustness or loss of generality. To overcome these limitations, we presented the tree-based dimensionality reduction approach for the identification of smooth tree graph and developed accurate predictive model for clinical outcome prediction. We demonstrated that 1) Discriminative Dimensionality Reduction via learning a Tree (DDRTree) achieved reduced dimension space tree with statistical significance; 2) Tree based support vector machine (SVM) classifier improved prediction performance of cancer recurrence as compared to t-test based SVM classifier; 3) Tree based SVM classifier was robust with regard to the different number of multi-markers; 4) Combining multiple omics data improved prediction performance of cancer recurrence as compared to a single-omics data; and 5) Tree based SVM classifier achieved similar or better prediction performance when compared to the features from state-of-the-art feature selection methods. Our results demonstrated great potential of the tree-based dimensionality reduction approach based clinical outcome prediction.
准确的预后预测对于癌症的精准医学和个性化治疗至关重要。研究人员发现,多维癌症组学研究在人类疾病研究中优于每种数据类型(mRNA、microRNA、甲基化或体细胞拷贝数改变)的研究。现有的利用多个分子数据水平的方法往往存在各种局限性,例如异质性、稳健性差或通用性丧失。为了克服这些限制,我们提出了基于树的降维方法来识别平滑树图,并开发了用于临床结果预测的准确预测模型。我们证明了:1)通过学习树进行有判别力的维度减少(DDRTree)在统计上实现了降维空间树;2)基于树的支持向量机(SVM)分类器与基于 t 检验的 SVM 分类器相比,提高了癌症复发的预测性能;3)基于树的 SVM 分类器对不同数量的多标记具有稳健性;4)与单一组学数据相比,组合多个组学数据提高了癌症复发的预测性能;5)与最先进的特征选择方法的特征相比,基于树的 SVM 分类器具有相似或更好的预测性能。我们的结果表明,基于树的降维方法在临床结果预测方面具有巨大的潜力。