Yang Xinan, Li Haiquan, Regan Kelly, Li Jianrong, Huang Yong, Lussier Yves A
Center for Biomedical Informatics, Dept. of Medicine, Section of Hematology/Oncology, The University of Chicago, Chicago, IL 60637, USA.
AMIA Annu Symp Proc. 2012;2012:1040-9. Epub 2012 Nov 3.
We aim to provide clinically applicable, reproducible, mechanistic interpretations of gene expression changes that lack in gene overlap among predictive gene-signatures. Using a method we recently developed, Functional Analysis of Individual Microarray Expression (FAIME), we provide evidence that Gene Ontology-anchored signatures (GO-signatures) show reliable prognosis in lung cancer. In order to demonstrate the biological congruence and reproducibility of FAIME-derived mechanism classifiers, we chose a disease where gene expression classifiers signatures alone had failed to significantly stratify a larger collection of samples and that exhibited poor or no genetic overlap. For each patient in the two lung adenocarcinoma studies, personalized FAIME-profiles of GO biological processes are generated from genome-wide expression profiles. For both training studies, GO-signatures significantly associated to patient mortality were identified (Prediction Analysis for Microarrays; three-fold cross-validation). These two GO-signatures could effectively stratify patients from an independent validation cohort into sub-groups that show significant differences in disease-free survival (log-rank test P=0.019; P=0.001). Importantly, significant mechanism overlaps assessed by information-theory similarity were detected between the two GO-signatures (Fischer Exact Test p=0.001). Hence, together with machine learning technologies, FAIME could be utilized to develop an ontology-driven and expression-anchored prognostic signature that is personalized for an individual patient.
我们旨在对预测性基因特征中缺乏基因重叠的基因表达变化提供临床适用、可重复的机制性解释。使用我们最近开发的一种方法——个体微阵列表达功能分析(FAIME),我们提供证据表明基因本体锚定特征(GO特征)在肺癌中显示出可靠的预后。为了证明FAIME衍生的机制分类器的生物学一致性和可重复性,我们选择了一种疾病,在该疾病中仅基因表达分类器特征未能显著分层大量样本集合,并且该疾病表现出很少或没有基因重叠。对于两项肺腺癌研究中的每一位患者,从全基因组表达谱中生成GO生物学过程的个性化FAIME概况。对于两项训练研究,确定了与患者死亡率显著相关的GO特征(微阵列预测分析;三倍交叉验证)。这两个GO特征可以有效地将独立验证队列中的患者分层为在无病生存期有显著差异的亚组(对数秩检验P = 0.019;P = 0.001)。重要的是,通过信息论相似性评估的两个GO特征之间检测到显著的机制重叠(费舍尔精确检验p = 0.001)。因此,与机器学习技术一起,FAIME可用于开发一种本体驱动且基于表达的预后特征,该特征是针对个体患者个性化的。