Tian Suyan
Division of Clinical Research, The First Hospital of Jilin University, Changchun, Jilin 130021, P.R. China.
Oncol Lett. 2018 Jun;15(6):8545-8555. doi: 10.3892/ol.2018.8418. Epub 2018 Apr 4.
Lung cancer (LC) is a leading cause of cancer-associated mortalities worldwide. Adenocarcinoma (AC) and squamous cell carcinoma (SCC) account for ~70% of all cases of LC. Since AC and SCC are two distinct diseases, their corresponding prognostic genes associated with patient survival time are expected to be different. To date, only a few studies have distinguished patients with good prognosis from those with poor prognosis for each specific subtype. In the present study, the Cox filter model, a feature selection algorithm that identifies subtype-specific prognostic genes to incorporate pathway information and eliminate redundant genes, was adopted. By applying the proposed model to data on non-small cell lung cancer (NSCLC), it was demonstrated that both redundant gene elimination and search space restriction can improve the predictive capacity and the model stability of resulting prognostic gene signatures. To conclude, a pre-filtering procedure that incorporates pathway information for screening likely irrelevant genes prior to complex downstream analysis is recommended. Furthermore, a feature selection algorithm that considers redundant gene elimination may be preferable to one without such a consideration.
肺癌(LC)是全球癌症相关死亡的主要原因。腺癌(AC)和鳞状细胞癌(SCC)占所有LC病例的约70%。由于AC和SCC是两种不同的疾病,预计它们与患者生存时间相关的相应预后基因会有所不同。迄今为止,只有少数研究针对每种特定亚型区分了预后良好和预后不良的患者。在本研究中,采用了Cox过滤模型,这是一种特征选择算法,可识别特定亚型的预后基因,以纳入通路信息并消除冗余基因。通过将所提出的模型应用于非小细胞肺癌(NSCLC)数据,结果表明冗余基因消除和搜索空间限制均可提高所得预后基因特征的预测能力和模型稳定性。总之,建议在进行复杂的下游分析之前,采用一种纳入通路信息以筛选可能无关基因的预过滤程序。此外,考虑冗余基因消除的特征选择算法可能比不考虑此类因素的算法更可取。