Koo Bonil, Lee Dohoon, Lee Sangseon, Sung Inyoung, Kim Sun, Lee Sunho
Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, Korea.
Bioinformatics Institute, Seoul National University, Seoul 08826, Korea.
Cancers (Basel). 2022 Aug 25;14(17):4120. doi: 10.3390/cancers14174120.
Patient stratification is a clinically important task because it allows us to establish and develop efficient treatment strategies for particular groups of patients. Molecular subtypes have been successfully defined using transcriptomic profiles, and they are used effectively in clinical practice, e.g., PAM50 subtypes of breast cancer. Survival prediction contributed to understanding diseases and also identifying genes related to prognosis. It is desirable to stratify patients considering these two aspects simultaneously. However, there are no methods for patient stratification that consider molecular subtypes and survival outcomes at once. Here, we propose a methodology to deal with the problem. A genetic algorithm is used to select a gene set from transcriptome data, and their expression quantities are utilized to assign a risk score to each patient. The patients are ordered and stratified according to the score. A gene set was selected by our method on a breast cancer cohort (TCGA-BRCA), and we examined its clinical utility using an independent cohort (SCAN-B). In this experiment, our method was successful in stratifying patients with respect to both molecular subtype and survival outcome. We demonstrated that the orders of patients were consistent across repeated experiments, and prognostic genes were successfully nominated. Additionally, it was observed that the risk score can be used to evaluate the molecular aggressiveness of individual patients.
患者分层是一项具有重要临床意义的任务,因为它使我们能够为特定患者群体制定和开发有效的治疗策略。利用转录组谱已成功定义了分子亚型,并且它们在临床实践中得到了有效应用,例如乳腺癌的PAM50亚型。生存预测有助于理解疾病并识别与预后相关的基因。期望同时考虑这两个方面对患者进行分层。然而,目前尚无同时考虑分子亚型和生存结果的患者分层方法。在此,我们提出一种解决该问题的方法。使用遗传算法从转录组数据中选择一组基因,并利用它们的表达量为每位患者分配一个风险评分。根据该评分对患者进行排序和分层。我们的方法在一个乳腺癌队列(TCGA - BRCA)上选择了一组基因,并使用一个独立队列(SCAN - B)检验了其临床效用。在这个实验中,我们的方法成功地在分子亚型和生存结果两方面对患者进行了分层。我们证明了在重复实验中患者的排序是一致的,并且成功地确定了预后基因。此外,观察到风险评分可用于评估个体患者的分子侵袭性。