Zhang Peng, Qin Meizhong, Li Fen, Hu Kunpeng, Huang He, Li Cuicui
Department of Thyroid And Breast Surgery, the Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, 510630, Guangdong Province, P.R. China.
Department of Gastrointestinal Surgery, the Third Affiliated Hospital of Sun Yat-sen University Lingnan Hospital, Guangzhou, 510530, Guangdong Province, P.R. China.
Discov Oncol. 2025 Jun 23;16(1):1186. doi: 10.1007/s12672-025-02918-0.
Thyroid cancer (THCA) exhibits high molecular heterogeneity, posing challenges for precise prognosis and personalized therapy. Most existing models rely on single-omics data and limited algorithms, reducing robustness and clinical value.
We integrated five omics layers from THCA patients using eleven clustering algorithms to identify molecular subtypes. Based on stable prognosis-related genes (SPRGs), we applied 99 combinations of ten machine learning methods to construct a robust prognostic model-Consensus Machine Learning-Driven Signature (CMLS). The model was validated across multiple internal and external cohorts. Immunogenomic characteristics and drug sensitivity were also evaluated.
Three molecular subtypes (CS1-CS3) with distinct clinical outcomes and molecular features were identified; CS2 showed the worst prognosis. A nine-gene CMLS was established, demonstrating strong prognostic performance across cohorts. Patients in the low-CMLS group had better outcomes, stronger immune infiltration, higher TMB/TNB, and greater predicted responsiveness to immunotherapy. Conversely, the high-CMLS group exhibited poor prognosis and lower immunotherapy sensitivity. Drug screening identified six candidate agents for high-CMLS patients.
Our study provides a robust multiomics-based classification of THCA and develops a clinically relevant CMLS model for prognostic prediction and therapy guidance. These findings may facilitate risk stratification and inform personalized treatment strategies in clinical practice.
甲状腺癌(THCA)表现出高度的分子异质性,这给精确预后和个性化治疗带来了挑战。大多数现有模型依赖单一组学数据和有限的算法,降低了稳健性和临床价值。
我们使用十一种聚类算法整合了THCA患者的五个组学层面,以识别分子亚型。基于稳定的预后相关基因(SPRGs),我们应用十种机器学习方法的99种组合来构建一个稳健的预后模型——共识机器学习驱动特征(CMLS)。该模型在多个内部和外部队列中进行了验证。还评估了免疫基因组特征和药物敏感性。
识别出三种具有不同临床结果和分子特征的分子亚型(CS1 - CS3);CS2显示出最差的预后。建立了一个九基因的CMLS,在各队列中显示出强大的预后性能。低CMLS组的患者有更好的预后、更强的免疫浸润、更高的肿瘤突变负荷/肿瘤新抗原负荷,以及对免疫治疗更高的预测反应性。相反,高CMLS组预后较差且免疫治疗敏感性较低。药物筛选确定了六种针对高CMLS患者的候选药物。
我们的研究提供了一种基于多组学的稳健的THCA分类方法,并开发了一个与临床相关的CMLS模型用于预后预测和治疗指导。这些发现可能有助于临床实践中的风险分层并为个性化治疗策略提供依据。