Cai D Q, Cai DianKui, Zhao Zhen, Zheng Zehao, Jian Zhixiang, Shi Mude, Chen Yajin, Chen Jueming, Lin Ye
Guangdong Cardiovascular Institute, Department of General Surgery, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Southern Medical University, Guangzhou, Guangdong Province, 510080, China.
Department of Hepatobiliary Surgery, Sun Yat-Sen Memorial Hospital, Guangzhou, Guangdong Province, 510120, China; Guangdong Laboratory, Guangzhou, Guangdong Province, 510320, China.
Comput Biol Med. 2025 Sep;196(Pt A):110685. doi: 10.1016/j.compbiomed.2025.110685. Epub 2025 Jul 6.
Considerable evidence highlights the intricate association between liquid-liquid phase separation (LLPS) and tumorigenesis, progression, and therapy resistance. However, there has been limited exploration of the role of LLPS in hepatoblastoma (HB). This study integrates machine learning techniques with single-cell RNA sequencing (scRNA-seq) to systematically analyze the molecular features of LLPS-associated genes in HB and establish the first LLPS-based prognostic prediction model for HB.
RNA-seq data from the Gene Expression Omnibus (GEO) database were utilized, integrating GSE133039 and GSE75271 as the training cohort and GSE81928 and GSE132037 as the validation cohort. Differential expression analysis was performed on the training cohort, identifying 124 HB-specific differentially expressed genes. Weighted gene co-expression network analysis (WGCNA) was then applied to identify four functional modules containing 11,757 genes. A set of 3612 known LLPS-related genes was used to filter 11 hub genes. Hub genes were further selected through Random Forest and Support Vector Machine-Recursive Feature Elimination (SVM-RFE), with ZCCHC12, CDH13, and CDKN2A identified as the optimal candidates. A LLPS-based risk score (LlpsHBScore) was constructed using Lasso regression, and its performance was evaluated in the testing cohort. The correlation between the immune microenvironment and LlpsHBScore was analyzed using CIBERSORT, EPIC, MCPcounter, and quantiSeq algorithms, and the results were validated using the scRNA-seq dataset GSE186975.
In both the training and testing cohort, as well as single-cell dataset, expression levels of ZCCHC12, CDH13, and CDKN2A were significantly upregulated in HB. The LlpsHBScore was positively correlated with the expression of these genes. The nomogram based on LlpsHBScore effectively diagnosed HB and accurately predicted the 2-year and 5-year survival rates of patients. Further analysis revealed that the LlpsHBScore was significantly correlated with the expression of drug resistance gene clusters and the infiltration level of CD8 T cells.
This study identifies elevated expression levels of ZCCHC12, CDH13, and CDKN2A in HB, contributing to a better understanding of their potential involvement in the disease. We successfully developed a machine learning-based risk score (LlpsHBScore), and a visual diagnostic tool (nomogram) with high clinical application potential. These findings offer new insights into the molecular subtyping, drug resistance mechanisms, and optimization of targeted therapies for HB.
大量证据凸显了液-液相分离(LLPS)与肿瘤发生、进展及治疗耐药性之间的复杂关联。然而,LLPS在肝母细胞瘤(HB)中的作用探索有限。本研究将机器学习技术与单细胞RNA测序(scRNA-seq)相结合,系统分析HB中LLPS相关基因的分子特征,并建立首个基于LLPS的HB预后预测模型。
利用来自基因表达综合数据库(GEO)的RNA-seq数据,将GSE133039和GSE75271作为训练队列,GSE81928和GSE132037作为验证队列。对训练队列进行差异表达分析,鉴定出124个HB特异性差异表达基因。随后应用加权基因共表达网络分析(WGCNA)来识别包含11,757个基因的四个功能模块。使用一组3612个已知的LLPS相关基因筛选出11个枢纽基因。通过随机森林和支持向量机递归特征消除(SVM-RFE)进一步选择枢纽基因,并确定ZCCHC12、CDH13和CDKN2A为最佳候选基因。使用套索回归构建基于LLPS的风险评分(LlpsHBScore),并在测试队列中评估其性能。使用CIBERSORT、EPIC、MCPcounter和quantiSeq算法分析免疫微环境与LlpsHBScore之间的相关性,并使用scRNA-seq数据集GSE186975验证结果。
在训练队列、测试队列以及单细胞数据集中,HB中ZCCHC12、CDH13和CDKN2A的表达水平均显著上调。LlpsHBScore与这些基因的表达呈正相关。基于LlpsHBScore的列线图有效诊断了HB,并准确预测了患者的2年和5年生存率。进一步分析表明,LlpsHBScore与耐药基因簇的表达及CD8 T细胞的浸润水平显著相关。
本研究确定了HB中ZCCHC12、CDH13和CDKN2A的表达水平升高,有助于更好地理解它们在该疾病中的潜在作用。我们成功开发了一种基于机器学习的风险评分(LlpsHBScore)和一种具有高临床应用潜力的可视化诊断工具(列线图)。这些发现为HB的分子亚型分类、耐药机制及靶向治疗优化提供了新见解。