Abdilleh Kawther, Aguilar Boris, Acquaah-Mensah George
Pancreatic Cancer Action Network, El Segundo, California.
Institute for Systems Biology, Seattle, Washington.
Clin Breast Cancer. 2025 Apr;25(3):e301-e311. doi: 10.1016/j.clbc.2024.11.015. Epub 2024 Nov 28.
There are documented differences in Breast cancer (BrCA) presentations and outcomes between Black and White patients. In addition to molecular factors, socioeconomic, racial, and clinical factors result in disparities in outcomes for women in the United States. Using machine learning and unsupervised biclustering methods within a multiomics framework, here we sought to shed light on the biological and clinical underpinnings of observed differences between Black and White BrCA patients.
We examined The Cancer Genome Atlas BrCA samples from stage II patients aged 50 or younger that are Black (BAA50) or White (W50) (n = 139 patients; 36 BAA50 and 103 W50) These patients were chosen because marked differences in survival were observed in an earlier study. A variety of multiomic data sets were analyzed to further characterize the clinical and molecular disparities for insights.
We coupled RNAseq data with protein-protein interaction as well as BrCA-specific protein co-expression network data to identify 2 novel biclusters. These biclusters are significantly associated with clinical features including race, number of lymph nodes involved with disease, estrogen receptor status, progesterone receptor status and menopausal status. There were also differentially mutated genes. Using DNA methylation data, we identified differentially methylated genes. Machine learning algorithms were trained on differential methylation values of driver genes. The trained algorithms were successful in predicting the bicluster assignment of each sample.
These results demonstrate that there was a significant association between the cluster membership and BAA50 and W50 cohorts, indicating that these biclusters accurately stratify these cohorts.
有文献记载,黑人与白人乳腺癌(BrCA)患者在临床表现和预后方面存在差异。除分子因素外,社会经济、种族和临床因素也导致美国女性在预后方面存在差异。在此,我们利用多组学框架内的机器学习和无监督双聚类方法,试图揭示黑人和白人BrCA患者间观察到的差异背后的生物学和临床基础。
我们研究了来自年龄50岁及以下的II期患者的癌症基因组图谱BrCA样本,这些患者为黑人(BAA50)或白人(W50)(n = 139例患者;36例BAA50和103例W50)。选择这些患者是因为在早期研究中观察到了生存方面的显著差异。分析了各种多组学数据集,以进一步表征临床和分子差异,从而获得见解。
我们将RNA测序数据与蛋白质-蛋白质相互作用以及BrCA特异性蛋白质共表达网络数据相结合,以识别2个新的双聚类。这些双聚类与包括种族、疾病累及的淋巴结数量、雌激素受体状态、孕激素受体状态和绝经状态等临床特征显著相关。也存在差异突变基因。利用DNA甲基化数据,我们识别出了差异甲基化基因。在驱动基因的差异甲基化值上训练机器学习算法。训练后的算法成功预测了每个样本的双聚类归属。
这些结果表明,聚类成员与BAA50和W50队列之间存在显著关联,表明这些双聚类准确地对这些队列进行了分层。