Yu Xinjian, Liu Yongjing, Chen Ming
Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China.
Quantitative and Computational Biosciences Graduate Program, Baylor College of Medicine, Houston, TX 77030, USA.
Cancers (Basel). 2022 May 24;14(11):2571. doi: 10.3390/cancers14112571.
Triple-negative breast cancer (TNBC) is a heterogeneous disease with diverse, often poor prognoses and treatment responses. In order to identify targetable biomarkers and guide personalized care, scientists have developed multiple molecular classification systems for TNBC based on transcriptomic profiling. However, there is no consensus on the molecular subtypes of TNBC, likely due to discrepancies in technical and computational methods used by different research groups. Here, we reassessed the major steps for TNBC subtyping, validated the reproducibility of established TNBC subtypes, and identified two more subtypes with a larger sample size. By comparing results from different workflows, we demonstrated the limitations of formalin-fixed, paraffin-embedded samples, as well as batch effect removal across microarray platforms. We also refined the usage of computational tools for TNBC subtyping. Furthermore, we integrated high-quality multi-institutional TNBC datasets (discovery set: = 457; validation set: = 165). Performing unsupervised clustering on the discovery and validation sets independently, we validated four previously discovered subtypes: luminal androgen receptor, mesenchymal, immunomodulatory, and basal-like immunosuppressed. Additionally, we identified two potential intermediate states of TNBC tumors based on their resemblance with more than one well-characterized subtype. In summary, we addressed the issues and limitations of previous TNBC subtyping through comprehensive analyses. Our results promote the rational design of future subtyping studies and provide new insights into TNBC patient stratification.
三阴性乳腺癌(TNBC)是一种异质性疾病,预后和治疗反应多样,通常较差。为了识别可靶向的生物标志物并指导个性化治疗,科学家们基于转录组分析开发了多种TNBC分子分类系统。然而,TNBC的分子亚型尚无共识,这可能是由于不同研究小组使用的技术和计算方法存在差异。在此,我们重新评估了TNBC亚型分类的主要步骤,验证了已确立的TNBC亚型的可重复性,并通过更大样本量确定了另外两种亚型。通过比较不同工作流程的结果,我们证明了福尔马林固定石蜡包埋样本的局限性,以及跨微阵列平台去除批次效应的情况。我们还完善了TNBC亚型分类计算工具的使用。此外,我们整合了高质量的多机构TNBC数据集(发现集:n = 457;验证集:n = 165)。在发现集和验证集上分别进行无监督聚类,我们验证了四种先前发现的亚型:腔面雄激素受体型、间充质型、免疫调节型和基底样免疫抑制型。此外,我们根据TNBC肿瘤与一种以上特征明确的亚型的相似性,确定了两种潜在的中间状态。总之,我们通过综合分析解决了先前TNBC亚型分类中的问题和局限性。我们的结果促进了未来亚型分类研究的合理设计,并为TNBC患者分层提供了新的见解。