新型生物标志物的发现改善了乳腺癌内在亚型预测并协调了METABRIC数据集中的标签。

The Discovery of Novel Biomarkers Improves Breast Cancer Intrinsic Subtype Prediction and Reconciles the Labels in the METABRIC Data Set.

作者信息

Milioli Heloisa Helena, Vimieiro Renato, Riveros Carlos, Tishchenko Inna, Berretta Regina, Moscato Pablo

机构信息

Priority Research Centre for Bioinformatics, Biomarker Discovery and Information-Based Medicine, Hunter Medical Research Institute, New Lambton Heights, NSW, Australia; School of Environmental and Life Science, The University of Newcastle, Callaghan, NSW, Australia.

Priority Research Centre for Bioinformatics, Biomarker Discovery and Information-Based Medicine, Hunter Medical Research Institute, New Lambton Heights, NSW, Australia; Centro de Informática, Universidade Federal de Pernambuco, Recife, PE, Brazil.

出版信息

PLoS One. 2015 Jul 1;10(7):e0129711. doi: 10.1371/journal.pone.0129711. eCollection 2015.

DOI:10.1371/journal.pone.0129711

PMID:26132585

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4488510/

Abstract

BACKGROUND

The prediction of breast cancer intrinsic subtypes has been introduced as a valuable strategy to determine patient diagnosis and prognosis, and therapy response. The PAM50 method, based on the expression levels of 50 genes, uses a single sample predictor model to assign subtype labels to samples. Intrinsic errors reported within this assay demonstrate the challenge of identifying and understanding the breast cancer groups. In this study, we aim to: a) identify novel biomarkers for subtype individuation by exploring the competence of a newly proposed method named CM1 score, and b) apply an ensemble learning, as opposed to the use of a single classifier, for sample subtype assignment. The overarching objective is to improve class prediction.

METHODS AND FINDINGS

The microarray transcriptome data sets used in this study are: the METABRIC breast cancer data recorded for over 2000 patients, and the public integrated source from ROCK database with 1570 samples. We first computed the CM1 score to identify the probes with highly discriminative patterns of expression across samples of each intrinsic subtype. We further assessed the ability of 42 selected probes on assigning correct subtype labels using 24 different classifiers from the Weka software suite. For comparison, the same method was applied on the list of 50 genes from the PAM50 method.

CONCLUSIONS

The CM1 score portrayed 30 novel biomarkers for predicting breast cancer subtypes, with the confirmation of the role of 12 well-established genes. Intrinsic subtypes assigned using the CM1 list and the ensemble of classifiers are more consistent and homogeneous than the original PAM50 labels. The new subtypes show accurate distributions of current clinical markers ER, PR and HER2, and survival curves in the METABRIC and ROCK data sets. Remarkably, the paradoxical attribution of the original labels reinforces the limitations of employing a single sample classifiers to predict breast cancer intrinsic subtypes.

摘要

背景

乳腺癌内在亚型的预测已被视为确定患者诊断、预后及治疗反应的一项重要策略。基于50个基因表达水平的PAM50方法，使用单样本预测模型为样本分配亚型标签。该检测方法中报告的内在误差表明了识别和理解乳腺癌分组的挑战。在本研究中，我们旨在：a）通过探索一种新提出的名为CM1评分的方法的能力，识别用于亚型区分的新型生物标志物；b）应用集成学习，而非使用单个分类器，来进行样本亚型分配。总体目标是改善分类预测。

方法与结果

本研究中使用的微阵列转录组数据集为：记录了2000多名患者的METABRIC乳腺癌数据，以及来自ROCK数据库的包含1570个样本的公共综合数据源。我们首先计算CM1评分，以识别在各内在亚型样本中具有高度判别性表达模式的探针。我们进一步使用来自Weka软件套件的24种不同分类器，评估42个选定探针分配正确亚型标签的能力。为作比较，对PAM50方法中的50个基因列表应用相同方法。

结论

CM1评分描绘了30个用于预测乳腺癌亚型的新型生物标志物，同时证实了12个已确立基因的作用。使用CM1列表和分类器集合分配的内在亚型比原始的PAM50标签更一致、更均匀。新的亚型在METABRIC和ROCK数据集中显示出当前临床标志物ER、PR和HER2的准确分布以及生存曲线。值得注意的是，原始标签的矛盾归因强化了使用单样本分类器预测乳腺癌内在亚型的局限性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fee9/4488510/b2c02eed263b/pone.0129711.g001.jpg

相似文献

The Discovery of Novel Biomarkers Improves Breast Cancer Intrinsic Subtype Prediction and Reconciles the Labels in the METABRIC Data Set.

PLoS One. 2015 Jul 1;10(7):e0129711. doi: 10.1371/journal.pone.0129711. eCollection 2015.

Iteratively refining breast cancer intrinsic subtypes in the METABRIC dataset.

BioData Min. 2016 Jan 13;9:2. doi: 10.1186/s13040-015-0078-9. eCollection 2016.

Quantification of intrinsic subtype ambiguity in Luminal A breast cancer and its relationship to clinical outcomes.

BMC Cancer. 2019 Mar 8;19(1):215. doi: 10.1186/s12885-019-5392-z.

PAM50 breast cancer subtyping by RT-qPCR and concordance with standard clinical molecular markers.

BMC Med Genomics. 2012 Oct 4;5:44. doi: 10.1186/1755-8794-5-44.

Development and verification of the PAM50-based Prosigna breast cancer gene signature assay.

BMC Med Genomics. 2015 Aug 22;8:54. doi: 10.1186/s12920-015-0129-6.

Mixture classification model based on clinical markers for breast cancer prognosis.

Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.

A three-gene model to robustly identify breast cancer molecular subtypes.

J Natl Cancer Inst. 2012 Feb 22;104(4):311-25. doi: 10.1093/jnci/djr545. Epub 2012 Jan 18.

Discordance of the PAM50 Intrinsic Subtypes Compared with Immunohistochemistry-Based Surrogate in Breast Cancer Patients: Potential Implication of Genomic Alterations of Discordance.

Cancer Res Treat. 2019 Apr;51(2):737-747. doi: 10.4143/crt.2018.342. Epub 2018 Sep 5.

A deep learning image-based intrinsic molecular subtype classifier of breast tumors reveals tumor heterogeneity that may affect survival.

Breast Cancer Res. 2020 Jan 28;22(1):12. doi: 10.1186/s13058-020-1248-3.

Prediction consistency and clinical presentations of breast cancer molecular subtypes for Han Chinese population.

J Transl Med. 2012 Sep 19;10 Suppl 1(Suppl 1):S10. doi: 10.1186/1479-5876-10-S1-S10.

引用本文的文献

Comprehensive multi-omics analysis of breast cancer reveals distinct long-term prognostic subtypes.

Oncogenesis. 2024 Jun 13;13(1):22. doi: 10.1038/s41389-024-00521-6.

Systemically Identifying Triple-Negative Breast Cancer Subtype-Specific Prognosis Signatures, Based on Single-Cell RNA-Seq Data.

Cells. 2023 Jan 19;12(3):367. doi: 10.3390/cells12030367.

MiR-205 suppressed the malignant behaviors of breast cancer cells by targeting CLDN11 via modulation of the epithelial-to-mesenchymal transition.

Aging (Albany NY). 2021 May 8;13(9):13073-13086. doi: 10.18632/aging.202988.

Pathway-Based Drug-Repurposing Schemes in Cancer: The Role of Translational Bioinformatics.

Front Oncol. 2021 Jan 14;10:605680. doi: 10.3389/fonc.2020.605680. eCollection 2020.

Identification of novel prognostic biomarkers in renal cell carcinoma.

Aging (Albany NY). 2020 Nov 21;12(24):25304-25318. doi: 10.18632/aging.104131.

Integrative Network Fusion: A Multi-Omics Approach in Molecular Profiling.

Front Oncol. 2020 Jun 30;10:1065. doi: 10.3389/fonc.2020.01065. eCollection 2020.

CLCA2 is a positive regulator of store-operated calcium entry and TMEM16A.

PLoS One. 2018 May 14;13(5):e0196512. doi: 10.1371/journal.pone.0196512. eCollection 2018.

Robust genomic copy number predictor of pan cancer metastasis.

Genes Cancer. 2018 Jan;9(1-2):66-77. doi: 10.18632/genesandcancer.165.

Protein biomarkers for subtyping breast cancer and implications for future research.

Expert Rev Proteomics. 2018 Feb;15(2):131-152. doi: 10.1080/14789450.2018.1421071. Epub 2018 Jan 3.

A Joint Bayesian Model for Integrating Microarray and RNA Sequencing Transcriptomic Data.

J Comput Biol. 2017 Jul;24(7):647-662. doi: 10.1089/cmb.2017.0056. Epub 2017 May 25.

本文引用的文献

Comparison of frequencies and prognostic effect of molecular subtypes between young and elderly breast cancer patients.

Mol Oncol. 2014 Jul;8(5):1014-25. doi: 10.1016/j.molonc.2014.03.022. Epub 2014 Apr 8.

Estrogen-dependent sushi domain containing 3 regulates cytoskeleton organization and migration in breast cancer cells.

Oncogene. 2015 Jan 15;34(3):323-33. doi: 10.1038/onc.2013.553. Epub 2014 Jan 13.

The CDK1 inhibitor RO3306 improves the response of BRCA-proﬁcient breast cancer cells to PARP inhibition.

Int J Oncol. 2014 Mar;44(3):735-44. doi: 10.3892/ijo.2013.2240. Epub 2013 Dec 31.

Differential network analysis applied to preoperative breast cancer chemotherapy response.

PLoS One. 2013 Dec 9;8(12):e81784. doi: 10.1371/journal.pone.0081784. eCollection 2013.

Novel functional assay for spindle-assembly checkpoint by cyclin-dependent kinase activity to predict taxane chemosensitivity in breast tumor patient.

J Cancer. 2013 Nov 14;4(9):697-702. doi: 10.7150/jca.6248. eCollection 2013.

Prognostic discrimination using a 70-gene signature among patients with estrogen receptor-positive breast cancer and an intermediate 21-gene recurrence score.

Int J Mol Sci. 2013 Dec 4;14(12):23685-99. doi: 10.3390/ijms141223685.

Dysregulation of microRNA expression drives aberrant DNA hypermethylation in basal-like breast cancer.

Int J Oncol. 2014 Feb;44(2):563-72. doi: 10.3892/ijo.2013.2197. Epub 2013 Nov 29.

Pds5B is required for cohesion establishment and Aurora B accumulation at centromeres.

EMBO J. 2013 Nov 13;32(22):2938-49. doi: 10.1038/emboj.2013.230. Epub 2013 Oct 18.

Stromal matrix metalloproteinase-11 is involved in the mammary gland postnatal development.

Oncogene. 2014 Jul 31;33(31):4050-9. doi: 10.1038/onc.2013.434. Epub 2013 Oct 21.

Additive effect of the AZGP1, PIP, S100A8 and UBE2C molecular biomarkers improves outcome prediction in breast carcinoma.

Int J Cancer. 2014 Apr 1;134(7):1617-29. doi: 10.1002/ijc.28497. Epub 2013 Oct 12.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

新型生物标志物的发现改善了乳腺癌内在亚型预测并协调了METABRIC数据集中的标签。

The Discovery of Novel Biomarkers Improves Breast Cancer Intrinsic Subtype Prediction and Reconciles the Labels in the METABRIC Data Set.

作者信息

机构信息

出版信息

BACKGROUND

METHODS AND FINDINGS

CONCLUSIONS

背景

方法与结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献