Suppr超能文献

用于预测特定亚型乳腺癌事件的评估方案。

An evaluation protocol for subtype-specific breast cancer event prediction.

机构信息

Molecular Diagnostics Department, Philips Research, Eindhoven, The Netherlands.

出版信息

PLoS One. 2011;6(7):e21681. doi: 10.1371/journal.pone.0021681. Epub 2011 Jul 8.

Abstract

In recent years increasing evidence appeared that breast cancer may not constitute a single disease at the molecular level, but comprises a heterogeneous set of subtypes. This suggests that instead of building a single monolithic predictor, better predictors might be constructed that solely target samples of a designated subtype, which are believed to represent more homogeneous sets of samples. An unavoidable drawback of developing subtype-specific predictors, however, is that a stratification by subtype drastically reduces the number of samples available for their construction. As numerous studies have indicated sample size to be an important factor in predictor construction, it is therefore questionable whether the potential benefit of subtyping can outweigh the drawback of a severe loss in sample size. Factors like unequal class distributions and differences in the number of samples per subtype, further complicate comparisons. We present a novel experimental protocol that facilitates a comprehensive comparison between subtype-specific predictors and predictors that do not take subtype information into account. Emphasis lies on careful control of sample size as well as class and subtype distributions. The methodology is applied to a large breast cancer compendium involving over 1500 arrays, using a state-of-the-art subtyping scheme. We show that the resulting subtype-specific predictors outperform those that do not take subtype information into account, especially when taking sample size considerations into account.

摘要

近年来,越来越多的证据表明,乳腺癌在分子水平上可能不是单一疾病,而是由一组异质的亚型组成。这表明,与其构建单一的整体预测器,不如构建更好的仅针对指定亚型样本的预测器,这些样本被认为代表了更同质的样本集。然而,开发特定于亚型的预测器不可避免的缺点是,通过亚型分层会大大减少用于构建预测器的样本数量。由于许多研究表明样本量是预测器构建的一个重要因素,因此,是否可以通过亚组分析来获得潜在的益处,是否可以超过样本量严重减少的缺点,这是值得怀疑的。像类别的不均衡分布和每个亚型的样本数量差异等因素,进一步使比较变得复杂。我们提出了一种新的实验方案,可以在特定于亚型的预测器和不考虑亚型信息的预测器之间进行全面比较。重点是仔细控制样本量以及类别和亚型分布。该方法应用于一个包含超过 1500 个阵列的大型乳腺癌文献,使用了一种最先进的亚组分析方案。我们表明,由此产生的特定于亚型的预测器优于那些不考虑亚型信息的预测器,尤其是在考虑样本量的情况下。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19cd/3132736/91d16e85d702/pone.0021681.g001.jpg

相似文献

1
An evaluation protocol for subtype-specific breast cancer event prediction.
PLoS One. 2011;6(7):e21681. doi: 10.1371/journal.pone.0021681. Epub 2011 Jul 8.
2
Breast cancer subtype predictors revisited: from consensus to concordance?
BMC Med Genomics. 2016 Jun 3;9(1):26. doi: 10.1186/s12920-016-0185-6.
3
A three-gene model to robustly identify breast cancer molecular subtypes.
J Natl Cancer Inst. 2012 Feb 22;104(4):311-25. doi: 10.1093/jnci/djr545. Epub 2012 Jan 18.
5
Module-based outcome prediction using breast cancer compendia.
PLoS One. 2007 Oct 17;2(10):e1047. doi: 10.1371/journal.pone.0001047.
6
Study design requirements for RNA sequencing-based breast cancer diagnostics.
Sci Rep. 2016 Feb 1;6:20200. doi: 10.1038/srep20200.
8
A high-risk luminal A dominant breast cancer subtype with increased mobility.
Breast Cancer Res Treat. 2019 Jun;175(2):459-472. doi: 10.1007/s10549-019-05135-w. Epub 2019 Feb 19.
9
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

引用本文的文献

1
Predicting gastrointestinal drug effects using contextualized metabolic models.
PLoS Comput Biol. 2019 Jun 26;15(6):e1007100. doi: 10.1371/journal.pcbi.1007100. eCollection 2019 Jun.
4
A data similarity-based strategy for meta-analysis of transcriptional profiles in cancer.
PLoS One. 2013;8(1):e54979. doi: 10.1371/journal.pone.0054979. Epub 2013 Jan 29.
5
Reuse of public genome-wide gene expression data.
Nat Rev Genet. 2013 Feb;14(2):89-99. doi: 10.1038/nrg3394. Epub 2012 Dec 27.

本文引用的文献

1
Class prediction for high-dimensional class-imbalanced data.
BMC Bioinformatics. 2010 Oct 20;11:523. doi: 10.1186/1471-2105-11-523.
2
Clinical implementation of the intrinsic subtypes of breast cancer.
Lancet Oncol. 2010 Aug;11(8):718-9; author reply 720-1. doi: 10.1016/S1470-2045(10)70176-5.
3
Breast cancer molecular profiling with single sample predictors: a retrospective analysis.
Lancet Oncol. 2010 Apr;11(4):339-49. doi: 10.1016/S1470-2045(10)70008-5. Epub 2010 Feb 22.
4
A fuzzy gene expression-based computational approach improves breast cancer prognostication.
Genome Biol. 2010;11(2):R18. doi: 10.1186/gb-2010-11-2-r18. Epub 2010 Feb 15.
5
7
Can survival prediction be improved by merging gene expression data sets?
PLoS One. 2009 Oct 23;4(10):e7431. doi: 10.1371/journal.pone.0007431.
8
Effects of sample size on robustness and prediction accuracy of a prognostic gene signature.
BMC Bioinformatics. 2009 May 16;10:147. doi: 10.1186/1471-2105-10-147.
9
Supervised risk predictor of breast cancer based on intrinsic subtypes.
J Clin Oncol. 2009 Mar 10;27(8):1160-7. doi: 10.1200/JCO.2008.18.1370. Epub 2009 Feb 9.
10
arrayQualityMetrics--a bioconductor package for quality assessment of microarray data.
Bioinformatics. 2009 Feb 1;25(3):415-6. doi: 10.1093/bioinformatics/btn647. Epub 2008 Dec 23.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验