Suppr超能文献

分子分类器验证实践的实证评估。

An empirical assessment of validation practices for molecular classifiers.

机构信息

Institute for Clinical Research and Health Policy Studies at Tufts Medical Center, USA.

出版信息

Brief Bioinform. 2011 May;12(3):189-202. doi: 10.1093/bib/bbq073. Epub 2011 Feb 7.

Abstract

Proposed molecular classifiers may be overfit to idiosyncrasies of noisy genomic and proteomic data. Cross-validation methods are often used to obtain estimates of classification accuracy, but both simulations and case studies suggest that, when inappropriate methods are used, bias may ensue. Bias can be bypassed and generalizability can be tested by external (independent) validation. We evaluated 35 studies that have reported on external validation of a molecular classifier. We extracted information on study design and methodological features, and compared the performance of molecular classifiers in internal cross-validation versus external validation for 28 studies where both had been performed. We demonstrate that the majority of studies pursued cross-validation practices that are likely to overestimate classifier performance. Most studies were markedly underpowered to detect a 20% decrease in sensitivity or specificity between internal cross-validation and external validation [median power was 36% (IQR, 21-61%) and 29% (IQR, 15-65%), respectively]. The median reported classification performance for sensitivity and specificity was 94% and 98%, respectively, in cross-validation and 88% and 81% for independent validation. The relative diagnostic odds ratio was 3.26 (95% CI 2.04-5.21) for cross-validation versus independent validation. Finally, we reviewed all studies (n = 758) which cited those in our study sample, and identified only one instance of additional subsequent independent validation of these classifiers. In conclusion, these results document that many cross-validation practices employed in the literature are potentially biased and genuine progress in this field will require adoption of routine external validation of molecular classifiers, preferably in much larger studies than in current practice.

摘要

提出的分子分类器可能过度拟合于嘈杂的基因组和蛋白质组数据的特征。交叉验证方法通常用于获得分类准确性的估计,但模拟和案例研究都表明,当使用不适当的方法时,可能会出现偏差。通过外部(独立)验证可以避免偏差并测试可推广性。我们评估了 35 项报告分子分类器外部验证的研究。我们提取了关于研究设计和方法特征的信息,并比较了 28 项同时进行内部交叉验证和外部验证的研究中分子分类器的性能。我们证明,大多数研究采用的交叉验证实践很可能高估了分类器的性能。大多数研究在内部交叉验证和外部验证之间检测灵敏度或特异性降低 20%的能力明显不足[中位数效能分别为 36%(IQR,21%-61%)和 29%(IQR,15%-65%)]。报告的分类性能中位数为灵敏度和特异性分别为 94%和 98%,交叉验证和 88%和 81%,独立验证。交叉验证与独立验证的相对诊断比值比为 3.26(95%CI 2.04-5.21)。最后,我们回顾了所有引用我们研究样本中研究的(n=758)文献,并仅发现一次对这些分类器进行额外独立验证的实例。总之,这些结果表明,文献中使用的许多交叉验证实践可能存在偏差,而该领域的真正进展将需要采用分子分类器的常规外部验证,最好是在比当前实践更大的研究中进行。

相似文献

1
An empirical assessment of validation practices for molecular classifiers.分子分类器验证实践的实证评估。
Brief Bioinform. 2011 May;12(3):189-202. doi: 10.1093/bib/bbq073. Epub 2011 Feb 7.
4
Reviewing ensemble classification methods in breast cancer.综述乳腺癌中的集成分类方法。
Comput Methods Programs Biomed. 2019 Aug;177:89-112. doi: 10.1016/j.cmpb.2019.05.019. Epub 2019 May 20.

引用本文的文献

1
4
PREDICTION OF HEREDITARY CANCERS USING NEURAL NETWORKS.使用神经网络预测遗传性癌症
Ann Appl Stat. 2022 Mar;16(1):495-520. doi: 10.1214/21-aoas1510. Epub 2022 Mar 28.

本文引用的文献

1
Over-optimism in bioinformatics: an illustration.生物信息学中的过度乐观:一个例证。
Bioinformatics. 2010 Aug 15;26(16):1990-8. doi: 10.1093/bioinformatics/btq323. Epub 2010 Jun 26.
2
Expectations, validity, and reality in omics.组学中的期望、有效性和现实。
J Clin Epidemiol. 2010 Sep;63(9):945-9. doi: 10.1016/j.jclinepi.2010.04.002. Epub 2010 Jun 22.
8
Reporting bias when using real data sets to analyze classification performance.使用真实数据集分析分类性能时的报告偏倚。
Bioinformatics. 2010 Jan 1;26(1):68-76. doi: 10.1093/bioinformatics/btp605. Epub 2009 Oct 21.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验