关于高通量检测的统计可重复性和研究设计的说明

A note on statistical repeatability and study design for high-throughput assays.

作者信息

Nicholson George, Holmes Chris

机构信息

Department of Statistics, University of Oxford, 24-29 St Giles, Oxford, OX1 3LB, U.K.

出版信息

Stat Med. 2017 Feb 28;36(5):790-798. doi: 10.1002/sim.7175. Epub 2016 Nov 24.

DOI:10.1002/sim.7175

PMID:27882571

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5299465/

Abstract

Characterizing the technical precision of measurements is a necessary stage in the planning of experiments and in the formal sample size calculation for optimal design. Instruments that measure multiple analytes simultaneously, such as in high-throughput assays arising in biomedical research, pose particular challenges from a statistical perspective. The current most popular method for assessing precision of high-throughput assays is by scatterplotting data from technical replicates. Here, we question the statistical rationale of this approach from both an empirical and theoretical perspective, illustrating our discussion using four example data sets from different genomic platforms. We demonstrate that such scatterplots convey little statistical information of relevance and are potentially highly misleading. We present an alternative framework for assessing the precision of high-throughput assays and planning biomedical experiments. Our methods are based on repeatability-a long-established statistical quantity also known as the intraclass correlation coefficient. We provide guidance and software for estimation and visualization of repeatability of high-throughput assays, and for its incorporation into study design. © 2016 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.

摘要

表征测量的技术精度是实验规划以及进行最优设计的正式样本量计算的必要阶段。同时测量多种分析物的仪器，比如生物医学研究中出现的高通量检测，从统计学角度来看会带来特殊挑战。当前评估高通量检测精度最流行的方法是绘制技术重复数据的散点图。在此，我们从实证和理论角度质疑这种方法的统计学原理，并使用来自不同基因组平台的四个示例数据集来说明我们的讨论。我们证明此类散点图几乎没有传达相关的统计信息，并且可能极具误导性。我们提出了一个用于评估高通量检测精度和规划生物医学实验的替代框架。我们的方法基于重复性——一个早已确立的统计量，也称为组内相关系数。我们提供了用于估计和可视化高通量检测重复性以及将其纳入研究设计的指导和软件。© 2016作者。《医学统计学》由约翰·威利父子有限公司出版。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3a3a/5299465/5efbbc600104/SIM-36-790-g001.jpg

相似文献

A note on statistical repeatability and study design for high-throughput assays.

Stat Med. 2017 Feb 28;36(5):790-798. doi: 10.1002/sim.7175. Epub 2016 Nov 24.

Promises and Pitfalls of High-Throughput Biological Assays.

Methods Mol Biol. 2016;1415:225-43. doi: 10.1007/978-1-4939-3572-7_12.

A statistical framework for assessing pharmacological responses and biomarkers using uncertainty estimates.

Elife. 2020 Dec 4;9:e60352. doi: 10.7554/eLife.60352.

High-throughput micro-plate HCI-vanillin assay for screening tannin content in sorghum grain.

J Sci Food Agric. 2014 Aug;94(10):2133-6. doi: 10.1002/jsfa.6538. Epub 2014 Jan 24.

On Efficient Feature Ranking Methods for High-Throughput Data Analysis.

IEEE/ACM Trans Comput Biol Bioinform. 2015 Nov-Dec;12(6):1374-84. doi: 10.1109/TCBB.2015.2415790.

Analyzing matrices of meta-analytic correlations: current practices and recommendations.

Res Synth Methods. 2016 Jun;7(2):187-208. doi: 10.1002/jrsm.1206.

Filtering data from high-throughput experiments based on measurement reliability.

Proc Natl Acad Sci U S A. 2010 Nov 16;107(46):E173-4; author reply E175. doi: 10.1073/pnas.1010604107. Epub 2010 Nov 8.

Rediscovery rate estimation for assessing the validation of significant findings in high-throughput studies.

Brief Bioinform. 2015 Jul;16(4):563-75. doi: 10.1093/bib/bbu033. Epub 2014 Sep 24.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Correction for interference by test samples in high-throughput assays.

J Biomol Screen. 2009 Sep;14(8):1008-16. doi: 10.1177/1087057109341768. Epub 2009 Jul 30.

引用本文的文献

Systematic data analysis pipeline for quantitative morphological cell phenotyping.

Comput Struct Biotechnol J. 2024 Jul 14;23:2949-2962. doi: 10.1016/j.csbj.2024.07.012. eCollection 2024 Dec.

Employing multiple synchronous outcome samples per subject to improve study efficiency.

BMC Med Res Methodol. 2021 Oct 17;21(1):211. doi: 10.1186/s12874-021-01414-7.

A Bayesian non-parametric mixed-effects model of microbial growth curves.

PLoS Comput Biol. 2020 Oct 26;16(10):e1008366. doi: 10.1371/journal.pcbi.1008366. eCollection 2020 Oct.

本文引用的文献

A new initiative on precision medicine.

N Engl J Med. 2015 Feb 26;372(9):793-5. doi: 10.1056/NEJMp1500523. Epub 2015 Jan 30.

Points of significance: error bars.

Nat Methods. 2013 Oct;10(10):921-2. doi: 10.1038/nmeth.2659.

The case for using the repeatability coefficient when calculating test-retest reliability.

PLoS One. 2013 Sep 9;8(9):e73990. doi: 10.1371/journal.pone.0073990. eCollection 2013.

Statistical methods used to test for agreement of medical instruments measuring continuous variables in method comparison studies: a systematic review.

PLoS One. 2012;7(5):e37908. doi: 10.1371/journal.pone.0037908. Epub 2012 May 25.

Coexpression network analysis in abdominal and gluteal adipose tissue reveals regulatory genetic loci for metabolic syndrome and related phenotypes.

PLoS Genet. 2012;8(2):e1002505. doi: 10.1371/journal.pgen.1002505. Epub 2012 Feb 23.

MicroRNA expression in abdominal and gluteal adipose tissue is associated with mRNA expression levels and partly genetically driven.

PLoS One. 2011;6(11):e27338. doi: 10.1371/journal.pone.0027338. Epub 2011 Nov 15.

Variance decomposition of protein profiles from antibody arrays using a longitudinal twin model.

Proteome Sci. 2011 Nov 17;9:73. doi: 10.1186/1477-5956-9-73.

A genome-wide metabolic QTL analysis in Europeans implicates two loci shaped by recent positive selection.

PLoS Genet. 2011 Sep;7(9):e1002270. doi: 10.1371/journal.pgen.1002270. Epub 2011 Sep 8.

Validation of two ribosomal RNA removal methods for microbial metatranscriptomics.

Nat Methods. 2010 Oct;7(10):807-12. doi: 10.1038/nmeth.1507. Epub 2010 Sep 19.

Tackling the widespread and critical impact of batch effects in high-throughput data.

Nat Rev Genet. 2010 Oct;11(10):733-9. doi: 10.1038/nrg2825. Epub 2010 Sep 14.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

关于高通量检测的统计可重复性和研究设计的说明

A note on statistical repeatability and study design for high-throughput assays.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献