• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

非检测定量聚合酶链式反应数据的多重插补和直接估计。

Multiple imputation and direct estimation for qPCR data with non-detects.

机构信息

Department of Biostatistics and Computational Biology, University of Rochester Medical Center, 265 Crittenden Blvd., 14642, Rochester, NY, USA.

Department of Biomedical Genetics, University of Rochester Medical Center, 601 Elmwood Ave., 14642, Rochester, NY, USA.

出版信息

BMC Bioinformatics. 2020 Nov 26;21(1):545. doi: 10.1186/s12859-020-03807-9.

DOI:10.1186/s12859-020-03807-9
PMID:33243147
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7693525/
Abstract

BACKGROUND

Quantitative real-time PCR (qPCR) is one of the most widely used methods to measure gene expression. An important aspect of qPCR data that has been largely ignored is the presence of non-detects: reactions failing to exceed the quantification threshold and therefore lacking a measurement of expression. While most current software replaces these non-detects with a value representing the limit of detection, this introduces substantial bias in the estimation of both absolute and differential expression. Single imputation procedures, while an improvement on previously used methods, underestimate residual variance, which can lead to anti-conservative inference.

RESULTS

We propose to treat non-detects as non-random missing data, model the missing data mechanism, and use this model to impute missing values or obtain direct estimates of model parameters. To account for the uncertainty inherent in the imputation, we propose a multiple imputation procedure, which provides a set of plausible values for each non-detect. We assess the proposed methods via simulation studies and demonstrate the applicability of these methods to three experimental data sets. We compare our methods to mean imputation, single imputation, and a penalized EM algorithm incorporating non-random missingness (PEMM). The developed methods are implemented in the R/Bioconductor package nondetects.

CONCLUSIONS

The statistical methods introduced here reduce discrepancies in gene expression values derived from qPCR experiments in the presence of non-detects, providing increased confidence in downstream analyses.

摘要

背景

实时荧光定量 PCR(qPCR)是测量基因表达最广泛使用的方法之一。qPCR 数据的一个重要方面在很大程度上被忽视了,即存在无法检测到的情况:反应未能超过定量阈值,因此缺乏表达的测量。虽然大多数当前的软件用代表检测极限的值替换这些无法检测到的值,但这会对绝对和差异表达的估计引入很大的偏差。单插补程序虽然优于以前使用的方法,但低估了剩余方差,这可能导致反保守的推断。

结果

我们建议将无法检测到的情况视为随机缺失数据,对缺失数据机制进行建模,并使用该模型对缺失值进行插补或直接估计模型参数。为了考虑插补中固有的不确定性,我们提出了一种多重插补程序,为每个无法检测到的值提供一组合理的值。我们通过模拟研究评估了所提出的方法,并展示了这些方法在三个实验数据集上的适用性。我们将我们的方法与均值插补、单插补和纳入非随机缺失的惩罚 EM 算法(PEMM)进行了比较。所开发的方法在 R/Bioconductor 包 nondetects 中实现。

结论

这里介绍的统计方法减少了在存在无法检测到的情况下从 qPCR 实验中得出的基因表达值之间的差异,为下游分析提供了更大的信心。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efd0/7693525/90ecda5f7715/12859_2020_3807_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efd0/7693525/bf65110a14ef/12859_2020_3807_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efd0/7693525/90ecda5f7715/12859_2020_3807_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efd0/7693525/bf65110a14ef/12859_2020_3807_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efd0/7693525/90ecda5f7715/12859_2020_3807_Fig2_HTML.jpg

相似文献

1
Multiple imputation and direct estimation for qPCR data with non-detects.非检测定量聚合酶链式反应数据的多重插补和直接估计。
BMC Bioinformatics. 2020 Nov 26;21(1):545. doi: 10.1186/s12859-020-03807-9.
2
On non-detects in qPCR data.关于qPCR数据中的未检测到情况。
Bioinformatics. 2014 Aug 15;30(16):2310-6. doi: 10.1093/bioinformatics/btu239. Epub 2014 Apr 23.
3
Mechanism-aware imputation: a two-step approach in handling missing values in metabolomics.基于机制的插补:代谢组学中处理缺失值的两步法。
BMC Bioinformatics. 2022 May 16;23(1):179. doi: 10.1186/s12859-022-04659-1.
4
Multiple imputation with sequential penalized regression.多重插补与序贯惩罚回归。
Stat Methods Med Res. 2019 May;28(5):1311-1327. doi: 10.1177/0962280218755574. Epub 2018 Feb 16.
5
The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study.预后模型的性能取决于缺失值插补算法的选择:一项模拟研究。
J Clin Epidemiol. 2024 Dec;176:111539. doi: 10.1016/j.jclinepi.2024.111539. Epub 2024 Sep 24.
6
Multiple imputation using auxiliary imputation variables that only predict missingness can increase bias due to data missing not at random.仅使用辅助预测缺失变量的多重插补可能会因数据缺失而增加偏差。
BMC Med Res Methodol. 2024 Oct 7;24(1):231. doi: 10.1186/s12874-024-02353-9.
7
Accounting for bias due to outcome data missing not at random: comparison and illustration of two approaches to probabilistic bias analysis: a simulation study.考虑由于非随机缺失结局数据导致的偏倚:两种概率性偏倚分析方法的比较和说明:一项模拟研究。
BMC Med Res Methodol. 2024 Nov 13;24(1):278. doi: 10.1186/s12874-024-02382-4.
8
A nonparametric multiple imputation approach for missing categorical data.一种针对缺失分类数据的非参数多重填补方法。
BMC Med Res Methodol. 2017 Jun 6;17(1):87. doi: 10.1186/s12874-017-0360-2.
9
Missing value imputation in high-dimensional phenomic data: imputable or not, and how?高维表型组数据中的缺失值插补:是否可插补以及如何插补?
BMC Bioinformatics. 2014 Nov 5;15(1):346. doi: 10.1186/s12859-014-0346-6.
10
Collateral missing value imputation: a new robust missing value estimation algorithm for microarray data.并行缺失值插补:一种用于微阵列数据的新型稳健缺失值估计算法。
Bioinformatics. 2005 May 15;21(10):2417-23. doi: 10.1093/bioinformatics/bti345. Epub 2005 Feb 24.

引用本文的文献

1
Correction: Multiple imputation and direct estimation for qPCR data with non-detects.更正:对存在未检出值的qPCR数据进行多重填补和直接估计。
BMC Bioinformatics. 2024 Feb 7;25(1):63. doi: 10.1186/s12859-024-05653-5.
2
Epigenomic mapping reveals distinct B cell acute lymphoblastic leukemia chromatin architectures and regulators.表观基因组图谱揭示了不同的 B 细胞急性淋巴细胞白血病染色质结构和调控因子。
Cell Genom. 2023 Nov 20;3(12):100442. doi: 10.1016/j.xgen.2023.100442. eCollection 2023 Dec 13.
3
Prognostic MicroRNA Panel for HCV-Associated HCC: Integrating Computational Biology and Clinical Validation.

本文引用的文献

1
Practical data handling pipeline improves performance of qPCR-based circulating miRNA measurements.实用的数据处理流程提高了基于定量聚合酶链反应的循环微小核糖核酸测量的性能。
RNA. 2017 May;23(5):811-821. doi: 10.1261/rna.059063.116. Epub 2017 Feb 15.
2
A four gene signature predictive of recurrent prostate cancer.一种预测复发性前列腺癌的四基因特征。
Oncotarget. 2017 Jan 10;8(2):3430-3440. doi: 10.18632/oncotarget.13837.
3
limma powers differential expression analyses for RNA-sequencing and microarray studies.limma为RNA测序和微阵列研究提供差异表达分析的动力。
丙型肝炎病毒相关肝细胞癌的预后微小RNA检测:整合计算生物学与临床验证
Cancers (Basel). 2022 Jun 21;14(13):3036. doi: 10.3390/cancers14133036.
Nucleic Acids Res. 2015 Apr 20;43(7):e47. doi: 10.1093/nar/gkv007. Epub 2015 Jan 20.
4
On non-detects in qPCR data.关于qPCR数据中的未检测到情况。
Bioinformatics. 2014 Aug 15;30(16):2310-6. doi: 10.1093/bioinformatics/btu239. Epub 2014 Apr 23.
5
A penalized EM algorithm incorporating missing data mechanism for Gaussian parameter estimation.一种用于高斯参数估计的结合缺失数据机制的惩罚期望最大化算法。
Biometrics. 2014 Jun;70(2):312-22. doi: 10.1111/biom.12149. Epub 2014 Jan 28.
6
Data exploration, quality control and testing in single-cell qPCR-based gene expression experiments.单细胞 qPCR 基因表达实验中的数据探索、质量控制和测试。
Bioinformatics. 2013 Feb 15;29(4):461-7. doi: 10.1093/bioinformatics/bts714. Epub 2012 Dec 24.
7
Fitting Boolean networks from steady state perturbation data.根据稳态扰动数据拟合布尔网络。
Stat Appl Genet Mol Biol. 2011 Oct 5;10(1):/j/sagmb.2011.10.issue-1/1544-6115.1727/1544-6115.1727.xml. doi: 10.2202/1544-6115.1727.
8
Gene signature critical to cancer phenotype as a paradigm for anticancer drug discovery.基因特征作为抗癌药物发现的范例对癌症表型至关重要。
Oncogene. 2013 Aug 15;32(33):3809-18. doi: 10.1038/onc.2012.389. Epub 2012 Sep 10.
9
Differential expression analysis for sequence count data.差异表达分析序列计数数据。
Genome Biol. 2010;11(10):R106. doi: 10.1186/gb-2010-11-10-r106. Epub 2010 Oct 27.
10
Tackling the widespread and critical impact of batch effects in high-throughput data.解决高通量数据中广泛存在且极具影响力的批次效应问题。
Nat Rev Genet. 2010 Oct;11(10):733-9. doi: 10.1038/nrg2825. Epub 2010 Sep 14.