一种基于逆转录定量聚合酶链反应（RT-qPCR）基因表达和临床协变量构建并验证预后生物标志物模型的策略。

A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates.

作者信息

Tournoud Maud, Larue Audrey, Cazalis Marie-Angelique, Venet Fabienne, Pachot Alexandre, Monneret Guillaume, Lepape Alain, Veyrieras Jean-Baptiste

机构信息

Bioinformatics Research Department, bioMérieux, Marcy L'Etoile, France.

Medical Diagnostic Discovery Department, bioMérieux, Marcy L'Etoile, France.

出版信息

BMC Bioinformatics. 2015 Mar 28;16:106. doi: 10.1186/s12859-015-0537-9.

DOI:10.1186/s12859-015-0537-9

PMID:25880752

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4384357/

Abstract

BACKGROUND

Construction and validation of a prognostic model for survival data in the clinical domain is still an active field of research. Nevertheless there is no consensus on how to develop routine prognostic tests based on a combination of RT-qPCR biomarkers and clinical or demographic variables. In particular, the estimation of the model performance requires to properly account for the RT-qPCR experimental design.

RESULTS

We present a strategy to build, select, and validate a prognostic model for survival data based on a combination of RT-qPCR biomarkers and clinical or demographic data and we provide an illustration on a real clinical dataset. First, we compare two cross-validation schemes: a classical outcome-stratified cross-validation scheme and an alternative one that accounts for the RT-qPCR plate design, especially when samples are processed by batches. The latter is intended to limit the performance discrepancies, also called the validation surprise, between the training and the test sets. Second, strategies for model building (covariate selection, functional relationship modeling, and statistical model) as well as performance indicators estimation are presented. Since in practice several prognostic models can exhibit similar performances, complementary criteria for model selection are discussed: the stability of the selected variables, the model optimism, and the impact of the omitted variables on the model performance.

CONCLUSION

On the training dataset, appropriate resampling methods are expected to prevent from any upward biases due to unaccounted technical and biological variability that may arise from the experimental and intrinsic design of the RT-qPCR assay. Moreover, the stability of the selected variables, the model optimism, and the impact of the omitted variables on the model performances are pivotal indicators to select the optimal model to be validated on the test dataset.

摘要

背景

临床领域生存数据预后模型的构建与验证仍是一个活跃的研究领域。然而，对于如何基于逆转录定量聚合酶链反应（RT-qPCR）生物标志物与临床或人口统计学变量的组合来开发常规预后测试，尚无共识。特别是，模型性能的估计需要适当考虑RT-qPCR实验设计。

结果

我们提出了一种基于RT-qPCR生物标志物与临床或人口统计学数据的组合来构建、选择和验证生存数据预后模型的策略，并在一个真实的临床数据集上进行了说明。首先，我们比较了两种交叉验证方案：经典的结果分层交叉验证方案和另一种考虑RT-qPCR板设计的方案，特别是当样本分批处理时。后者旨在限制训练集和测试集之间的性能差异，也称为验证意外。其次，介绍了模型构建策略（协变量选择、功能关系建模和统计模型）以及性能指标估计。由于在实践中几个预后模型可能表现出相似的性能，因此讨论了模型选择的补充标准：所选变量的稳定性、模型乐观性以及遗漏变量对模型性能的影响。

结论

在训练数据集上，预期适当的重采样方法可防止因RT-qPCR检测的实验和内在设计可能产生的未考虑的技术和生物学变异性而导致的任何向上偏差。此外，所选变量的稳定性、模型乐观性以及遗漏变量对模型性能的影响是选择要在测试数据集上验证的最佳模型的关键指标。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf9/4384357/48f98701414b/12859_2015_537_Fig1_HTML.jpg

相似文献

A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates.一种基于逆转录定量聚合酶链反应（RT-qPCR）基因表达和临床协变量构建并验证预后生物标志物模型的策略。

BMC Bioinformatics. 2015 Mar 28;16:106. doi: 10.1186/s12859-015-0537-9.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Technical validation of an RT-qPCR in vitro diagnostic test system for the determination of breast cancer molecular subtypes by quantification of ERBB2, ESR1, PGR and MKI67 mRNA levels from formalin-fixed paraffin-embedded breast tumor specimens.一种用于通过定量福尔马林固定石蜡包埋乳腺肿瘤标本中的ERBB2、ESR1、PGR和MKI67 mRNA水平来确定乳腺癌分子亚型的RT-qPCR体外诊断测试系统的技术验证。

BMC Cancer. 2016 Jul 7;16:398. doi: 10.1186/s12885-016-2476-x.

Gene expression analysis in biomarker research and early drug development using function tested reverse transcription quantitative real-time PCR assays.利用功能验证的反转录实时定量 PCR 分析方法进行生物标志物研究和早期药物开发中的基因表达分析。

Methods. 2013 Jan;59(1):10-9. doi: 10.1016/j.ymeth.2012.07.003. Epub 2012 Jul 14.

Identification and Validation of Reference Genes for RT-qPCR Studies of Hypoxia in Squamous Cervical Cancer Patients.宫颈癌鳞状细胞癌患者缺氧相关RT-qPCR研究中内参基因的鉴定与验证

PLoS One. 2016 May 31;11(5):e0156259. doi: 10.1371/journal.pone.0156259. eCollection 2016.

Identification and validation of a multigene predictor of recurrence in primary laryngeal cancer.原发性喉癌复发的多基因预测因子的鉴定和验证。

PLoS One. 2013 Aug 9;8(8):e70429. doi: 10.1371/journal.pone.0070429. eCollection 2013.

Development and evaluation of a novel RT-qPCR based test for the quantification of HER2 gene expression in breast cancer.一种基于实时定量聚合酶链反应（RT-qPCR）的新型检测方法用于乳腺癌中HER2基因表达定量的开发与评估。

Gene. 2017 Mar 20;605:114-122. doi: 10.1016/j.gene.2016.12.027. Epub 2016 Dec 28.

Substantial performance discrepancies among commercially available kits for reverse transcription quantitative polymerase chain reaction: a systematic comparative investigator-driven approach.市售逆转录定量聚合酶链反应试剂盒之间存在显著的性能差异：一种系统的、以调查者为驱动的比较方法。

Anal Biochem. 2010 Jun 15;401(2):303-11. doi: 10.1016/j.ab.2010.03.007. Epub 2010 Mar 10.

Disseminated single tumor cells as detected by real-time quantitative polymerase chain reaction represent a prognostic factor in patients undergoing surgery for colorectal cancer.通过实时定量聚合酶链反应检测到的播散性单个肿瘤细胞是接受结直肠癌手术患者的一个预后因素。

Ann Surg. 2002 Dec;236(6):768-75; discussion 775-6. doi: 10.1097/00000658-200212000-00009.

Effectiveness and cost-effectiveness of four different strategies for SARS-CoV-2 surveillance in the general population (CoV-Surv Study): a structured summary of a study protocol for a cluster-randomised, two-factorial controlled trial.在普通人群中进行 SARS-CoV-2 监测的四种不同策略的有效性和成本效益（CoV-Surv 研究）：一项关于集群随机、双因素对照试验的研究方案的结构化总结。

Trials. 2021 Jan 8;22(1):39. doi: 10.1186/s13063-020-04982-z.

引用本文的文献

Prediction of postoperative infection in elderly using deep learning-based analysis: an observational cohort study.基于深度学习的分析预测老年患者术后感染：一项观察性队列研究。

Aging Clin Exp Res. 2023 Mar;35(3):639-647. doi: 10.1007/s40520-022-02325-3. Epub 2023 Jan 4.

MMP11 and CD2 as novel prognostic factors in hormone receptor-negative, HER2-positive breast cancer.基质金属蛋白酶11和CD2作为激素受体阴性、人表皮生长因子受体2阳性乳腺癌的新型预后因素。

Breast Cancer Res Treat. 2017 Jul;164(1):41-56. doi: 10.1007/s10549-017-4234-4. Epub 2017 Apr 13.

Boosting the discriminatory power of sparse survival models via optimization of the concordance index and stability selection.通过优化一致性指数和稳定性选择提高稀疏生存模型的判别能力。

BMC Bioinformatics. 2016 Jul 22;17:288. doi: 10.1186/s12859-016-1149-8.

Straightforward and sensitive RT-qPCR based gene expression analysis of FFPE samples.基于FFPE样本的简单且灵敏的逆转录定量聚合酶链反应基因表达分析

Sci Rep. 2016 Feb 22;6:21418. doi: 10.1038/srep21418.

本文引用的文献

Validation of prediction models based on lasso regression with multiply imputed data.基于套索回归与多重填补数据的预测模型验证

BMC Med Res Methodol. 2014 Oct 16;14:116. doi: 10.1186/1471-2288-14-116.

Sample size requirements for training high-dimensional risk predictors.高维风险预测器训练的样本量要求。

Biostatistics. 2013 Sep;14(4):639-52. doi: 10.1093/biostatistics/kxt022. Epub 2013 Jul 19.

Variable selection for multiply-imputed data with application to dioxin exposure study.具有应用于二恶英暴露研究的多重插补数据的变量选择。

Stat Med. 2013 Sep 20;32(21):3646-59. doi: 10.1002/sim.5783. Epub 2013 Mar 25.

Mixed modeling and sample size calculations for identifying housekeeping genes.用于鉴定管家基因的混合建模和样本量计算。

Stat Med. 2013 Aug 15;32(18):3115-25. doi: 10.1002/sim.5768. Epub 2013 Feb 26.

An evaluation of resampling methods for assessment of survival risk prediction in high-dimensional settings.高维环境下评估生存风险预测的重采样方法评估。

Stat Med. 2011 Mar 15;30(6):642-53. doi: 10.1002/sim.4106. Epub 2010 Dec 1.

Using cross-validation to evaluate predictive accuracy of survival risk classifiers based on high-dimensional data.使用交叉验证评估基于高维数据的生存风险分类器的预测准确性。

Brief Bioinform. 2011 May;12(3):203-14. doi: 10.1093/bib/bbr001. Epub 2011 Feb 15.

Reporting performance of prognostic models in cancer: a review.报告癌症预后模型的性能：综述。

BMC Med. 2010 Mar 30;8:21. doi: 10.1186/1741-7015-8-21.

Robust biomarker identification for cancer diagnosis with ensemble feature selection methods.基于集成特征选择方法的癌症诊断稳健生物标志物识别。

Bioinformatics. 2010 Feb 1;26(3):392-8. doi: 10.1093/bioinformatics/btp630. Epub 2009 Nov 25.

Prognostic models: a methodological framework and review of models for breast cancer.预后模型：乳腺癌模型的方法框架与综述

Cancer Invest. 2009 Mar;27(3):235-43. doi: 10.1080/07357900802572110.

How should variable selection be performed with multiply imputed data?对于多重填补的数据，应如何进行变量选择？

Stat Med. 2008 Jul 30;27(17):3227-46. doi: 10.1002/sim.3177.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种基于逆转录定量聚合酶链反应（RT-qPCR）基因表达和临床协变量构建并验证预后生物标志物模型的策略。

A strategy to build and validate a prognostic biomarker model based on RT-qPCR gene expression and clinical covariates.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献