Increasing reproducibility, robustness, and generalizability of biomarker selection from meta-analysis using Bayesian methodology.

Affiliations

Institute for Immunity, Transplantation and Infection, School of Medicine, Stanford University, Stanford, California, United States of America.

Center for Biomedical Informatics Research, Department of Medicine, Stanford University, Stanford, California, United States of America.

Publication information

PLoS Comput Biol. 2022 Jun 27;18(6):e1010260. doi: 10.1371/journal.pcbi.1010260. eCollection 2022 Jun.

DOI: 10.1371/journal.pcbi.1010260

PMID: 35759523

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9269905/

Abstract

A major limitation of gene expression biomarker studies is that they are not reproducible as they simply do not generalize to larger, real-world, heterogeneous populations. Frequentist multi-cohort gene expression meta-analysis has been frequently used as a solution to this problem to identify biomarkers that are truly differentially expressed. However, the frequentist meta-analysis framework has its limitations: it needs at least 4-5 datasets with hundreds of samples, is prone to confounding from outliers, and relies on multiple-hypothesis-corrected p-values. To address these shortcomings, we have created a Bayesian meta-analysis framework for the analysis of gene expression data. Using real-world data from three different diseases, we show that the Bayesian method is more robust to outliers, creates more informative estimates of between-study heterogeneity, reduces the number of false positive and false negative biomarkers, and selects more generalizable biomarkers with less data. We have compared the Bayesian framework to a previously published frequentist framework and have developed a publicly available R package for use.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c0e/9269905/f9583898a4a1/pcbi.1010260.g001.jpg
