• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

荟萃分析中异质性检验的困境:来自一项模拟研究的挑战。

The dilemma of heterogeneity tests in meta-analysis: a challenge from a simulation study.

作者信息

Li Shi-jun, Jiang Hua, Yang Hao, Chen Wei, Peng Jin, Sun Ming-wei, Lu Charles Damien, Peng Xi, Zeng Jun

机构信息

Department of Computational Mathematics and Bio-statistics, Metabolomics and Multidisciplinary Laboratory for Trauma Research, Sichuan Provincial People's Hospital, Sichuan Academy of Medical Sciences, Chengdu, Sichuan, China.

Department of Computational Mathematics and Bio-statistics, Metabolomics and Multidisciplinary Laboratory for Trauma Research, Sichuan Provincial People's Hospital, Sichuan Academy of Medical Sciences, Chengdu, Sichuan, China; Department of Parenteral and Enteral Nutrition, Peking Union Medical College Hospital, Beijing, China; Institute of Burn Research, Southwest Hospital of the Third Military Medical University, Chongqing, China.

出版信息

PLoS One. 2015 May 29;10(5):e0127538. doi: 10.1371/journal.pone.0127538. eCollection 2015.

DOI:10.1371/journal.pone.0127538
PMID:26023932
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4449216/
Abstract

INTRODUCTION

After several decades' development, meta-analysis has become the pillar of evidence-based medicine. However, heterogeneity is still the threat to the validity and quality of such studies. Currently, Q and its descendant I(2) (I square) tests are widely used as the tools for heterogeneity evaluation. The core mission of this kind of test is to identify data sets from similar populations and exclude those are from different populations. Although Q and I(2) are used as the default tool for heterogeneity testing, the work we present here demonstrates that the robustness of these two tools is questionable.

METHODS AND FINDINGS

We simulated a strictly normalized population S. The simulation successfully represents randomized control trial data sets, which fits perfectly with the theoretical distribution (experimental group: p = 0.37, control group: p = 0.88). And we randomly generate research samples Si that fits the population with tiny distributions. In short, these data sets are perfect and can be seen as completely homogeneous data from the exactly same population. If Q and I(2) are truly robust tools, the Q and I(2) testing results on our simulated data sets should not be positive. We then synthesized these trials by using fixed model. Pooled results indicated that the mean difference (MD) corresponds highly with the true values, and the 95% confidence interval (CI) is narrow. But, when the number of trials and sample size of trials enrolled in the meta-analysis are substantially increased; the Q and I(2) values also increase steadily. This result indicates that I(2) and Q are only suitable for testing heterogeneity amongst small sample size trials, and are not adoptable when the sample sizes and the number of trials increase substantially.

CONCLUSIONS

Every day, meta-analysis studies which contain flawed data analysis are emerging and passed on to clinical practitioners as "updated evidence". Using this kind of evidence that contain heterogeneous data sets leads to wrong conclusion, makes chaos in clinical practice and weakens the foundation of evidence-based medicine. We suggest more strict applications of meta-analysis: it should only be applied to those synthesized trials with small sample sizes. We call upon that the tools of evidence-based medicine should keep up-to-dated with the cutting-edge technologies in data science. Clinical research data should be made available publicly when there is any relevant article published so the research community could conduct in-depth data mining, which is a better alternative for meta-analysis in many instances.

摘要

引言

经过几十年的发展,荟萃分析已成为循证医学的支柱。然而,异质性仍然是此类研究有效性和质量的威胁。目前,Q检验及其衍生的I²(I方)检验被广泛用作异质性评估工具。这类检验的核心任务是识别来自相似总体的数据集,并排除那些来自不同总体的数据集。尽管Q检验和I²检验被用作异质性检验的默认工具,但我们在此展示的工作表明这两种工具的稳健性值得怀疑。

方法与结果

我们模拟了一个严格标准化的总体S。该模拟成功地代表了随机对照试验数据集,与理论分布完美契合(实验组:p = 0.37,对照组:p = 0.88)。并且我们随机生成符合该总体微小分布的研究样本Si。简而言之,这些数据集是完美的,可以被视为来自完全相同总体的完全同质的数据。如果Q检验和I²检验是真正稳健的工具,那么对我们模拟数据集进行Q检验和I²检验的结果不应为阳性。然后我们使用固定模型对这些试验进行综合分析。合并结果表明,平均差值(MD)与真实值高度相符,且95%置信区间(CI)较窄。但是,当荟萃分析中纳入的试验数量和试验样本量大幅增加时,Q值和I²值也会稳步上升。这一结果表明,I²检验和Q检验仅适用于检验小样本量试验之间的异质性,当样本量和试验数量大幅增加时则不适用。

结论

每天,都有包含有缺陷数据分析的荟萃分析研究出现,并作为“更新的证据”传递给临床医生。使用这种包含异质数据集的证据会导致错误的结论,在临床实践中造成混乱,并削弱循证医学的基础。我们建议对荟萃分析进行更严格的应用:它应该仅应用于那些小样本量的综合试验。我们呼吁循证医学工具应跟上数据科学的前沿技术。当有任何相关文章发表时,临床研究数据应公开提供,以便研究界能够进行深入的数据挖掘,在许多情况下,这是荟萃分析的更好替代方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/0fa6de5264f2/pone.0127538.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/6e4e785df0e1/pone.0127538.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/61771c4e3247/pone.0127538.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/cc1eb3c87094/pone.0127538.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/51e8c17e0764/pone.0127538.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/a0e2196ec8ab/pone.0127538.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/320248577cbf/pone.0127538.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/0fa6de5264f2/pone.0127538.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/6e4e785df0e1/pone.0127538.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/61771c4e3247/pone.0127538.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/cc1eb3c87094/pone.0127538.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/51e8c17e0764/pone.0127538.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/a0e2196ec8ab/pone.0127538.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/320248577cbf/pone.0127538.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6623/4449216/0fa6de5264f2/pone.0127538.g007.jpg

相似文献

1
The dilemma of heterogeneity tests in meta-analysis: a challenge from a simulation study.荟萃分析中异质性检验的困境:来自一项模拟研究的挑战。
PLoS One. 2015 May 29;10(5):e0127538. doi: 10.1371/journal.pone.0127538. eCollection 2015.
2
[Meta-analysis of the Italian studies on short-term effects of air pollution].[意大利关于空气污染短期影响研究的荟萃分析]
Epidemiol Prev. 2001 Mar-Apr;25(2 Suppl):1-71.
3
Meta-analysis: Problems with Russian Publications.荟萃分析:俄罗斯出版物存在的问题。
Int J Risk Saf Med. 2015;27 Suppl 1:S89-90. doi: 10.3233/JRS-150702.
4
Assessing heterogeneity in meta-analysis: Q statistic or I2 index?评估荟萃分析中的异质性:Q统计量还是I²指数?
Psychol Methods. 2006 Jun;11(2):193-206. doi: 10.1037/1082-989X.11.2.193.
5
The effect of publication bias on the Q test and assessment of heterogeneity.发表偏倚对 Q 检验和异质性评估的影响。
Psychol Methods. 2019 Feb;24(1):116-134. doi: 10.1037/met0000197. Epub 2018 Nov 29.
6
Multicity study of air pollution and mortality in Latin America (the ESCALA study).拉丁美洲空气污染与死亡率的多城市研究(ESCALA研究)。
Res Rep Health Eff Inst. 2012 Oct(171):5-86.
7
Effectiveness of interventions that assist caregivers to support people with dementia living in the community: a systematic review.干预措施对帮助照顾者支持社区中痴呆症患者的有效性:系统评价。
Int J Evid Based Healthc. 2008 Jun;6(2):137-72. doi: 10.1111/j.1744-1609.2008.00090.x.
8
A double-blind, randomized study to assess the validity of applied kinesiology (AK) as a diagnostic tool and as a nonlocal proximity effect.一项双盲随机研究,旨在评估应用运动机能学(AK)作为诊断工具以及作为非局部邻近效应的有效性。
Explore (NY). 2014 Mar-Apr;10(2):99-108. doi: 10.1016/j.explore.2013.12.002. Epub 2013 Dec 18.
9
The (non)comparability of the correlation effect size across different measurement procedures: a challenge to meta-analysis as a tool for identifying "evidence based practices".
J Evid Based Soc Work. 2011 May;8(3):253-74. doi: 10.1080/15433710903346574.
10
Small group learning in medical education: a second look at the Springer, Stanne, and Donovan meta-analysis.医学教育中的小组学习:对施普林格、斯坦内和多诺万元分析的再审视
Teach Learn Med. 2003 Winter;15(1):2-5. doi: 10.1207/S15328015TLM1501_01.

引用本文的文献

1
Pancreatic stone protein as a novel biomarker of microvascular complications in type II diabetes mellitus: A systematic review and meta-analysis.胰石蛋白作为2型糖尿病微血管并发症的新型生物标志物:一项系统评价和荟萃分析
Tzu Chi Med J. 2025 Mar 5;37(3):328-338. doi: 10.4103/tcmj.tcmj_211_24. eCollection 2025 Jul-Sep.
2
Impact of awake extracorporeal membrane oxygenation on patients mortality with cardiogenic shock: a systematic review and trial sequential meta-analysis based on observational studies.体外膜肺氧合对心源性休克患者死亡率的影响:基于观察性研究的系统评价和试验序贯荟萃分析。
BMJ Open. 2024 Oct 29;14(10):e086383. doi: 10.1136/bmjopen-2024-086383.
3

本文引用的文献

1
Undue reliance on I(2) in assessing heterogeneity may mislead.在评估异质性时过度依赖I²可能会产生误导。
BMC Med Res Methodol. 2008 Nov 27;8:79. doi: 10.1186/1471-2288-8-79.
2
Measuring inconsistency in meta-analyses.评估荟萃分析中的异质性
BMJ. 2003 Sep 6;327(7414):557-60. doi: 10.1136/bmj.327.7414.557.
Impact of imaging-diagnosed sarcopenia on outcomes in patients with biliary tract cancer after surgical resection: a systematic review and meta-analysis.
影像学诊断的肌肉减少症对胆道癌患者手术后结局的影响:系统评价和荟萃分析。
World J Surg Oncol. 2024 Sep 2;22(1):229. doi: 10.1186/s12957-024-03516-0.
4
Effects of early enteral nutrition in patients with severe burns: A systematic review.严重烧伤患者早期肠内营养的效果:系统评价。
Medicine (Baltimore). 2024 Feb 16;103(7):e37023. doi: 10.1097/MD.0000000000037023.
5
Acute kidney injury and its predictors among HIV-positive patients in Africa: Systematic review and meta-analysis.非洲 HIV 阳性患者的急性肾损伤及其预测因素:系统评价和荟萃分析。
PLoS One. 2024 Feb 9;19(2):e0298302. doi: 10.1371/journal.pone.0298302. eCollection 2024.
6
Efficacy of permissive underfeeding for critically ill patients: an updated systematic review and trial sequential meta-analysis.允许性低热量喂养对危重症患者的疗效:一项更新的系统评价和试验序贯Meta分析
J Intensive Care. 2024 Jan 23;12(1):4. doi: 10.1186/s40560-024-00717-3.
7
The association between sedentary behaviour and sarcopenia in older adults: a systematic review and meta-analysis.久坐行为与老年人肌少症的关系:系统评价和荟萃分析。
BMC Geriatr. 2023 Dec 20;23(1):877. doi: 10.1186/s12877-023-04489-7.
8
Efficacy of omega-3 fatty acids for hospitalized COVID-19 patients: a systematic review and meta-analysis of randomized controlled trials.ω-3 脂肪酸对住院 COVID-19 患者的疗效:随机对照试验的系统评价和荟萃分析。
Asia Pac J Clin Nutr. 2023 Sep;32(3):308-320. doi: 10.6133/apjcn.202309_32(3).0002.
9
Self-expanding metal stents versus decompression tubes as a bridge to surgery for patients with obstruction caused by colorectal cancer: a systematic review and meta-analysis.自膨式金属支架与减压管桥接治疗结直肠癌所致梗阻患者的手术:系统评价和荟萃分析。
World J Emerg Surg. 2023 Sep 27;18(1):46. doi: 10.1186/s13017-023-00515-6.
10
Optimizing enteral nutrition delivery by implementing volume-based feeding protocol for critically ill patients: an updated meta-analysis and systematic review.通过实施基于容量的危重症患者肠内营养输送方案来优化肠内营养输送:一项更新的荟萃分析和系统评价。
Crit Care. 2023 May 5;27(1):173. doi: 10.1186/s13054-023-04439-0.