• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有序贯检验的研究任务中的决策制定。

Decision-making in research tasks with sequential testing.

作者信息

Pfeiffer Thomas, Rand David G, Dreber Anna

机构信息

Program for Evolutionary Dynamics, Harvard University, Cambridge, Massachusetts, United States of America.

出版信息

PLoS One. 2009;4(2):e4607. doi: 10.1371/journal.pone.0004607. Epub 2009 Feb 25.

DOI:10.1371/journal.pone.0004607
PMID:19240797
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2643008/
Abstract

BACKGROUND

In a recent controversial essay, published by JPA Ioannidis in PLoS Medicine, it has been argued that in some research fields, most of the published findings are false. Based on theoretical reasoning it can be shown that small effect sizes, error-prone tests, low priors of the tested hypotheses and biases in the evaluation and publication of research findings increase the fraction of false positives. These findings raise concerns about the reliability of research. However, they are based on a very simple scenario of scientific research, where single tests are used to evaluate independent hypotheses.

METHODOLOGY/PRINCIPAL FINDINGS: In this study, we present computer simulations and experimental approaches for analyzing more realistic scenarios. In these scenarios, research tasks are solved sequentially, i.e. subsequent tests can be chosen depending on previous results. We investigate simple sequential testing and scenarios where only a selected subset of results can be published and used for future rounds of test choice. Results from computer simulations indicate that for the tasks analyzed in this study, the fraction of false among the positive findings declines over several rounds of testing if the most informative tests are performed. Our experiments show that human subjects frequently perform the most informative tests, leading to a decline of false positives as expected from the simulations.

CONCLUSIONS/SIGNIFICANCE: For the research tasks studied here, findings tend to become more reliable over time. We also find that the performance in those experimental settings where not all performed tests could be published turned out to be surprisingly inefficient. Our results may help optimize existing procedures used in the practice of scientific research and provide guidance for the development of novel forms of scholarly communication.

摘要

背景

在JPA·约阿尼季斯发表于《公共科学图书馆·医学》的一篇近期颇具争议的文章中,有人认为在某些研究领域,大多数已发表的研究结果是错误的。基于理论推理可以表明,效应量小、易出错的检验、所检验假设的先验概率低以及研究结果评估和发表过程中的偏差会增加假阳性的比例。这些发现引发了对研究可靠性的担忧。然而,它们基于一种非常简单的科学研究情景,即使用单一检验来评估独立假设。

方法/主要发现:在本研究中,我们提出了用于分析更现实情景的计算机模拟和实验方法。在这些情景中,研究任务是依次解决的,即后续检验可根据先前结果来选择。我们研究了简单的序贯检验以及仅能发表选定结果子集并将其用于未来检验选择轮次的情景。计算机模拟结果表明,对于本研究中分析的任务,如果进行信息性最强的检验,在多轮检验中阳性结果中的假结果比例会下降。我们的实验表明,人类受试者经常进行信息性最强的检验,导致假阳性如模拟预期那样下降。

结论/意义:对于此处研究的研究任务,随着时间推移,研究结果往往会变得更可靠。我们还发现,在并非所有进行的检验都能发表的实验环境中,其表现出人意料地低效。我们的结果可能有助于优化科学研究实践中使用的现有程序,并为新型学术交流形式的发展提供指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d76/2643008/e7083d226e3e/pone.0004607.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d76/2643008/6ceaf8daad11/pone.0004607.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d76/2643008/441143a49b6a/pone.0004607.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d76/2643008/2886111e09f5/pone.0004607.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d76/2643008/e7083d226e3e/pone.0004607.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d76/2643008/6ceaf8daad11/pone.0004607.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d76/2643008/441143a49b6a/pone.0004607.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d76/2643008/2886111e09f5/pone.0004607.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d76/2643008/e7083d226e3e/pone.0004607.g004.jpg

相似文献

1
Decision-making in research tasks with sequential testing.具有序贯检验的研究任务中的决策制定。
PLoS One. 2009;4(2):e4607. doi: 10.1371/journal.pone.0004607. Epub 2009 Feb 25.
2
Are most published research findings false in a continuous universe?在一个连续的宇宙中,大多数已发表的研究结果都是错误的吗?
PLoS One. 2022 Dec 20;17(12):e0277935. doi: 10.1371/journal.pone.0277935. eCollection 2022.
3
Replication, Communication, and the Population Dynamics of Scientific Discovery.复制、交流与科学发现的种群动态
PLoS One. 2015 Aug 26;10(8):e0136088. doi: 10.1371/journal.pone.0136088. eCollection 2015.
4
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
5
Large-scale assessment of the effect of popularity on the reliability of research.对知名度对研究可靠性影响的大规模评估。
PLoS One. 2009 Jun 24;4(6):e5996. doi: 10.1371/journal.pone.0005996.
6
Student and educator experiences of maternal-child simulation-based learning: a systematic review of qualitative evidence protocol.基于母婴模拟学习的学生和教育工作者体验:定性证据协议的系统评价
JBI Database System Rev Implement Rep. 2015 Jan;13(1):14-26. doi: 10.11124/jbisrir-2015-1694.
7
Is N-Hacking Ever OK? The consequences of collecting more data in pursuit of statistical significance.N 操弄是否可以接受?为了追求统计学意义而收集更多数据的后果。
PLoS Biol. 2023 Nov 1;21(11):e3002345. doi: 10.1371/journal.pbio.3002345. eCollection 2023 Nov.
8
Publication bias and the canonization of false facts.发表性偏倚与虚假事实的公认化
Elife. 2016 Dec 20;5:e21451. doi: 10.7554/eLife.21451.
9
Subgroup analyses in randomised controlled trials: quantifying the risks of false-positives and false-negatives.随机对照试验中的亚组分析:量化假阳性和假阴性风险
Health Technol Assess. 2001;5(33):1-56. doi: 10.3310/hta5330.
10
Gray (Literature) Matters: Evidence of Selective Hypothesis Reporting in Social Psychological Research.灰色文献很重要:社会心理学研究中选择性假说报告的证据。
Pers Soc Psychol Bull. 2020 Sep;46(9):1344-1362. doi: 10.1177/0146167220903896. Epub 2020 Feb 24.

引用本文的文献

1
When decision heuristics and science collide.当决策启发法与科学发生冲突时。
Psychon Bull Rev. 2014 Apr;21(2):268-82. doi: 10.3758/s13423-013-0495-z.
2
Quantifying selective reporting and the Proteus phenomenon for multiple datasets with similar bias.对多个数据集进行相似偏倚的选择性报告和变形虫现象的定量分析。
PLoS One. 2011 Mar 29;6(3):e18362. doi: 10.1371/journal.pone.0018362.
3
An experiment on prediction markets in science.科学中预测市场的实验。

本文引用的文献

1
A wiki for the life sciences where authorship matters.一个重视作者身份的生命科学维基网站。
Nat Genet. 2008 Sep;40(9):1047-51. doi: 10.1038/ng.f.217.
2
Persistence of contradicted claims in the literature.文献中相互矛盾的观点持续存在。
JAMA. 2007 Dec 5;298(21):2517-26. doi: 10.1001/jama.298.21.2517.
3
Temporal patterns of genes in scientific publications.科学出版物中基因的时间模式。
PLoS One. 2009 Dec 30;4(12):e8500. doi: 10.1371/journal.pone.0008500.
Proc Natl Acad Sci U S A. 2007 Jul 17;104(29):12052-6. doi: 10.1073/pnas.0701315104. Epub 2007 Jul 9.
4
Commentary: rare alleles, modest genetic effects and the need for collaboration.评论:罕见等位基因、适度的遗传效应以及合作的必要性。
Int J Epidemiol. 2007 Apr;36(2):445-8. doi: 10.1093/ije/dym055. Epub 2007 Apr 30.
5
Digital cows grazing on digital grounds.数字奶牛在数字草地上吃草。
Curr Biol. 2006 Nov 21;16(22):R946-9. doi: 10.1016/j.cub.2006.10.011.
6
Evolution and translation of research findings: from bench to where?研究成果的演变与转化:从实验室到何处?
PLoS Clin Trials. 2006 Nov 17;1(7):e36. doi: 10.1371/journal.pctr.0010036.
7
Microparadigms: chains of collective reasoning in publications about molecular interactions.微观范式:关于分子相互作用的出版物中的集体推理链
Proc Natl Acad Sci U S A. 2006 Mar 28;103(13):4940-5. doi: 10.1073/pnas.0600591103. Epub 2006 Mar 16.
8
Finding useful questions: on Bayesian diagnosticity, probability, impact, and information gain.寻找有用的问题:关于贝叶斯诊断性、概率、影响和信息增益
Psychol Rev. 2005 Oct;112(4):979-99. doi: 10.1037/0033-295X.112.4.979.
9
Why most published research findings are false.为何大多数已发表的研究结果是错误的。
PLoS Med. 2005 Aug;2(8):e124. doi: 10.1371/journal.pmed.0020124. Epub 2005 Aug 30.
10
Contradicted and initially stronger effects in highly cited clinical research.在高被引临床研究中相互矛盾且最初更强的效应。
JAMA. 2005 Jul 13;294(2):218-28. doi: 10.1001/jama.294.2.218.