• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

作为应对假设挖掘的一种对策的数据拆分:以腰痛预测因素的案例研究为例

Data splitting as a countermeasure against hypothesis fishing: with a case study of predictors for low back pain.

作者信息

Dahl Fredrik A, Grotle Margreth, Saltyte Benth Jūrate, Natvig Bård

机构信息

Helse Sør-Øst Health Services Research Centre, Akershus University Hospital, Lorenskog, Norway.

出版信息

Eur J Epidemiol. 2008;23(4):237-42. doi: 10.1007/s10654-008-9230-x. Epub 2008 Feb 21.

DOI:10.1007/s10654-008-9230-x
PMID:18288577
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2270357/
Abstract

There is growing concern in the scientific community that many published scientific findings may represent spurious patterns that are not reproducible in independent data sets. A reason for this is that significance levels or confidence intervals are often applied to secondary variables or sub-samples within the trial, in addition to the primary hypotheses (multiple hypotheses). This problem is likely to be extensive for population-based surveys, in which epidemiological hypotheses are derived after seeing the data set (hypothesis fishing). We recommend a data-splitting procedure to counteract this methodological problem, in which one part of the data set is used for identifying hypotheses, and the other is used for hypothesis testing. The procedure is similar to two-stage analysis of microarray data. We illustrate the process using a real data set related to predictors of low back pain at 14-year follow-up in a population initially free of low back pain. "Widespreadness" of pain (pain reported in several other places than the low back) was a statistically significant predictor, while smoking was not, despite its strong association with low back pain in the first half of the data set. We argue that the application of data splitting, in which an independent party handles the data set, will achieve for epidemiological surveys what pre-registration has done for clinical studies.

摘要

科学界越来越担心,许多已发表的科学发现可能代表了虚假模式,在独立数据集中无法重现。原因之一是,除了主要假设(多重假设)之外,显著性水平或置信区间通常还应用于试验中的次要变量或子样本。对于基于人群的调查而言,这个问题可能很普遍,在这类调查中,流行病学假设是在查看数据集之后得出的(假设捕捞)。我们建议采用一种数据拆分程序来应对这一方法学问题,即将数据集的一部分用于识别假设,另一部分用于假设检验。该程序类似于微阵列数据的两阶段分析。我们使用一个真实数据集来说明这个过程,该数据集与最初无腰痛人群14年随访时腰痛的预测因素有关。疼痛的“广泛性”(除腰部外其他几个部位也报告有疼痛)是一个具有统计学意义的预测因素,而吸烟则不是,尽管在数据集的前半部分吸烟与腰痛有很强的关联。我们认为,由独立方处理数据集的数据拆分应用,将为流行病学调查带来与临床研究预注册相同的效果。

相似文献

1
Data splitting as a countermeasure against hypothesis fishing: with a case study of predictors for low back pain.作为应对假设挖掘的一种对策的数据拆分:以腰痛预测因素的案例研究为例
Eur J Epidemiol. 2008;23(4):237-42. doi: 10.1007/s10654-008-9230-x. Epub 2008 Feb 21.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Potential risk factors for new onset of back pain disability in Japanese workers: findings from the Japan epidemiological research of occupation-related back pain study.日本职业性腰痛研究中日本工人新发腰痛残疾的潜在危险因素研究。
Spine (Phila Pa 1976). 2012 Jul 1;37(15):1324-33. doi: 10.1097/BRS.0b013e3182498382.
4
Occupational loading, health behavior and sleep disturbance as predictors of low-back pain.职业负荷、健康行为和睡眠障碍作为腰痛的预测因素。
Scand J Work Environ Health. 2008 Dec;34(6):411-9. doi: 10.5271/sjweh.1290.
5
Seeking care for low back pain in the general population: a two-year follow-up study: results from the MUSIC-Norrtälje Study.普通人群中腰痛患者的就医情况:一项为期两年的随访研究:来自于MUSIC - 诺尔雪平研究的结果
Spine (Phila Pa 1976). 2002 Oct 1;27(19):2159-65. doi: 10.1097/00007632-200210010-00016.
6
[Analysis of risk factors of nonspecific low back pain in a community population: a case-control study].[社区人群非特异性下腰痛的危险因素分析:一项病例对照研究]
Nan Fang Yi Ke Da Xue Xue Bao. 2014 Dec;34(12):1794-8.
7
Prevalence of neck and low back pain in community-dwelling adults in Spain: a population-based national study.颈痛和腰痛在西班牙社区居住成年人中的流行情况:一项基于人群的全国性研究。
Spine (Phila Pa 1976). 2011 Feb 1;36(3):E213-9. doi: 10.1097/BRS.0b013e3181d952c2.
8
Discographic, MRI and psychosocial determinants of low back pain disability and remission: a prospective study in subjects with benign persistent back pain.下腰痛残疾与缓解的椎间盘造影、MRI及社会心理决定因素:一项针对良性持续性背痛患者的前瞻性研究
Spine J. 2005 Jan-Feb;5(1):24-35. doi: 10.1016/j.spinee.2004.05.250.
9
The six-month incidence of clinically significant low back pain in the Saskatchewan adult population.萨斯喀彻温省成年人口中具有临床意义的下背痛的六个月发病率。
Spine (Phila Pa 1976). 2002 Aug 15;27(16):1778-82. doi: 10.1097/00007632-200208150-00017.
10
Why has the search for causes of low back pain largely been nonconclusive?为什么对腰痛病因的探寻大多没有定论?
Spine (Phila Pa 1976). 1997 Apr 15;22(8):877-81. doi: 10.1097/00007632-199704150-00010.

引用本文的文献

1
Hyperemesis gravidarum and the risk of emotional distress during and after pregnancy.妊娠剧吐与孕期及产后情绪困扰的风险
Arch Womens Ment Health. 2017 Dec;20(6):747-756. doi: 10.1007/s00737-017-0770-5. Epub 2017 Aug 25.
2
History of depression and risk of hyperemesis gravidarum: a population-based cohort study.抑郁症病史与妊娠剧吐风险:一项基于人群的队列研究。
Arch Womens Ment Health. 2017 Jun;20(3):397-404. doi: 10.1007/s00737-016-0713-6. Epub 2017 Jan 7.
3
Training frontline workforce on psychosis management: a prospective study of training effects.对一线工作人员进行精神病管理培训:培训效果的前瞻性研究
Int J Ment Health Syst. 2015 Nov 19;9:38. doi: 10.1186/s13033-015-0029-3. eCollection 2015.
4
Biomarkers of Nutrition for Development-Folate Review.营养促进发展的生物标志物——叶酸综述
J Nutr. 2015 Jul;145(7):1636S-1680S. doi: 10.3945/jn.114.206599. Epub 2015 Jun 3.
5
Analysis of liquid bead microarray antibody assay data for epidemiologic studies of pathogen-cancer associations.用于病原体与癌症关联流行病学研究的液滴微阵列抗体检测数据分析。
J Immunol Methods. 2015 Oct;425:45-50. doi: 10.1016/j.jim.2015.06.007. Epub 2015 Jun 11.
6
CTOs: what is the state of the evidence?慢性完全闭塞病变:证据现状如何?
Soc Psychiatry Psychiatr Epidemiol. 2014 Dec;49(12):1861-71. doi: 10.1007/s00127-014-0839-7. Epub 2014 Feb 22.
7
Study protocol: brief intervention for medication overuse headache--a double-blinded cluster randomised parallel controlled trial in primary care.研究方案:药物过度使用性头痛的简短干预——初级保健中的双盲、集群随机平行对照试验。
BMC Neurol. 2012 Aug 10;12:70. doi: 10.1186/1471-2377-12-70.
8
Does improved access to diagnostic imaging results reduce hospital length of stay? A retrospective study.改善诊断成像结果的可及性是否会缩短医院住院时间?一项回顾性研究。
BMC Health Serv Res. 2010 Sep 6;10:262. doi: 10.1186/1472-6963-10-262.
9
Do clinicians read our reports? Integrating the radiology information system with the electronic patient record: experiences from the first 2 years.临床医生会阅读我们的报告吗?将放射信息系统与电子病历整合:头两年的经验
Eur Radiol. 2009 Jan;19(1):31-6. doi: 10.1007/s00330-008-1098-3. Epub 2008 Aug 6.

本文引用的文献

1
Why most published research findings are false.为何大多数已发表的研究结果是错误的。
PLoS Med. 2005 Aug;2(8):e124. doi: 10.1371/journal.pmed.0020124. Epub 2005 Aug 30.
2
Standardised Nordic questionnaires for the analysis of musculoskeletal symptoms.用于分析肌肉骨骼症状的标准化北欧问卷。
Appl Ergon. 1987 Sep;18(3):233-7. doi: 10.1016/0003-6870(87)90010-x.
3
Two-stage designs for gene-disease association studies.基因-疾病关联研究的两阶段设计
Biometrics. 2002 Mar;58(1):163-70. doi: 10.1111/j.0006-341x.2002.00163.x.