• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

缺乏经验的临床研究人员所生成的数据驱动型假设的质量:一项案例研究。

The quality of data-driven hypotheses generated by inexperienced clinical researchers: A case study.

作者信息

Ernst Mytchell A, Draghi Brooke N, Cimino James J, Patel Vimla L, Zhou Yuchun, Shubrook Jay H, De Lacalle Sonsoles, Weaver Aneesa, Liu Chang, Jing Xia

机构信息

Department of Public Health Sciences, Clemson University, Clemson, SC.

Department of Biomedical Informatics and Data Science, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL.

出版信息

medRxiv. 2024 Aug 13:2024.08.12.24311877. doi: 10.1101/2024.08.12.24311877.

DOI:10.1101/2024.08.12.24311877
PMID:39185523
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11343241/
Abstract

OBJECTIVES

We invited inexperienced clinical researchers to analyze coded health datasets and develop hypotheses. We recorded and analyzed their hypothesis generation process. All the hypotheses generated in the process were rated by the same group of seven experts by using the same metrics. This case study examines the higher quality (i.e., higher ratings) and lower quality of hypotheses and participants who generated them. We characterized the contextual factors associated with the quality of hypotheses.

METHODS

All participants (i.e., clinical researchers) completed a 2-hour study session to analyze data and generate scientific hypotheses using the think-aloud method. Participants' screen activity and audio were recorded and transcribed. These transcriptions were used to measure the time used to generate each hypothesis and to code cognitive events (i.e., cognitive activities used when generating hypotheses, for example, "Seeking for Connection" describes an attempt to draw connections between data points). The hypothesis ratings by the expert panel were used as the quality of the hypotheses during the analysis. We analyzed the factors associated with (1) the five highest and (2) five lowest rated hypotheses and (3) the participants who generated them, including the number of hypotheses per participant, the validity of those hypotheses, the number of cognitive events used for each hypothesis, as well as the participant's research experience and basic demographics.

RESULTS

Participants who generated the five highest-rated hypotheses used similar lengths of time (difference 3:03), whereas those who generated the five lowest-rated hypotheses used more varying lengths of time (difference 7:13). Participants who generated the five highest-rated hypotheses also utilized slightly fewer cognitive events on average compared to the five lowest-rated hypotheses (4 per hypothesis vs. 4.8 per hypothesis). When we examine the participants (who generated the five highest and five lowest hypotheses) and their total hypotheses generated during the 2-hour study sessions, the participants with the five highest-rated hypotheses again had a shorter range of time per hypothesis on average (0:03:34 vs. 0:07:17). They (with the five highest ratings) used fewer cognitive events per hypothesis (3.498 vs. 4.626). They (with the five highest ratings) also had a higher percentage of valid rate (75.51% vs. 63.63%) and generally had more experience with clinical research.

CONCLUSION

The quality of the hypotheses was shown to be associated with the time taken to generate them, where too long or too short time to generate hypotheses appears to be negatively associated with the hypotheses' quality ratings. Also, having more experience seems to positively correlate with higher ratings of hypotheses and higher valid rates. Validity is a quality dimension used by the expert panel during rating. However, we acknowledge that our results are anecdotal. The effect may not be simply linear, and future research is necessary. These results underscore the multi-factor nature of hypothesis generation.

摘要

目的

我们邀请了缺乏经验的临床研究人员分析编码后的健康数据集并提出假设。我们记录并分析了他们的假设生成过程。在此过程中生成的所有假设均由同一组七名专家使用相同的指标进行评分。本案例研究考察了假设的较高质量(即较高评分)和较低质量,以及提出这些假设的参与者。我们描述了与假设质量相关的背景因素。

方法

所有参与者(即临床研究人员)完成了一个为时两小时的研究环节,采用出声思维法分析数据并提出科学假设。记录并转录了参与者的屏幕活动和音频。这些转录内容用于测量生成每个假设所用的时间,并对认知事件进行编码(即生成假设时使用的认知活动,例如,“寻求联系”描述了尝试在数据点之间建立联系的行为)。在分析过程中,专家小组的假设评分被用作假设的质量指标。我们分析了与(1)评分最高的五个和(2)评分最低的五个假设以及(3)提出这些假设的参与者相关的因素,包括每个参与者提出的假设数量、这些假设的有效性、用于每个假设的认知事件数量,以及参与者的研究经验和基本人口统计学信息。

结果

提出评分最高的五个假设的参与者所用时间长度相似(相差3分03秒),而提出评分最低的五个假设的参与者所用时间差异较大(相差7分13秒)。与评分最低的五个假设相比,提出评分最高的五个假设的参与者平均使用的认知事件也略少(每个假设4个 vs. 每个假设4.8个)。当我们考察提出评分最高和最低的五个假设的参与者以及他们在两小时研究环节中提出的假设总数时,提出评分最高的五个假设的参与者每个假设平均用时范围再次较短(0:03:34 vs. 0:07:17)。他们(评分最高的)每个假设使用的认知事件较少(3.498个 vs. 4.626个)。他们(评分最高的)有效率百分比也更高(75.51% vs. 63.63%),并且一般临床研究经验更丰富。

结论

研究表明,假设的质量与生成假设所用的时间有关,生成假设的时间过长或过短似乎都与假设的质量评分呈负相关。此外,经验更丰富似乎与假设的较高评分和较高有效率呈正相关。有效性是专家小组在评分时使用的一个质量维度。然而,我们承认我们的结果只是轶事性的。这种影响可能并非简单的线性关系,未来有必要进行进一步研究。这些结果强调了假设生成的多因素性质。

相似文献

1
The quality of data-driven hypotheses generated by inexperienced clinical researchers: A case study.缺乏经验的临床研究人员所生成的数据驱动型假设的质量:一项案例研究。
medRxiv. 2024 Aug 13:2024.08.12.24311877. doi: 10.1101/2024.08.12.24311877.
2
A secondary data analysis on hypotheses generated by inexperienced clinical researchers: Cases from a randomized controlled study.对缺乏经验的临床研究人员提出的假设进行的二次数据分析:来自一项随机对照研究的案例
Health Informatics J. 2025 Jul-Sep;31(3):14604582251353587. doi: 10.1177/14604582251353587. Epub 2025 Aug 6.
3
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
4
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
5
Sexual Harassment and Prevention Training性骚扰与预防培训
6
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
7
Personally tailored activities for improving psychosocial outcomes for people with dementia in long-term care. 为长期护理机构中的痴呆症患者制定个性化的活动以改善其心理社会结局。
Cochrane Database Syst Rev. 2023 Mar 13;3(3):CD009812. doi: 10.1002/14651858.CD009812.pub3.
8
Surgical interventions for treating extracapsular hip fractures in older adults: a network meta-analysis.老年人髋关节囊外骨折的手术干预:一项网络荟萃分析。
Cochrane Database Syst Rev. 2022 Feb 10;2(2):CD013405. doi: 10.1002/14651858.CD013405.pub2.
9
Ultrasound guidance versus anatomical landmarks for internal jugular vein catheterization.超声引导与解剖标志用于颈内静脉置管的比较
Cochrane Database Syst Rev. 2015 Jan 9;1(1):CD006962. doi: 10.1002/14651858.CD006962.pub2.
10
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.

本文引用的文献

1
Using think-aloud protocol to identify cognitive events while generating data-driven scientific hypotheses by inexperienced clinical researchers.使用出声思维法来识别经验不足的临床研究人员在生成数据驱动的科学假设时的认知事件。
AMIA Annu Symp Proc. 2025 May 22;2024:561-570. eCollection 2024.
2
Development, validation, and usage of metrics to evaluate the quality of clinical research hypotheses.用于评估临床研究假设质量的指标的开发、验证及应用。
BMC Med Res Methodol. 2025 Jan 16;25(1):11. doi: 10.1186/s12874-025-02460-1.
3
Data-Driven Hypothesis Generation in Clinical Research: What We Learned from a Human Subject Study?
临床研究中数据驱动的假设生成:我们从一项人体研究中学到了什么?
Med Res Arch. 2024 Feb;12(2). doi: 10.18103/mra.v12i2.5132. Epub 2024 Feb 28.
4
Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools.经验不足的临床研究人员中数据驱动的假设生成:二次数据分析与可视化(VIADS)及其他工具的比较
J Clin Transl Sci. 2024 Jan 4;8(1):e13. doi: 10.1017/cts.2023.708. eCollection 2024.
5
A Visual Analytic Tool (VIADS) to Assist the Hypothesis Generation Process in Clinical Research: Mixed Methods Usability Study.一种辅助临床研究中假设生成过程的视觉分析工具(VIADS):混合方法可用性研究。
JMIR Hum Factors. 2023 Apr 27;10:e44644. doi: 10.2196/44644.
6
The Roles of a Secondary Data Analytics Tool and Experience in Scientific Hypothesis Generation in Clinical Research: Protocol for a Mixed Methods Study.二次数据分析工具和经验在临床研究科学假设生成中的作用:一项混合方法研究方案
JMIR Res Protoc. 2022 Jul 18;11(7):e39414. doi: 10.2196/39414.
7
Remembrance of inferences past: Amortization in human hypothesis generation.回忆过去的推断:人类假设生成中的摊销。
Cognition. 2018 Sep;178:67-81. doi: 10.1016/j.cognition.2018.04.017. Epub 2018 May 21.
8
MOLIERE: Automatic Biomedical Hypothesis Generation System.莫里哀:自动生物医学假设生成系统。
KDD. 2017 Aug;2017:1633-1642. doi: 10.1145/3097983.3098057.
9
The effects of subjective time pressure and individual differences on hypotheses generation and action prioritization in police investigations.主观时间压力和个体差异对警察调查中假设生成和行动优先级的影响。
J Exp Psychol Appl. 2013 Mar;19(1):83-93. doi: 10.1037/a0032148.
10
Hypothesis generation and the coordination of theory and evidence in novice diagnostic reasoning.新手诊断推理中的假设生成以及理论与证据的协调
Med Decis Making. 1993 Jul-Sep;13(3):198-211. doi: 10.1177/0272989X9301300305.