• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类计算作为基于网络的指南制定小组中循证知识转移的新方法:概念验证随机对照试验

Human computation as a new method for evidence-based knowledge transfer in Web-based guideline development groups: proof of concept randomized controlled trial.

作者信息

Heselmans Annemie, Aertgeerts Bert, Donceel Peter, Van de Velde Stijn, Vanbrabant Peter, Ramaekers Dirk

机构信息

School of Public Health and Primary Care, Katholieke Universiteit Leuven, Leuven, Belgium.

出版信息

J Med Internet Res. 2013 Jan 17;15(1):e8. doi: 10.2196/jmir.2055.

DOI:10.2196/jmir.2055
PMID:23328663
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3636290/
Abstract

BACKGROUND

Guideline developers use different consensus methods to develop evidence-based clinical practice guidelines. Previous research suggests that existing guideline development techniques are subject to methodological problems and are logistically demanding. Guideline developers welcome new methods that facilitate a methodologically sound decision-making process. Systems that aggregate knowledge while participants play a game are one class of human computation applications. Researchers have already proven that these games with a purpose are effective in building common sense knowledge databases.

OBJECTIVE

We aimed to evaluate the feasibility of a new consensus method based on human computation techniques compared to an informal face-to-face consensus method.

METHODS

We set up a randomized design to study 2 different methods for guideline development within a group of advanced students completing a master of nursing and obstetrics. Students who participated in the trial were enrolled in an evidence-based health care course. We compared the Web-based method of human-based computation (HC) with an informal face-to-face consensus method (IC). We used 4 clinical scenarios of lower back pain as the subject of the consensus process. These scenarios concerned the following topics: (1) medical imaging, (2) therapeutic options, (3) drugs use, and (4) sick leave. Outcomes were expressed as the amount of group (dis)agreement and the concordance of answers with clinical evidence. We estimated within-group and between-group effect sizes by calculating Cohen's d. We calculated within-group effect sizes as the absolute difference between the outcome value at round 3 and the baseline outcome value, divided by the pooled standard deviation. We calculated between-group effect sizes as the absolute difference between the mean change in outcome value across rounds in HC and the mean change in outcome value across rounds in IC, divided by the pooled standard deviation. We analyzed statistical significance of within-group changes between round 1 and round 3 using the Wilcoxon signed rank test. We assessed the differences between the HC and IC groups using Mann-Whitney U tests. We used a Bonferroni adjusted alpha level of .025 in all statistical tests. We performed a thematic analysis to explore participants' arguments during group discussion. Participants completed a satisfaction survey at the end of the consensus process.

RESULTS

Of the 135 students completing a master of nursing and obstetrics, 120 participated in the experiment. We formed 8 HC groups (n=64) and 7 IC groups (n=56). The between-group comparison demonstrated that the human computation groups obtained a greater improvement in evidence scores compared to the IC groups, although the difference was not statistically significant. The between-group effect size was 0.56 (P=.30) for the medical imaging scenario, 0.07 (P=.97) for the therapeutic options scenario, and 0.89 (P=.11) for the drug use scenario. We found no significant differences in improvement in the degree of agreement between HC and IC groups. Between-group comparisons revealed that the HC groups showed greater improvement in degree of agreement for the medical imaging scenario (d=0.46, P=.37) and the drug use scenario (d=0.31, P=.59). Very few evidence arguments (6%) were quoted during informal group discussions.

CONCLUSIONS

Overall, the use of the IC method was appropriate as long as the evidence supported participants' beliefs or usual practice, or when the availability of the evidence was sparse. However, when some controversy about the evidence existed, the HC method outperformed the IC method. The findings of our study illustrate the importance of the choice of the consensus method in guideline development. Human computation could be an acceptable methodology for guideline development specifically for scenarios in which the evidence shows no resonance with participants' beliefs. Future research is needed to confirm the results of this study and to establish practical significance in a controlled setting of multidisciplinary guideline panels during real-life guideline development.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/90e5002f7398/jmir_v15i1e8_fig12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/65bd6a09c6d0/jmir_v15i1e8_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/4fb6a86fd5f1/jmir_v15i1e8_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/0e8a665616ad/jmir_v15i1e8_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/f113c3cff1de/jmir_v15i1e8_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/f975853e0d37/jmir_v15i1e8_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/496ad1a248a4/jmir_v15i1e8_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/a2ab41a77f2e/jmir_v15i1e8_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/caabc69a3452/jmir_v15i1e8_fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/320d8b730c6b/jmir_v15i1e8_fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/41b699456961/jmir_v15i1e8_fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/a69020075ede/jmir_v15i1e8_fig11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/90e5002f7398/jmir_v15i1e8_fig12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/65bd6a09c6d0/jmir_v15i1e8_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/4fb6a86fd5f1/jmir_v15i1e8_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/0e8a665616ad/jmir_v15i1e8_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/f113c3cff1de/jmir_v15i1e8_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/f975853e0d37/jmir_v15i1e8_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/496ad1a248a4/jmir_v15i1e8_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/a2ab41a77f2e/jmir_v15i1e8_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/caabc69a3452/jmir_v15i1e8_fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/320d8b730c6b/jmir_v15i1e8_fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/41b699456961/jmir_v15i1e8_fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/a69020075ede/jmir_v15i1e8_fig11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2246/3636290/90e5002f7398/jmir_v15i1e8_fig12.jpg
摘要

背景

指南制定者使用不同的共识方法来制定基于证据的临床实践指南。先前的研究表明,现有的指南制定技术存在方法学问题,且在后勤方面要求较高。指南制定者欢迎有助于进行方法学上合理的决策过程的新方法。在参与者玩游戏时汇总知识的系统是一类人类计算应用程序。研究人员已经证明,这些有目的的游戏在构建常识知识库方面是有效的。

目的

我们旨在评估一种基于人类计算技术的新共识方法与非正式面对面共识方法相比的可行性。

方法

我们设置了一项随机设计,以研究在一组完成护理学与产科学硕士学位的高年级学生中用于指南制定的2种不同方法。参与试验的学生参加了一门基于证据的医疗保健课程。我们将基于网络的人类计算方法(HC)与非正式面对面共识方法(IC)进行了比较。我们使用4个下背痛的临床场景作为共识过程的主题。这些场景涉及以下主题:(1)医学成像,(2)治疗选择,(3)药物使用,以及(4)病假。结果以小组(不)一致的程度以及答案与临床证据的一致性来表示。我们通过计算科恩d值来估计组内和组间效应量。我们将组内效应量计算为第3轮结果值与基线结果值之间的绝对差值,除以合并标准差。我们将组间效应量计算为HC组各轮结果值平均变化与IC组各轮结果值平均变化之间的绝对差值,除以合并标准差。我们使用威尔科克森符号秩检验分析第1轮和第3轮之间组内变化的统计学显著性。我们使用曼-惠特尼U检验评估HC组和IC组之间的差异。在所有统计检验中,我们使用经邦费罗尼校正的α水平为0.025。我们进行了主题分析,以探讨小组讨论期间参与者的论点。参与者在共识过程结束时完成了一项满意度调查。

结果

在135名完成护理学与产科学硕士学位的学生中,120名参与了实验。我们组建了8个HC组(n = 64)和7个IC组(n = 56)。组间比较表明,与IC组相比,人类计算组在证据得分方面有更大的提高,尽管差异无统计学意义。医学成像场景的组间效应量为0.56(P = 0.30),治疗选择场景为0.07(P = 0.97),药物使用场景为0.89(P = 0.11)。我们发现HC组和IC组在一致程度的提高方面没有显著差异。组间比较显示,HC组在医学成像场景(d = 0.46,P = 0.37)和药物使用场景(d = 0.31,P = 0.59)的一致程度上有更大的提高。在非正式小组讨论期间,很少引用证据性论据(6%)。

结论

总体而言,只要证据支持参与者的信念或常规做法,或者证据的可得性较少,使用IC方法就是合适的。然而,当存在关于证据的一些争议时,HC方法优于IC方法。我们研究的结果说明了在指南制定中选择共识方法的重要性。人类计算可能是一种可接受的指南制定方法,特别是对于证据与参与者信念不相符的场景。需要未来的研究来证实本研究的结果,并在现实生活中的指南制定过程中在多学科指南小组的受控环境中确定其实际意义。

相似文献

1
Human computation as a new method for evidence-based knowledge transfer in Web-based guideline development groups: proof of concept randomized controlled trial.人类计算作为基于网络的指南制定小组中循证知识转移的新方法:概念验证随机对照试验
J Med Internet Res. 2013 Jan 17;15(1):e8. doi: 10.2196/jmir.2055.
2
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
3
The effectiveness of internet-based e-learning on clinician behavior and patient outcomes: a systematic review protocol.基于互联网的电子学习对临床医生行为和患者结局的有效性:一项系统评价方案。
JBI Database System Rev Implement Rep. 2015 Jan;13(1):52-64. doi: 10.11124/jbisrir-2015-1919.
4
5
Evaluating the use of the appropriateness method in the Agency for Health Care Policy and Research Clinical Practice Guideline Development process.评估卫生保健政策与研究机构临床实践指南制定过程中适宜性方法的使用情况。
Health Serv Res. 1996 Oct;31(4):453-68.
6
Intraarticular Facet Injections for Low Back Pain: Design Considerations, Consensus Methodology to Develop the Protocol for a Randomized Controlled Trial.用于腰痛的关节内小关节注射:设计考量、制定随机对照试验方案的共识方法
Pain Physician. 2015 Sep-Oct;18(5):473-93.
7
Knowledge transfer in family physicians managing patients with acute low back pain: a prospective randomized control trial.家庭医生管理急性下腰痛患者的知识转移:一项前瞻性随机对照试验。
Spine J. 2006 May-Jun;6(3):282-8. doi: 10.1016/j.spinee.2005.10.008.
8
Similar Outcomes of Web-Based and Face-to-Face Training of the GRADE Approach for the Certainty of Evidence: Randomized Controlled Trial.基于网络的和面对面的 GRADE 方法培训对证据确定性的效果比较:随机对照试验。
J Med Internet Res. 2023 Jun 6;25:e43928. doi: 10.2196/43928.
9
Recommendations from the international evidence-based guideline for the assessment and management of polycystic ovary syndrome.国际循证指南关于多囊卵巢综合征评估和管理的推荐意见。
Fertil Steril. 2018 Aug;110(3):364-379. doi: 10.1016/j.fertnstert.2018.05.004. Epub 2018 Jul 19.
10

本文引用的文献

1
CONSORT-EHEALTH: improving and standardizing evaluation reports of Web-based and mobile health interventions.CONSORT-EHEALTH:改进并规范基于网络和移动健康干预措施的评估报告。
J Med Internet Res. 2011 Dec 31;13(4):e126. doi: 10.2196/jmir.1923.
2
A method for studying decision-making by guideline development groups.一种用于研究指南制定小组决策的方法。
Implement Sci. 2009 Aug 5;4:48. doi: 10.1186/1748-5908-4-48.
3
Why guideline-making requires reform.为何制定指南需要改革。
JAMA. 2009 Jan 28;301(4):429-31. doi: 10.1001/jama.2009.15.
4
GRADE: an emerging consensus on rating quality of evidence and strength of recommendations.GRADE:关于证据质量评级和推荐强度的新共识。
BMJ. 2008 Apr 26;336(7650):924-6. doi: 10.1136/bmj.39489.470347.AD.
5
An experimental study of the influence of individual participant characteristics on formal consensus development.个体参与者特征对正式共识形成影响的实验研究
Int J Technol Assess Health Care. 2007 Winter;23(1):108-15. doi: 10.1017/S0266462307051641.
6
Improving the use of research evidence in guideline development: 3. Group composition and consultation process.改善研究证据在指南制定中的应用:3. 小组构成与咨询过程。
Health Res Policy Syst. 2006 Nov 29;4:15. doi: 10.1186/1478-4505-4-15.
7
A comparison of two consensus methods for classifying morbidities in a single professional group showed the same outcomes.对单一专业群体中疾病分类的两种共识方法进行比较,结果显示相同。
J Clin Epidemiol. 2006 Nov;59(11):1169-73. doi: 10.1016/j.jclinepi.2006.02.016. Epub 2006 Jul 26.
8
A comparison of formal consensus methods used for developing clinical guidelines.用于制定临床指南的正式共识方法比较。
J Health Serv Res Policy. 2006 Oct;11(4):218-24. doi: 10.1258/135581906778476553.
9
What's the evidence that NICE guidance has been implemented? Results from a national evaluation using time series analysis, audit of patients' notes, and interviews.有哪些证据表明英国国家卫生与临床优化研究所(NICE)的指南已得到实施?一项使用时间序列分析、患者病历审核和访谈的全国性评估结果。
BMJ. 2004 Oct 30;329(7473):999. doi: 10.1136/bmj.329.7473.999.
10
An experimental study of determinants of group judgments in clinical guideline development.临床指南制定中群体判断决定因素的实验研究
Lancet. 2004;364(9432):429-37. doi: 10.1016/S0140-6736(04)16766-4.