• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用拉施模型提高众包数据的一致性评分:一种诊断工具的开发与完善

Improving Consensus Scoring of Crowdsourced Data Using the Rasch Model: Development and Refinement of a Diagnostic Instrument.

作者信息

Brady Christopher John, Mudie Lucy Iluka, Wang Xueyang, Guallar Eliseo, Friedman David Steven

机构信息

Dana Center for Preventive Ophthalmology, Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, United States.

Bloomberg School of Public Health, Department of Epidemiology, Johns Hopkins University, Baltimore, MD, United States.

出版信息

J Med Internet Res. 2017 Jun 20;19(6):e222. doi: 10.2196/jmir.7984.

DOI:10.2196/jmir.7984
PMID:28634154
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5497070/
Abstract

BACKGROUND

Diabetic retinopathy (DR) is a leading cause of vision loss in working age individuals worldwide. While screening is effective and cost effective, it remains underutilized, and novel methods are needed to increase detection of DR. This clinical validation study compared diagnostic gradings of retinal fundus photographs provided by volunteers on the Amazon Mechanical Turk (AMT) crowdsourcing marketplace with expert-provided gold-standard grading and explored whether determination of the consensus of crowdsourced classifications could be improved beyond a simple majority vote (MV) using regression methods.

OBJECTIVE

The aim of our study was to determine whether regression methods could be used to improve the consensus grading of data collected by crowdsourcing.

METHODS

A total of 1200 retinal images of individuals with diabetes mellitus from the Messidor public dataset were posted to AMT. Eligible crowdsourcing workers had at least 500 previously approved tasks with an approval rating of 99% across their prior submitted work. A total of 10 workers were recruited to classify each image as normal or abnormal. If half or more workers judged the image to be abnormal, the MV consensus grade was recorded as abnormal. Rasch analysis was then used to calculate worker ability scores in a random 50% training set, which were then used as weights in a regression model in the remaining 50% test set to determine if a more accurate consensus could be devised. Outcomes of interest were the percent correctly classified images, sensitivity, specificity, and area under the receiver operating characteristic (AUROC) for the consensus grade as compared with the expert grading provided with the dataset.

RESULTS

Using MV grading, the consensus was correct in 75.5% of images (906/1200), with 75.5% sensitivity, 75.5% specificity, and an AUROC of 0.75 (95% CI 0.73-0.78). A logistic regression model using Rasch-weighted individual scores generated an AUROC of 0.91 (95% CI 0.88-0.93) compared with 0.89 (95% CI 0.86-92) for a model using unweighted scores (chi-square P value<.001). Setting a diagnostic cut-point to optimize sensitivity at 90%, 77.5% (465/600) were graded correctly, with 90.3% sensitivity, 68.5% specificity, and an AUROC of 0.79 (95% CI 0.76-0.83).

CONCLUSIONS

Crowdsourced interpretations of retinal images provide rapid and accurate results as compared with a gold-standard grading. Creating a logistic regression model using Rasch analysis to weight crowdsourced classifications by worker ability improves accuracy of aggregated grades as compared with simple majority vote.

摘要

背景

糖尿病视网膜病变(DR)是全球劳动年龄人群视力丧失的主要原因。虽然筛查有效且具有成本效益,但仍未得到充分利用,需要新的方法来提高DR的检测率。这项临床验证研究将亚马逊土耳其机器人(AMT)众包市场上志愿者提供的视网膜眼底照片诊断分级与专家提供的金标准分级进行了比较,并探讨了使用回归方法是否可以在简单多数投票(MV)之外改进众包分类共识的确定。

目的

我们研究的目的是确定是否可以使用回归方法来改进众包收集数据的共识分级。

方法

将来自Messidor公共数据集的1200张糖尿病患者的视网膜图像发布到AMT上。符合条件的众包工作者之前至少有500项已批准的任务,其之前提交工作的批准率为99%。总共招募了10名工作者将每张图像分类为正常或异常。如果一半或更多工作者判断图像为异常,则将MV共识分级记录为异常。然后使用Rasch分析在随机的50%训练集中计算工作者能力得分,然后将其用作剩余50%测试集中回归模型的权重,以确定是否可以设计出更准确的共识。感兴趣的结果是与数据集中提供的专家分级相比,共识分级的正确分类图像百分比、敏感性、特异性和受试者操作特征曲线下面积(AUROC)。

结果

使用MV分级时,75.5%的图像(906/1200)的共识是正确的,敏感性为75.5%,特异性为75.5%,AUROC为0.75(95%CI 0.73 - 0.78)。使用Rasch加权个体得分的逻辑回归模型产生的AUROC为0.91(95%CI 0.88 - 0.93),而使用未加权得分的模型的AUROC为0.89(95%CI 0.86 - 0.92)(卡方P值<0.001)。将诊断切点设定为在90%时优化敏感性,77.5%(465/600)的分级正确,敏感性为90.3%,特异性为68.5%,AUROC为0.79(95%CI 0.76 - 0.83)。

结论

与金标准分级相比,视网膜图像的众包解释提供了快速准确的结果。使用Rasch分析创建逻辑回归模型以根据工作者能力对众包分类进行加权,与简单多数投票相比提高了汇总分级的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/a0039ef45f47/jmir_v19i6e222_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/357d6df3cafb/jmir_v19i6e222_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/1f2c71331e73/jmir_v19i6e222_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/8ee79942fa5e/jmir_v19i6e222_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/6dbc8a1e0a9c/jmir_v19i6e222_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/18cef94c7029/jmir_v19i6e222_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/aa4484b9ec30/jmir_v19i6e222_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/a0039ef45f47/jmir_v19i6e222_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/357d6df3cafb/jmir_v19i6e222_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/1f2c71331e73/jmir_v19i6e222_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/8ee79942fa5e/jmir_v19i6e222_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/6dbc8a1e0a9c/jmir_v19i6e222_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/18cef94c7029/jmir_v19i6e222_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/aa4484b9ec30/jmir_v19i6e222_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f86/5497070/a0039ef45f47/jmir_v19i6e222_fig7.jpg

相似文献

1
Improving Consensus Scoring of Crowdsourced Data Using the Rasch Model: Development and Refinement of a Diagnostic Instrument.使用拉施模型提高众包数据的一致性评分:一种诊断工具的开发与完善
J Med Internet Res. 2017 Jun 20;19(6):e222. doi: 10.2196/jmir.7984.
2
Rapid grading of fundus photographs for diabetic retinopathy using crowdsourcing.利用众包对糖尿病视网膜病变眼底照片进行快速分级。
J Med Internet Res. 2014 Oct 30;16(10):e233. doi: 10.2196/jmir.3807.
3
Crowdsourcing to Evaluate Fundus Photographs for the Presence of Glaucoma.众包评估眼底照片中青光眼的存在情况。
J Glaucoma. 2017 Jun;26(6):505-510. doi: 10.1097/IJG.0000000000000660.
4
A Virtual Reading Center Model Using Crowdsourcing to Grade Photographs for Trachoma: Validation Study.基于众包的沙眼照片分级虚拟阅读中心模型:验证研究。
J Med Internet Res. 2023 Apr 6;25:e41233. doi: 10.2196/41233.
5
Crowdsourcing as a novel technique for retinal fundus photography classification: analysis of images in the EPIC Norfolk cohort on behalf of the UK Biobank Eye and Vision Consortium.众包作为一种新的眼底照相分类技术:代表英国生物银行眼与视觉联盟对 EPIC Norfolk 队列中的图像进行分析。
PLoS One. 2013 Aug 21;8(8):e71154. doi: 10.1371/journal.pone.0071154. eCollection 2013.
6
Validity of Online Screening for Autism: Crowdsourcing Study Comparing Paid and Unpaid Diagnostic Tasks.自闭症在线筛查的有效性:比较付费和免费诊断任务的众包研究
J Med Internet Res. 2019 May 23;21(5):e13668. doi: 10.2196/13668.
7
Validation of automated screening for referable diabetic retinopathy with the IDx-DR device in the Hoorn Diabetes Care System.在霍伦糖尿病护理系统中使用IDx-DR设备对可转诊糖尿病视网膜病变进行自动筛查的验证。
Acta Ophthalmol. 2018 Feb;96(1):63-68. doi: 10.1111/aos.13613. Epub 2017 Nov 27.
8
Crowdsourcing and Automated Retinal Image Analysis for Diabetic Retinopathy.用于糖尿病视网膜病变的众包与自动视网膜图像分析
Curr Diab Rep. 2017 Sep 23;17(11):106. doi: 10.1007/s11892-017-0940-x.
9
Accuracy and reliability of retinal photo grading for diabetic retinopathy: Remote graders from a developing country and standard retinal photo grader in Australia.糖尿病视网膜病变视网膜照片分级的准确性和可靠性:来自发展中国家的远程分级人员与澳大利亚的标准视网膜照片分级人员。
PLoS One. 2017 Jun 20;12(6):e0179310. doi: 10.1371/journal.pone.0179310. eCollection 2017.
10
Automated Identification of Diabetic Retinopathy Using Deep Learning.基于深度学习的糖尿病视网膜病变自动识别。
Ophthalmology. 2017 Jul;124(7):962-969. doi: 10.1016/j.ophtha.2017.02.008. Epub 2017 Mar 27.

引用本文的文献

1
A Virtual Reading Center Model Using Crowdsourcing to Grade Photographs for Trachoma: Validation Study.基于众包的沙眼照片分级虚拟阅读中心模型:验证研究。
J Med Internet Res. 2023 Apr 6;25:e41233. doi: 10.2196/41233.
2
Detection of trachoma using machine learning approaches.基于机器学习方法的沙眼检测。
PLoS Negl Trop Dis. 2022 Dec 7;16(12):e0010943. doi: 10.1371/journal.pntd.0010943. eCollection 2022 Dec.
3
Deep Learning Intervention for Health Care Challenges: Some Biomedical Domain Considerations.深度学习干预医疗保健挑战:一些生物医学领域的考虑因素。

本文引用的文献

1
Crowdsourcing to Evaluate Fundus Photographs for the Presence of Glaucoma.众包评估眼底照片中青光眼的存在情况。
J Glaucoma. 2017 Jun;26(6):505-510. doi: 10.1097/IJG.0000000000000660.
2
Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs.深度学习算法在视网膜眼底照片糖尿病视网膜病变检测中的开发与验证。
JAMA. 2016 Dec 13;316(22):2402-2410. doi: 10.1001/jama.2016.17216.
3
Improved Automated Detection of Diabetic Retinopathy on a Publicly Available Dataset Through Integration of Deep Learning.
JMIR Mhealth Uhealth. 2019 Aug 2;7(8):e11966. doi: 10.2196/11966.
4
Mapping of Crowdsourcing in Health: Systematic Review.健康领域众包的映射:系统综述
J Med Internet Res. 2018 May 15;20(5):e187. doi: 10.2196/jmir.9330.
5
ComprehENotes, an Instrument to Assess Patient Reading Comprehension of Electronic Health Record Notes: Development and Validation.ComprehENotes,一种评估患者对电子健康记录笔记阅读理解能力的工具:开发与验证
J Med Internet Res. 2018 Apr 25;20(4):e139. doi: 10.2196/jmir.9380.
通过深度学习整合在公开可用数据集上改进糖尿病视网膜病变的自动检测
Invest Ophthalmol Vis Sci. 2016 Oct 1;57(13):5200-5206. doi: 10.1167/iovs.16-19964.
4
The Evolution of Teleophthalmology Programs in the United Kingdom: Beyond Diabetic Retinopathy Screening.英国远程眼科项目的发展:超越糖尿病视网膜病变筛查
J Diabetes Sci Technol. 2016 Feb 1;10(2):308-17. doi: 10.1177/1932296816629983.
5
Long-term Comparative Effectiveness of Telemedicine in Providing Diabetic Retinopathy Screening Examinations: A Randomized Clinical Trial.远程医疗在提供糖尿病视网膜病变筛查检查方面的长期比较效果:一项随机临床试验。
JAMA Ophthalmol. 2015 May;133(5):518-25. doi: 10.1001/jamaophthalmol.2015.1.
6
Teleophthalmology for first nations clients at risk of diabetic retinopathy: a mixed methods evaluation.第一民族糖尿病视网膜病变高危患者的远程眼科服务:混合方法评估。
JMIR Med Inform. 2015 Feb 23;3(1):e10. doi: 10.2196/medinform.3872.
7
Tele-ophthalmology screening for proliferative diabetic retinopathy in urban primary care offices: an economic analysis.城市基层医疗办公室中增殖性糖尿病视网膜病变的远程眼科筛查:一项经济学分析。
Ophthalmic Surg Lasers Imaging Retina. 2014 Nov-Dec;45(6):556-61. doi: 10.3928/23258160-20141118-11.
8
Rapid grading of fundus photographs for diabetic retinopathy using crowdsourcing.利用众包对糖尿病视网膜病变眼底照片进行快速分级。
J Med Internet Res. 2014 Oct 30;16(10):e233. doi: 10.2196/jmir.3807.
9
Crowdsourcing applications for public health.众包在公共卫生中的应用。
Am J Prev Med. 2014 Feb;46(2):179-87. doi: 10.1016/j.amepre.2013.10.016.
10
Crowdsourcing as a novel technique for retinal fundus photography classification: analysis of images in the EPIC Norfolk cohort on behalf of the UK Biobank Eye and Vision Consortium.众包作为一种新的眼底照相分类技术:代表英国生物银行眼与视觉联盟对 EPIC Norfolk 队列中的图像进行分析。
PLoS One. 2013 Aug 21;8(8):e71154. doi: 10.1371/journal.pone.0071154. eCollection 2013.