• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

众包患者照片中慢性移植物抗宿主病的皮肤界限划分:训练与性能研究

Crowdsourcing Skin Demarcations of Chronic Graft-Versus-Host Disease in Patient Photographs: Training Versus Performance Study.

作者信息

McNeil Andrew J, Parks Kelsey, Liu Xiaoqi, Jiang Bohan, Coco Joseph, McCool Kira, Fabbri Daniel, Duhaime Erik P, Dawant Benoit M, Tkaczyk Eric R

机构信息

Dermatology Service and Research Service, Department of Veterans Affairs, Tennessee Valley Healthcare System, Nashville, TN, United States.

Department of Dermatology, Vanderbilt University Medical Center, Nashville, TN, United States.

出版信息

JMIR Dermatol. 2023 Dec 26;6:e48589. doi: 10.2196/48589.

DOI:10.2196/48589
PMID:38147369
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10777279/
Abstract

BACKGROUND

Chronic graft-versus-host disease (cGVHD) is a significant cause of long-term morbidity and mortality in patients after allogeneic hematopoietic cell transplantation. Skin is the most commonly affected organ, and visual assessment of cGVHD can have low reliability. Crowdsourcing data from nonexpert participants has been used for numerous medical applications, including image labeling and segmentation tasks.

OBJECTIVE

This study aimed to assess the ability of crowds of nonexpert raters-individuals without any prior training for identifying or marking cGHVD-to demarcate photos of cGVHD-affected skin. We also studied the effect of training and feedback on crowd performance.

METHODS

Using a Canfield Vectra H1 3D camera, 360 photographs of the skin of 36 patients with cGVHD were taken. Ground truth demarcations were provided in 3D by a trained expert and reviewed by a board-certified dermatologist. In total, 3000 2D images (projections from various angles) were created for crowd demarcation through the DiagnosUs mobile app. Raters were split into high and low feedback groups. The performances of 4 different crowds of nonexperts were analyzed, including 17 raters per image for the low and high feedback groups, 32-35 raters per image for the low feedback group, and the top 5 performers for each image from the low feedback group.

RESULTS

Across 8 demarcation competitions, 130 raters were recruited to the high feedback group and 161 to the low feedback group. This resulted in a total of 54,887 individual demarcations from the high feedback group and 78,967 from the low feedback group. The nonexpert crowds achieved good overall performance for segmenting cGVHD-affected skin with minimal training, achieving a median surface area error of less than 12% of skin pixels for all crowds in both the high and low feedback groups. The low feedback crowds performed slightly poorer than the high feedback crowd, even when a larger crowd was used. Tracking the 5 most reliable raters from the low feedback group for each image recovered a performance similar to that of the high feedback crowd. Higher variability between raters for a given image was not found to correlate with lower performance of the crowd consensus demarcation and cannot therefore be used as a measure of reliability. No significant learning was observed during the task as more photos and feedback were seen.

CONCLUSIONS

Crowds of nonexpert raters can demarcate cGVHD images with good overall performance. Tracking the top 5 most reliable raters provided optimal results, obtaining the best performance with the lowest number of expert demarcations required for adequate training. However, the agreement amongst individual nonexperts does not help predict whether the crowd has provided an accurate result. Future work should explore the performance of crowdsourcing in standard clinical photos and further methods to estimate the reliability of consensus demarcations.

摘要

背景

慢性移植物抗宿主病(cGVHD)是异基因造血细胞移植患者长期发病和死亡的重要原因。皮肤是最常受影响的器官,对cGVHD的视觉评估可靠性可能较低。来自非专业参与者的众包数据已用于众多医学应用,包括图像标记和分割任务。

目的

本研究旨在评估非专业评分者群体(即未接受过任何识别或标记cGVHD培训的个体)对cGVHD受累皮肤照片进行划分的能力。我们还研究了培训和反馈对群体表现的影响。

方法

使用Canfield Vectra H1 3D相机,拍摄了36例cGVHD患者皮肤的360张照片。由一名经过培训的专家以3D形式提供真实边界,并由一名获得委员会认证的皮肤科医生进行审核。通过DiagnosUs移动应用程序总共创建了3000张二维图像(来自不同角度的投影)用于群体划分。评分者被分为高反馈组和低反馈组。分析了4个不同非专业群体的表现,包括低反馈组和高反馈组每张图像17名评分者、低反馈组每张图像32 - 35名评分者,以及低反馈组每张图像表现最佳的前5名评分者。

结果

在8次划分竞赛中,130名评分者被招募到高反馈组,161名被招募到低反馈组。这导致高反馈组共有54887次个体划分,低反馈组有78967次。非专业群体在经过最少培训后对cGVHD受累皮肤进行分割时总体表现良好,高反馈组和低反馈组所有群体的皮肤像素表面积误差中位数均小于12%。低反馈群体的表现略逊于高反馈群体,即使使用了更大的群体。跟踪低反馈组每张图像最可靠的5名评分者,其表现与高反馈群体相似。未发现给定图像评分者之间较高的变异性与群体共识划分的较低表现相关,因此不能将其用作可靠性的衡量标准。在任务过程中,随着看到更多照片和反馈,未观察到显著的学习效果。

结论

非专业评分者群体能够以良好的总体表现对cGVHD图像进行划分。跟踪最可靠的前5名评分者可提供最佳结果,以最少的专家划分数量进行充分训练即可获得最佳表现。然而,个体非专业人员之间的一致性无助于预测群体是否给出了准确结果。未来的工作应探索众包在标准临床照片中的表现,以及进一步估计共识划分可靠性的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/50f53aa94ca8/derma_v6i1e48589_fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/35dfef1adfa9/derma_v6i1e48589_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/aaf93c0660e3/derma_v6i1e48589_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/e3e70db1806c/derma_v6i1e48589_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/2e08eab14dca/derma_v6i1e48589_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/1ae947f8476f/derma_v6i1e48589_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/e12d9b6e5a27/derma_v6i1e48589_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/143159489bec/derma_v6i1e48589_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/50f53aa94ca8/derma_v6i1e48589_fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/35dfef1adfa9/derma_v6i1e48589_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/aaf93c0660e3/derma_v6i1e48589_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/e3e70db1806c/derma_v6i1e48589_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/2e08eab14dca/derma_v6i1e48589_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/1ae947f8476f/derma_v6i1e48589_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/e12d9b6e5a27/derma_v6i1e48589_fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/143159489bec/derma_v6i1e48589_fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbe0/10777279/50f53aa94ca8/derma_v6i1e48589_fig8.jpg

相似文献

1
Crowdsourcing Skin Demarcations of Chronic Graft-Versus-Host Disease in Patient Photographs: Training Versus Performance Study.众包患者照片中慢性移植物抗宿主病的皮肤界限划分:训练与性能研究
JMIR Dermatol. 2023 Dec 26;6:e48589. doi: 10.2196/48589.
2
Crowdsourcing to delineate skin affected by chronic graft-vs-host disease.众包划定慢性移植物抗宿主病累及的皮肤。
Skin Res Technol. 2019 Jul;25(4):572-577. doi: 10.1111/srt.12688. Epub 2019 Feb 20.
3
Agreement Between Experts and an Untrained Crowd for Identifying Dermoscopic Features Using a Gamified App: Reader Feasibility Study.专家与未经培训的人群使用游戏化应用程序识别皮肤镜特征的一致性:读者可行性研究
JMIR Med Inform. 2023 Jan 18;11:e38412. doi: 10.2196/38412.
4
Gamified Crowdsourcing as a Novel Approach to Lung Ultrasound Data Set Labeling: Prospective Analysis.游戏化众包作为一种新型的肺部超声数据集标注方法:前瞻性分析。
J Med Internet Res. 2024 Jul 4;26:e51397. doi: 10.2196/51397.
5
Crowds Replicate Performance of Scientific Experts Scoring Phylogenetic Matrices of Phenotypes.群体复制科学专家对表型系统发育矩阵评分的表现。
Syst Biol. 2018 Jan 1;67(1):49-60. doi: 10.1093/sysbio/syx052.
6
Crowd-sourced assessment of technical skills: an adjunct to urology resident surgical simulation training.技术技能的众包评估:泌尿外科住院医师手术模拟训练的辅助手段
J Endourol. 2015 May;29(5):604-9. doi: 10.1089/end.2014.0616. Epub 2015 Jan 7.
7
Non-Expert Markings of Active Chronic Graft-Versus-Host Disease Photographs: Optimal Metrics of Training Effects.非专家对慢性移植物抗宿主病活动期照片的标记:训练效果的最佳度量。
J Digit Imaging. 2023 Feb;36(1):373-378. doi: 10.1007/s10278-022-00730-8. Epub 2022 Nov 7.
8
Crowdsourcing image annotation for nucleus detection and segmentation in computational pathology: evaluating experts, automated methods, and the crowd.用于计算病理学中细胞核检测与分割的众包图像标注:评估专家、自动化方法及大众标注。
Pac Symp Biocomput. 2015:294-305. doi: 10.1142/9789814644730_0029.
9
A multicenter pilot evaluation of the National Institutes of Health chronic graft-versus-host disease (cGVHD) therapeutic response measures: feasibility, interrater reliability, and minimum detectable change.一项多中心 NIH 慢性移植物抗宿主病(cGVHD)治疗反应评估的初步研究:可行性、评估者间可靠性和最小可检测变化。
Biol Blood Marrow Transplant. 2011 Nov;17(11):1619-29. doi: 10.1016/j.bbmt.2011.04.002. Epub 2011 Apr 12.
10
Crowd-Sourced Assessment of Technical Skill: A Valid Method for Discriminating Basic Robotic Surgery Skills.技术技能的众包评估:一种区分基本机器人手术技能的有效方法。
J Endourol. 2015 Nov;29(11):1295-301. doi: 10.1089/end.2015.0191. Epub 2015 Aug 24.

本文引用的文献

1
Hyperspectral imaging to accurately segment skin erythema and hyperpigmentation in cutaneous chronic graft-versus-host disease.利用高光谱成像技术准确分割皮肤慢性移植物抗宿主病的红斑和色素沉着。
J Biophotonics. 2023 Jul;16(7):e202300009. doi: 10.1002/jbio.202300009. Epub 2023 Apr 10.
2
Prognostic Value of Cutaneous Disease Severity Estimates on Survival Outcomes in Patients With Chronic Graft-vs-Host Disease.慢性移植物抗宿主病患者皮肤疾病严重程度估计对生存结局的预后价值。
JAMA Dermatol. 2023 Apr 1;159(4):393-402. doi: 10.1001/jamadermatol.2022.6624.
3
Agreement Between Experts and an Untrained Crowd for Identifying Dermoscopic Features Using a Gamified App: Reader Feasibility Study.
专家与未经培训的人群使用游戏化应用程序识别皮肤镜特征的一致性:读者可行性研究
JMIR Med Inform. 2023 Jan 18;11:e38412. doi: 10.2196/38412.
4
Non-Expert Markings of Active Chronic Graft-Versus-Host Disease Photographs: Optimal Metrics of Training Effects.非专家对慢性移植物抗宿主病活动期照片的标记:训练效果的最佳度量。
J Digit Imaging. 2023 Feb;36(1):373-378. doi: 10.1007/s10278-022-00730-8. Epub 2022 Nov 7.
5
Artificial intelligence recognition of cutaneous chronic graft-versus-host disease by a deep learning neural network.通过深度学习神经网络对皮肤慢性移植物抗宿主病进行人工智能识别。
Br J Haematol. 2022 Jun;197(6):e69-e72. doi: 10.1111/bjh.18141. Epub 2022 Mar 24.
6
Crowdsourcing human-based computation for medical image analysis: A systematic literature review.众包基于人类的医学图像分析计算:一项系统的文献综述。
Health Informatics J. 2020 Dec;26(4):2446-2469. doi: 10.1177/1460458220907435. Epub 2020 Mar 6.
7
Large-scale medical image annotation with crowd-powered algorithms.利用众包算法进行大规模医学图像标注
J Med Imaging (Bellingham). 2018 Jul;5(3):034002. doi: 10.1117/1.JMI.5.3.034002. Epub 2018 Sep 8.
8
Crowdsourcing to delineate skin affected by chronic graft-vs-host disease.众包划定慢性移植物抗宿主病累及的皮肤。
Skin Res Technol. 2019 Jul;25(4):572-577. doi: 10.1111/srt.12688. Epub 2019 Feb 20.
9
Crowdsourcing of Histological Image Labeling and Object Delineation by Medical Students.医学学生的组织病理学图像标注和目标勾画的众包
IEEE Trans Med Imaging. 2019 May;38(5):1284-1294. doi: 10.1109/TMI.2018.2883237. Epub 2018 Nov 26.
10
Changing the Way We Think About Burn Size Estimation.改变我们对烧伤面积估计的思考方式。
J Burn Care Res. 2019 Jan 1;40(1):1-11. doi: 10.1093/jbcr/iry050.