Observer-study-based approaches to quantitatively evaluate the realism of synthetic medical images.

Affiliations

Department of Biomedical Engineering, Washington University, St. Louis, MO 63130, United States of America.

Mallinckrodt Institute of Radiology, Washington University School of Medicine, St. Louis, MO 63110, United States of America.

Publication information

Phys Med Biol. 2023 Mar 21;68(7):074001. doi: 10.1088/1361-6560/acc0ce.

DOI: 10.1088/1361-6560/acc0ce
PMID: 36863028
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10411234/
Abstract

Synthetic images generated by simulation studies have a well-recognized role in developing and evaluating imaging systems and methods. However, for clinically relevant development and evaluation, the synthetic images must be clinically realistic and, ideally, have the same distribution as that of clinical images. Thus, mechanisms that can quantitatively evaluate this clinical realism and, ideally, the similarity in distributions of the real and synthetic images, are much needed.

We investigated two observer-study-based approaches to quantitatively evaluate the clinical realism of synthetic images. In the first approach, we presented a theoretical formalism for the use of an ideal-observer study to quantitatively evaluate the similarity in distributions between the real and synthetic images. This theoretical formalism provides a direct relationship between the area under the receiver operating characteristic curve, AUC, for an ideal observer and the distributions of real and synthetic images. The second approach is based on the use of expert-human-observer studies to quantitatively evaluate the realism of synthetic images. In this approach, we developed a web-based software to conduct two-alternative forced-choice (2-AFC) experiments with expert human observers. The usability of this software was evaluated by conducting a system usability scale (SUS) survey with seven expert human readers and five observer-study designers. Further, we demonstrated the application of this software to evaluate a stochastic and physics-based image-synthesis technique for oncologic positron emission tomography (PET). In this evaluation, the 2-AFC study with our software was performed by six expert human readers, who were highly experienced in reading PET scans, with years of expertise ranging from 7 to 40 years (median: 12 years, average: 20.4 years).

In the ideal-observer-study-based approach, we theoretically demonstrated that the AUC for an ideal observer can be expressed, to an excellent approximation, by the Bhattacharyya distance between the distributions of the real and synthetic images. This relationship shows that a decrease in the ideal-observer AUC indicates a decrease in the distance between the two image distributions. Moreover, a lower bound of ideal-observer AUC = 0.5 implies that the distributions of synthetic and real images exactly match. For the expert-human-observer-study-based approach, our software for performing the 2-AFC experiments is available at https://apps.mir.wustl.edu/twoafc. Results from the SUS survey demonstrate that the web application is very user friendly and accessible. As a secondary finding, evaluation of a stochastic and physics-based PET image-synthesis technique using our software showed that expert human readers had limited ability to distinguish the real images from the synthetic images.

This work addresses the important need for mechanisms to quantitatively evaluate the clinical realism of synthetic images. The mathematical treatment in this paper shows that quantifying the similarity in the distribution of real and synthetic images is theoretically possible by using an ideal-observer-study-based approach. Our developed software provides a platform for designing and performing 2-AFC experiments with human observers in a highly accessible, efficient, and secure manner. Additionally, our results on the evaluation of the stochastic and physics-based image-synthesis technique motivate the application of this technique to develop and evaluate a wide array of PET imaging methods.
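
To make the link between ideal-observer AUC and distribution distance concrete, the sketch below works through a deliberately simple special case that is not taken from the paper: two univariate Gaussians with equal variance standing in for the "real" and "synthetic" image distributions. In that toy case the Bhattacharyya distance is D_B = (Δμ)² / (8σ²) and the ideal-observer AUC equals 1/2 + 1/2·erf(√(2·D_B)) exactly. The general relationship derived in the paper is not reproduced here, and the function names are hypothetical.

```python
import math


def bhattacharyya_equal_var(mu_real, mu_synth, sigma):
    """Bhattacharyya distance between N(mu_real, sigma^2) and N(mu_synth, sigma^2).

    Assumption: both distributions are 1-D Gaussians sharing one sigma.
    """
    return (mu_real - mu_synth) ** 2 / (8.0 * sigma ** 2)


def ideal_observer_auc(d_b):
    """Ideal-observer AUC in the equal-variance Gaussian toy case.

    Equivalent to Phi(d'/sqrt(2)) with d' = |mu_real - mu_synth| / sigma.
    """
    return 0.5 + 0.5 * math.erf(math.sqrt(2.0 * d_b))


if __name__ == "__main__":
    # As the synthetic mean approaches the real mean, D_B -> 0 and AUC -> 0.5.
    for shift in (2.0, 1.0, 0.5, 0.0):
        d_b = bhattacharyya_equal_var(0.0, shift, 1.0)
        print(f"shift={shift:.1f}  D_B={d_b:.4f}  AUC={ideal_observer_auc(d_b):.4f}")
```

In this toy setting the AUC decreases monotonically with the distance and reaches 0.5 only when the two distributions coincide, which mirrors the qualitative statements in the abstract.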

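
A 2-AFC realism study of the kind described in the abstract can be summarized with very little arithmetic. The sketch below assumes that each trial shows one real and one synthetic image and that a trial is scored correct when the reader picks the real one. The fraction of correct trials is then an estimate of the reader's AUC, and a value near 0.5 means the reader performs at chance. The counts in the usage lines are hypothetical and are not results from the paper; the function names are illustrative and unrelated to the authors' web application.

```python
import math


def wilson_interval(correct, trials, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 is roughly 95%)."""
    if trials <= 0:
        raise ValueError("trials must be positive")
    p_hat = correct / trials
    denom = 1.0 + z * z / trials
    centre = (p_hat + z * z / (2.0 * trials)) / denom
    half = z * math.sqrt(
        p_hat * (1.0 - p_hat) / trials + z * z / (4.0 * trials * trials)
    ) / denom
    return centre - half, centre + half


def summarize_2afc(correct, trials):
    """Fraction correct, its Wilson interval, and whether chance (0.5) lies inside."""
    low, high = wilson_interval(correct, trials)
    return {
        "fraction_correct": correct / trials,
        "ci_low": low,
        "ci_high": high,
        "chance_in_ci": low <= 0.5 <= high,
    }


if __name__ == "__main__":
    # Hypothetical reader: 27 of 50 trials correct.
    print(summarize_2afc(27, 50))
    # Hypothetical reader who clearly tells real from synthetic: 45 of 50.
    print(summarize_2afc(45, 50))
```

The Wilson interval is used here because it behaves better than the normal approximation for the small trial counts typical of reader studies; pooling across several readers would need a model that accounts for reader variability, which this sketch does not attempt.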


