从多机构队列中为病理学家注释数据集确定病例优先级。

Prioritizing cases from a multi-institutional cohort for a dataset of pathologist annotations.

作者信息

Garcia Victor, Gardecki Emma, Jou Stephanie, Li Xiaoxian, Shroyer Kenneth R, Saltz Joel, Acs Balazs, Elfer Katherine, Lennerz Jochen, Salgado Roberto, Gallas Brandon D

机构信息

U.S. Food and Drug Administration, Center for Devices and Radiological Health, Office of Science and Engineering Laboratories, Division of Imaging, Diagnostics, and Software Reliability, Silver Spring, MD, United States of America.

Department of Pathology and Laboratory Medicine, Emory University, Atlanta, GA, United States of America.

出版信息

J Pathol Inform. 2024 Nov 16;16:100411. doi: 10.1016/j.jpi.2024.100411. eCollection 2025 Jan.

DOI:10.1016/j.jpi.2024.100411

PMID:39720416

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11667696/

Abstract

OBJECTIVE

With the increasing energy surrounding the development of artificial intelligence and machine learning (AI/ML) models, the use of the same external validation dataset by various developers allows for a direct comparison of model performance. Through our High Throughput Truthing project, we are creating a validation dataset for AI/ML models trained in the assessment of stromal tumor-infiltrating lymphocytes (sTILs) in triple negative breast cancer (TNBC).

MATERIALS AND METHODS

We obtained clinical metadata for hematoxylin and eosin-stained glass slides and corresponding scanned whole slide images (WSIs) of TNBC core biopsies from two US academic medical centers. We selected regions of interest (ROIs) from the WSIs to target regions with various tissue morphologies and sTILs densities. Given the selected ROIs, we implemented a hierarchical rank-sort method for case prioritization.

RESULTS

We received 122 glass slides and clinical metadata on 105 unique patients with TNBC. All received cases were female, and the mean age was 63.44 years. 60% of all cases were White patients, and 38.1% were Black or African American. After case prioritization, the skewness of the sTILs density distribution improved from 0.60 to 0.46 with a corresponding increase in the entropy of the sTILs density bins from 1.20 to 1.24. We retained cases with less prevalent metadata elements.

CONCLUSION

This method allows us to prioritize underrepresented subgroups based on important clinical factors. In this manuscript, we discuss how we sourced the clinical metadata, selected ROIs, and developed our approach to prioritizing cases for inclusion in our pivotal study.

摘要

目的

随着围绕人工智能和机器学习（AI/ML）模型开发的热度不断上升，不同开发者使用相同的外部验证数据集能够直接比较模型性能。通过我们的高通量真值标注项目，我们正在创建一个用于在三阴性乳腺癌（TNBC）基质肿瘤浸润淋巴细胞（sTILs）评估中训练的AI/ML模型的验证数据集。

材料与方法

我们从两个美国学术医疗中心获取了苏木精和伊红染色玻璃幻灯片的临床元数据以及TNBC核心活检对应的全切片扫描图像（WSIs）。我们从WSIs中选择感兴趣区域（ROIs），以针对具有不同组织形态和sTILs密度的区域。鉴于所选的ROIs，我们实施了一种分层排序方法来对病例进行优先级排序。

结果

我们收到了122张玻璃幻灯片和105例TNBC独特患者的临床元数据。所有收到的病例均为女性，平均年龄为63.44岁。所有病例中有60%为白人患者，38.1%为黑人或非裔美国人。在病例优先级排序后，sTILs密度分布的偏度从0.60改善到0.46，sTILs密度区间的熵相应地从1.20增加到1.24。我们保留了具有较少常见元数据元素的病例。

结论

该方法使我们能够根据重要的临床因素对代表性不足的亚组进行优先级排序。在本手稿中，我们讨论了我们如何获取临床元数据、选择ROIs以及开发我们的方法来对纳入关键研究的病例进行优先级排序。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a85/11667696/85b2c8244c3b/gr1.jpg

相似文献

Prioritizing cases from a multi-institutional cohort for a dataset of pathologist annotations.从多机构队列中为病理学家注释数据集确定病例优先级。

J Pathol Inform. 2024 Nov 16;16:100411. doi: 10.1016/j.jpi.2024.100411. eCollection 2025 Jan.

Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer.为病理学家开发培训材料，以提供乳腺癌肿瘤浸润淋巴细胞的机器学习验证数据。

Cancers (Basel). 2022 May 17;14(10):2467. doi: 10.3390/cancers14102467.

A Pathologist-Annotated Dataset for Validating Artificial Intelligence: A Project Description and Pilot Study.一个用于验证人工智能的病理学家注释数据集：项目描述与初步研究

J Pathol Inform. 2021 Nov 15;12:45. doi: 10.4103/jpi.jpi_83_20. eCollection 2021.

Pilot study to evaluate tools to collect pathologist annotations for validating machine learning algorithms.评估用于收集病理学家注释以验证机器学习算法的工具的初步研究。

J Med Imaging (Bellingham). 2022 Jul;9(4):047501. doi: 10.1117/1.JMI.9.4.047501. Epub 2022 Jul 27.

Development and validation of artificial intelligence-based prescreening of large-bowel biopsies taken in the UK and Portugal: a retrospective cohort study.基于人工智能的英国和葡萄牙大结肠活检预筛查的开发和验证：一项回顾性队列研究。

Lancet Digit Health. 2023 Nov;5(11):e786-e797. doi: 10.1016/S2589-7500(23)00148-6.

Artificial intelligence-based digital scores of stromal tumour-infiltrating lymphocytes and tumour-associated stroma predict disease-specific survival in triple-negative breast cancer.基于人工智能的基质肿瘤浸润淋巴细胞和肿瘤相关基质数字评分可预测三阴性乳腺癌的疾病特异性生存情况。

J Pathol. 2023 May;260(1):32-42. doi: 10.1002/path.6061. Epub 2023 Feb 24.

Evaluation of tumour infiltrating lymphocytes in luminal breast cancer using artificial intelligence.利用人工智能评估腔面型乳腺癌中的肿瘤浸润淋巴细胞。

Br J Cancer. 2023 Nov;129(11):1747-1758. doi: 10.1038/s41416-023-02451-3. Epub 2023 Sep 30.

Tumor-Infiltrating Lymphocytes in Patients With Stage I Triple-Negative Breast Cancer Untreated With Chemotherapy.未经化疗治疗的 I 期三阴性乳腺癌患者的肿瘤浸润淋巴细胞。

JAMA Oncol. 2024 Aug 1;10(8):1077-1086. doi: 10.1001/jamaoncol.2024.1917.

Interobserver variability in the assessment of stromal tumor-infiltrating lymphocytes (sTILs) in triple-negative invasive breast carcinoma influences the association with pathological complete response: the IVITA study.三阴性浸润性乳腺癌中评估间质肿瘤浸润淋巴细胞（sTILs）的观察者间变异性影响与病理完全缓解的相关性：IVITA 研究。

Mod Pathol. 2021 Dec;34(12):2130-2140. doi: 10.1038/s41379-021-00865-z. Epub 2021 Jul 3.

BI-RADS Ultrasound Lexicon Descriptors and Stromal Tumor-Infiltrating Lymphocytes in Triple-Negative Breast Cancer.BI-RADS 超声词汇描述符与三阴性乳腺癌间质肿瘤浸润淋巴细胞。

Acad Radiol. 2022 Jan;29 Suppl 1(Suppl 1):S35-S41. doi: 10.1016/j.acra.2021.06.007. Epub 2021 Jul 14.

本文引用的文献

Training pathologists to assess stromal tumour-infiltrating lymphocytes in breast cancer synergises efforts in clinical care and scientific research.培训病理学家评估乳腺癌中的间质肿瘤浸润淋巴细胞可协同临床护理和科学研究。

Histopathology. 2024 May;84(6):915-923. doi: 10.1111/his.15140. Epub 2024 Mar 3.

Reproducible Reporting of the Collection and Evaluation of Annotations for Artificial Intelligence Models.人工智能模型注释的收集和评估的可复现报告。

Mod Pathol. 2024 Apr;37(4):100439. doi: 10.1016/j.modpat.2024.100439. Epub 2024 Jan 28.

Initial interactions with the FDA on developing a validation dataset as a medical device development tool.与 FDA 就开发验证数据集作为医疗器械开发工具的初步互动。

J Pathol. 2023 Dec;261(4):378-384. doi: 10.1002/path.6208. Epub 2023 Oct 4.

Clinical Meaning of Stromal Tumor Infiltrating Lymphocytes (sTIL) in Early Luminal B Breast Cancer.早期管腔B型乳腺癌中基质肿瘤浸润淋巴细胞（sTIL）的临床意义

Cancers (Basel). 2023 May 20;15(10):2846. doi: 10.3390/cancers15102846.

Effective and efficient active learning for deep learning-based tissue image analysis.基于深度学习的组织图像分析的有效和高效主动学习。

Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad138.

Pilot study to evaluate tools to collect pathologist annotations for validating machine learning algorithms.评估用于收集病理学家注释以验证机器学习算法的工具的初步研究。

J Med Imaging (Bellingham). 2022 Jul;9(4):047501. doi: 10.1117/1.JMI.9.4.047501. Epub 2022 Jul 27.

External Validation of Deep Learning Algorithms for Radiologic Diagnosis: A Systematic Review.用于放射诊断的深度学习算法的外部验证：一项系统评价。

Radiol Artif Intell. 2022 May 4;4(3):e210064. doi: 10.1148/ryai.210064. eCollection 2022 May.

Cancers (Basel). 2022 May 17;14(10):2467. doi: 10.3390/cancers14102467.

Triple-negative breast cancer: current treatment strategies and factors of negative prognosis.三阴性乳腺癌：当前的治疗策略及不良预后因素

J Med Life. 2022 Feb;15(2):153-161. doi: 10.25122/jml-2021-0108.

Prognostic Value of Stromal Tumor-Infiltrating Lymphocytes in Young, Node-Negative, Triple-Negative Breast Cancer Patients Who Did Not Receive (neo)Adjuvant Systemic Therapy.年轻、淋巴结阴性、三阴性乳腺癌患者未接受（新）辅助全身治疗时，基质肿瘤浸润淋巴细胞的预后价值。

J Clin Oncol. 2022 Jul 20;40(21):2361-2374. doi: 10.1200/JCO.21.01536. Epub 2022 Mar 30.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

从多机构队列中为病理学家注释数据集确定病例优先级。

Prioritizing cases from a multi-institutional cohort for a dataset of pathologist annotations.

作者信息

机构信息

出版信息

OBJECTIVE

MATERIALS AND METHODS

RESULTS

CONCLUSION

目的

材料与方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献