Janowczyk Andrew, Zuo Ren, Gilmore Hannah, Feldman Michael, Madabhushi Anant
Case Western Reserve University, Cleveland, OH.
University Hospitals Cleveland Medical Center, Cleveland, OH.
JCO Clin Cancer Inform. 2019 Apr;3:1-7. doi: 10.1200/CCI.18.00157.
Digital pathology (DP), referring to the digitization of tissue slides, is beginning to change the landscape of clinical diagnostic workflows and has engendered active research within the area of computational pathology. One of the challenges in DP is the presence of artefacts and batch effects, unintentionally introduced during both routine slide preparation (eg, staining, tissue folding) and digitization (eg, blurriness, variations in contrast and hue). Manual review of glass and digital slides is laborious, qualitative, and subject to intra- and inter-reader variability. Therefore, there is a critical need for a reproducible automated approach of precisely localizing artefacts to identify slides that need to be reproduced or regions that should be avoided during computational analysis.
Here we present HistoQC, a tool for rapidly performing quality control to not only identify and delineate artefacts but also discover cohort-level outliers (eg, slides stained darker or lighter than others in the cohort). This open-source tool employs a combination of image metrics (eg, color histograms, brightness, contrast), features (eg, edge detectors), and supervised classifiers (eg, pen detection) to identify artefact-free regions on digitized slides. These regions and metrics are presented to the user via an interactive graphical user interface, facilitating artefact detection through real-time visualization and filtering. These same metrics afford users the opportunity to explicitly define acceptable tolerances for their workflows.
The output of HistoQC on 450 slides from The Cancer Genome Atlas was reviewed by two pathologists and found to be suitable for computational analysis more than 95% of the time.
These results suggest that HistoQC could provide an automated, quantifiable, quality control process for identifying artefacts and measuring slide quality, in turn helping to improve both the repeatability and robustness of DP workflows.
数字病理学(DP),即组织切片的数字化,正开始改变临床诊断工作流程的格局,并在计算病理学领域引发了积极的研究。DP面临的挑战之一是在常规玻片制备(如染色、组织折叠)和数字化过程(如模糊、对比度和色调变化)中无意引入的伪影和批次效应。对玻璃切片和数字切片进行人工检查既费力又定性,而且易受阅片者内部和阅片者之间差异的影响。因此,迫切需要一种可重复的自动化方法来精确地定位伪影,以识别需要重新制作的切片或在计算分析过程中应避免的区域。
在此,我们展示了HistoQC,这是一种用于快速执行质量控制的工具,不仅可以识别和描绘伪影,还能发现队列水平的异常值(如队列中染色比其他切片更深或更浅的切片)。这个开源工具结合了图像指标(如颜色直方图、亮度、对比度)、特征(如边缘检测器)和监督分类器(如笔检测)来识别数字化切片上无伪影的区域。这些区域和指标通过交互式图形用户界面呈现给用户,通过实时可视化和过滤促进伪影检测。这些相同的指标为用户提供了明确为其工作流程定义可接受公差的机会。
两名病理学家对来自癌症基因组图谱的450张切片上的HistoQC输出进行了评估,发现超过95%的时间该输出适用于计算分析。
这些结果表明,HistoQC可以提供一个自动化、可量化的质量控制过程,用于识别伪影和测量切片质量,进而有助于提高DP工作流程的可重复性和稳健性。