Klein H M, Eisele T, Klose K C, Stauss I, Brenner M, Ameling W, Günther R W
Diagnostic Radiology, RWTH Aachen, Germany.
Invest Radiol. 1996 Jan;31(1):6-10. doi: 10.1097/00004424-199601000-00002.
To determine the diagnostic performance of an artificial intelligence system for classification of focal liver lesions, in comparison to human observers.
One hundred forty-three focal hepatic lesions were evaluated with dynamic computed tomography. The study comprised 59 hemangiomas, 24 other benign lesions (focal nodular hyperplasia, adenoma), and 60 malignant liver lesions (18 primary, 42 secondary). All lesions but the hemangiomas were histologically examined by needle biopsy. For delineation of the lesion, a region of interest was defined interactively. The pattern recognition was performed in two steps with initial extraction of textural features: training of a classifier and classification of the lesions. The accuracy of classification of hepatic lesions into three groups (hemangioma, other benign processes, malignant lesions) was tested. The results were compared with those achieved by human observers using receiver operating characteristic statistical analysis.
The accuracy (total rate of correct diagnoses) was 90.2%. False classifications were found owing to small size, weak contrast enhancement after bolus injection, respiratory movement, and atypical morphology of the lesion. The area under the receiver operating characteristic curve was not significantly different for computer and human observers.
The system demonstrated a diagnostic accuracy comparable to human observers. Further improvement with increasing numbers of typical computed tomographic series for training of the classifier can be expected.
与人类观察者相比,确定人工智能系统对肝脏局灶性病变进行分类的诊断性能。
对143个肝脏局灶性病变进行动态计算机断层扫描评估。该研究包括59个血管瘤、24个其他良性病变(局灶性结节性增生、腺瘤)和60个恶性肝脏病变(18个原发性、42个继发性)。除血管瘤外,所有病变均通过针吸活检进行组织学检查。为了勾勒病变,交互式定义感兴趣区域。模式识别分两步进行,首先提取纹理特征:训练分类器并对病变进行分类。测试了将肝脏病变分为三组(血管瘤、其他良性病变、恶性病变)的分类准确性。使用受试者操作特征统计分析将结果与人类观察者的结果进行比较。
准确率(正确诊断的总率)为90.2%。由于病变尺寸小、团注后对比增强弱、呼吸运动和病变形态不典型,发现了错误分类。计算机和人类观察者的受试者操作特征曲线下面积无显著差异。
该系统显示出与人类观察者相当的诊断准确性。随着用于训练分类器的典型计算机断层扫描系列数量的增加,有望进一步改进。