Ma Ling, Liu Xiabi, Fei Baowei
Department of Radiology and Imaging Sciences, Emory University School of Medicine, Atlanta, GA, USA. School of Computer Science, Beijing Institute of Technology, Beijing, People's Republic of China.
Phys Med Biol. 2017 Jan 21;62(2):612-632. doi: 10.1088/1361-6560/62/2/612. Epub 2016 Dec 29.
Common CT imaging signs of lung diseases (CISLs) are defined as the imaging signs that frequently appear in lung CT images from patients. CISLs play important roles in the diagnosis of lung diseases. This paper proposes a novel learning method, namely learning with distribution of optimized feature (DOF), to effectively recognize the characteristics of CISLs. We improve the classification performance by learning the optimized features under different distributions. Specifically, we adopt the minimum spanning tree algorithm to capture the relationship between features and discriminant ability of features for selecting the most important features. To overcome the problem of various distributions in one CISL, we propose a hierarchical learning method. First, we use an unsupervised learning method to cluster samples into groups based on their distribution. Second, in each group, we use a supervised learning method to train a model based on their categories of CISLs. Finally, we obtain multiple classification decisions from multiple trained models and use majority voting to achieve the final decision. The proposed approach has been implemented on a set of 511 samples captured from human lung CT images and achieves a classification accuracy of 91.96%. The proposed DOF method is effective and can provide a useful tool for computer-aided diagnosis of lung diseases on CT images.
肺部疾病的常见CT影像征象(CISLs)被定义为在患者肺部CT图像中频繁出现的影像征象。CISLs在肺部疾病的诊断中发挥着重要作用。本文提出了一种新颖的学习方法,即基于优化特征分布的学习(DOF),以有效识别CISLs的特征。我们通过在不同分布下学习优化特征来提高分类性能。具体而言,我们采用最小生成树算法来捕捉特征之间的关系以及特征的判别能力,以选择最重要的特征。为了克服一个CISL中存在多种分布的问题,我们提出了一种分层学习方法。首先,我们使用无监督学习方法根据样本的分布将其聚类成组。其次,在每组中,我们使用监督学习方法根据CISLs的类别训练模型。最后,我们从多个训练模型中获得多个分类决策,并使用多数投票来达成最终决策。所提出的方法已在从人类肺部CT图像中采集的一组511个样本上实现,分类准确率达到了91.96%。所提出的DOF方法是有效的,可为CT图像上的肺部疾病计算机辅助诊断提供有用工具。