Bratisl Lek Listy. 2024;125(4):223-232. doi: 10.4149/BLL_2024_34.
Despite being the second most often diagnosed form of cancer, lung cancers are rarely found in the general population. It is proposed in this study to employ a methodology of extracting both global and local features from CT scan images for the identification of lung cancer. Data gathering, globalised and localised training as well as testing the model are all part of this structure. This study makes use of 800 CT scan images. Images are pre-processed by warping and cropping in advance of the global testing step. Each image is represented by a feature vector employing eight distinct types of image characteristics, which are taken from the images. After creating feature vectors, three machine learning methods are employed to create detection models. Every medical image has been partitioned over a series of simple divisions throughout the training and testing process locally. To describe each block, feature vectors are derived from the image features that worked effectively in the general phase of the experiment. Similar extracted features are then used to build detection systems for all picture blocks using the learning strategies that were effective in the global stage. SVM using Haar Wavelet characteristics had an accuracy, sensitivity, and specificity of 89%, 90%, and 89%, respectively. One might get 90%‑accurate results with SVM and 91%‑sensitive and 91%‑specific results using SVM plus HOG features. Finally, the utilisation of SVM with Gabor Filter characteristics achieved the greatest correctness, specificity, and sensitivity values, particularly 87%, 86%, and 87%, respectively (Tab. 3, Fig. 7, Ref. 18). Keywords: feature extraction, support vector machine, lung cancer, classification, machine learning.
尽管肺癌是第二大常见的癌症类型,但在普通人群中很少发现。本研究提出了一种从 CT 扫描图像中提取全局和局部特征的方法,用于识别肺癌。该结构包括数据收集、全球化和本地化训练以及模型测试。本研究使用了 800 张 CT 扫描图像。在进行全局测试之前,对图像进行了变形和裁剪预处理。每个图像都由一个特征向量表示,该向量使用从图像中提取的八种不同类型的图像特征。创建特征向量后,使用三种机器学习方法创建检测模型。在训练和测试过程中,对每一张医学图像都进行了一系列简单的分区。为了描述每个块,从在实验的全局阶段有效工作的图像特征中提取特征向量。然后,使用在全局阶段有效的学习策略,使用提取的相似特征为所有图像块构建检测系统。使用 Haar 小波特征的 SVM 的准确率、敏感度和特异性分别为 89%、90%和 89%。使用 SVM 和 HOG 特征可以获得 90%的准确率、91%的敏感度和 91%的特异性。最后,使用 Gabor 滤波器特征的 SVM 实现了最大的正确性、特异性和敏感度值,分别为 87%、86%和 87%(表 3、图 7、参考文献 18)。关键词:特征提取、支持向量机、肺癌、分类、机器学习。