Doctoral Program Department of Computer Science and Electronics, Faculty of Mathematics and Natural Science, 59166Universitas Gadjah Mada, Yogyakarta, Indonesia.
Department of Information Technology, Faculty Computer and Informatics, Institut Teknologi Dan Bisnis STIKOM Bali, Bali, Indonesia.
Health Informatics J. 2022 Jan-Mar;28(1):14604582221084211. doi: 10.1177/14604582221084211.
The umbilical cord is an organ that circulates oxygen and nutrition from mother to fetus during pregnancy. This study aims to classify the umbilical cord based on ultrasound images. The similarity of shape and coil between each class becomes a challenge. Therefore, it requires feature values that are relevant to the characteristics of these three classes. The condition of imbalanced data sets in this study is also an obstacle that causes the classifier's performance to degrade on minority classes. Therefore, this study proposes a machine learning model capable of properly dealing with imbalanced data sets and recognizing the umbilical cord class.Furthermore, this study proposes a new feature extraction method, namely, the umbilical coiling index (UCI), which directly adopts obstetricians' knowledge. The proposed model consists of five stages: image preprocessing, feature extraction, feature selection, oversampling data using SMOTE, and Classification. Machine learning method observations were carried out comprehensively on five based classifiers: Random Forest, KNN, Decision tree, SVM, Naïve Bayes, and Multiclassifier. The results showed that the Random forest and Multiclassifier methods provide the highest accuracy, precision, recall, and F-measure performance in imbalanced data sets.
脐带是一种在怀孕期间将氧气和营养从母亲输送到胎儿的器官。本研究旨在根据超声图像对脐带进行分类。每种类别之间的形状和线圈相似性成为一个挑战。因此,它需要与这三个类别特征相关的特征值。本研究中不平衡数据集的条件也是一个障碍,导致分类器在少数类别的性能下降。因此,本研究提出了一种能够正确处理不平衡数据集并识别脐带类别的机器学习模型。此外,本研究提出了一种新的特征提取方法,即脐带缠绕指数(UCI),它直接采用了妇产科医生的知识。所提出的模型由五个阶段组成:图像预处理、特征提取、特征选择、使用 SMOTE 进行过采样数据以及分类。在五个基于类别的分类器上进行了全面的机器学习方法观察:随机森林、KNN、决策树、SVM、朴素贝叶斯和多分类器。结果表明,在不平衡数据集中,随机森林和多分类器方法提供了最高的准确性、精度、召回率和 F 度量性能。