Department of Computer Science, Florida Polytechnic University, Lakeland, FL, USA.
Department of Data Science and Business Analytics, Florida Polytechnic University, Lakeland, FL, USA.
Surg Endosc. 2023 Jun;37(6):4754-4765. doi: 10.1007/s00464-023-09955-2. Epub 2023 Mar 10.
We previously developed grading metrics for quantitative performance measurement for simulated endoscopic sleeve gastroplasty (ESG) to create a scalar reference to classify subjects into experts and novices. In this work, we used synthetic data generation and expanded our skill level analysis using machine learning techniques.
We used the synthetic data generation algorithm SMOTE to expand and balance our dataset of seven actual simulated ESG procedures using synthetic data. We performed optimization to seek optimum metrics to classify experts and novices by identifying the most critical and distinctive sub-tasks. We used support vector machine (SVM), AdaBoost, K-nearest neighbors (KNN) Kernel Fisher discriminant analysis (KFDA), random forest, and decision tree classifiers to classify surgeons as experts or novices after grading. Furthermore, we used an optimization model to create weights for each task and separate the clusters by maximizing the distance between the expert and novice scores.
We split our dataset into a training set of 15 samples and a testing dataset of five samples. We put this dataset through six classifiers, SVM, KFDA, AdaBoost, KNN, random forest, and decision tree, resulting in 0.94, 0.94, 1.00, 1.00, 1.00, and 1.00 accuracy, respectively, for training and 1.00 accuracy for the testing results for SVM and AdaBoost. Our optimization model maximized the distance between the expert and novice groups from 2 to 53.72.
This paper shows that feature reduction, in combination with classification algorithms such as SVM and KNN, can be used in tandem to classify endoscopists as experts or novices based on their results recorded using our grading metrics. Furthermore, this work introduces a non-linear constraint optimization to separate the two clusters and find the most important tasks using weights.
我们之前开发了用于模拟内镜袖状胃成形术(ESG)的定量绩效评估的评分指标,以创建一个标量参考,将受试者分为专家和新手。在这项工作中,我们使用了合成数据生成,并使用机器学习技术扩展了我们的技能水平分析。
我们使用合成数据生成算法 SMOTE 来扩展和平衡我们的七个实际模拟 ESG 程序数据集,使用合成数据。我们通过识别最关键和最独特的子任务,进行优化以寻求最佳指标来对专家和新手进行分类。我们使用支持向量机(SVM)、AdaBoost、K-最近邻(KNN)核Fisher 判别分析(KFDA)、随机森林和决策树分类器对分级后的外科医生进行分类,以将其分类为专家或新手。此外,我们使用优化模型为每个任务创建权重,并通过最大化专家和新手得分之间的距离来分离集群。
我们将数据集分为训练集 15 个样本和测试数据集 5 个样本。我们将该数据集通过六个分类器,即 SVM、KFDA、AdaBoost、KNN、随机森林和决策树进行处理,分别得到 0.94、0.94、1.00、1.00、1.00 和 1.00 的训练准确率,以及 SVM 和 AdaBoost 的 1.00 的测试准确率。我们的优化模型将专家和新手组之间的距离从 2 最大化到 53.72。
本文表明,特征减少与 SVM 和 KNN 等分类算法相结合,可以用于根据我们的评分指标记录的结果对内镜医生进行分类,将其分为专家和新手。此外,这项工作引入了一种非线性约束优化,以使用权重分离两个集群并找到最重要的任务。