Department of Ultrasound, the Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang, China.
Department of Radiotherapy Center, The 1st Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang, China.
PeerJ. 2023 Jan 12;11:e14546. doi: 10.7717/peerj.14546. eCollection 2023.
Preoperative prediction of cervical lymph node metastasis in papillary thyroid carcinoma provided a basis for tumor staging and treatment decision. This study aimed to investigate the utility of machine learning and develop different models to preoperatively predict cervical lymph node metastasis based on ultrasonic radiomic features and clinical characteristics in papillary thyroid carcinoma nodules.
Data from 400 papillary thyroid carcinoma nodules were included and divided into training and validation group. With the help of machine learning, clinical characteristics and ultrasonic radiomic features were extracted and selected using randomforest and least absolute shrinkage and selection operator regression before classified by five classifiers. Finally, 10 models were built and their area under the receiver operating characteristic curve, accuracy, sensitivity, specificity, positive predictive value and negative predictive value were measured.
Among the 10 models, RF-RF model revealed the highest area under curve (0.812) and accuracy (0.7542) in validation group. The top 10 variables of it included age, seven textural features, one shape feature and one first-order feature, in which eight were high-dimensional features.
RF-RF model showed the best predictive performance for cervical lymph node metastasis. And the importance features selected by it highlighted the unique role of higher-dimensional statistical methods for radiomics analysis.
术前预测甲状腺乳头状癌的颈部淋巴结转移可为肿瘤分期和治疗决策提供依据。本研究旨在探讨机器学习在术前基于超声放射组学特征和临床特征预测甲状腺乳头状癌结节颈部淋巴结转移中的应用,并建立不同的模型。
纳入 400 个甲状腺乳头状癌结节的数据,并分为训练组和验证组。借助机器学习,使用随机森林和最小绝对收缩和选择算子回归提取并选择临床特征和超声放射组学特征,然后由 5 个分类器进行分类。最后构建 10 个模型,并测量其验证组的受试者工作特征曲线下面积、准确性、敏感性、特异性、阳性预测值和阴性预测值。
在 10 个模型中,RF-RF 模型在验证组中显示出最高的曲线下面积(0.812)和准确性(0.7542)。它的前 10 个变量包括年龄、7 个纹理特征、1 个形状特征和 1 个一阶特征,其中 8 个是高维特征。
RF-RF 模型对颈部淋巴结转移具有最佳的预测性能。它所选择的重要特征突出了高维统计方法在放射组学分析中的独特作用。