Wang Yanlin, Li Gang, Zhang Xinyue, Wang Yue, Zhang Zhenhao, Li Jupeng, Ma Junqi, Wang Linghang
National Center for Stomatology & National Clinical Research Center for Oral Diseases & National Engineering Research Center of Oral Biomaterials and Digital Medical Device & Beijing Key Laboratory of Digital Stomatology & NHC Key Laboratory of Digital Stomatology, Department of Oral and Maxillofacial Radiology, Peking University School and Hospital of Stomatology, Beijing 100080, China.
School of Electronic and Information Engineering, Beijing Jiaotong University, Beijing 100044, China.
Diagnostics (Basel). 2024 Sep 20;14(18):2080. doi: 10.3390/diagnostics14182080.
: During deep learning model training, it is essential to consider the balance among the effects of sample size, actual resources, and time constraints. Single-arm objective performance criteria (OPC) was proposed to determine the optimal positive sample size for training deep learning models in caries recognition. : An expected sensitivity (P) of 0.6 and a clinically acceptable sensitivity (P) of 0.5 were applied to the single-arm OPC calculation formula, yielding an optimal training set comprising 263 carious teeth. U-Net, YOLOv5n, and CariesDetectNet were trained and validated using clinically self-collected cone-beam computed tomography (CBCT) images that included varying quantities of carious teeth. To assess performance, an additional dataset was utilized to evaluate the accuracy of caries detection by both the models and two dental radiologists. : When the number of carious teeth reached approximately 250, the models reached the optimal performance levels. U-Net demonstrated superior performance, achieving accuracy, sensitivity, specificity, F1-Score, and Dice similarity coefficients of 0.9929, 0.9307, 0.9989, 0.9590, and 0.9435, respectively. The three models exhibited greater accuracy in caries recognition compared to dental radiologists. : This study demonstrated that the positive sample size of CBCT images containing caries was predictable and could be calculated using single-arm OPC.
在深度学习模型训练过程中,必须考虑样本量、实际资源和时间限制等因素之间的平衡。提出了单臂目标性能标准(OPC)来确定龋齿识别深度学习模型训练的最佳阳性样本量。将预期敏感度(P)0.6和临床可接受敏感度(P)0.5应用于单臂OPC计算公式,得出一个包含263颗龋齿的最佳训练集。使用临床自行收集的包含不同数量龋齿的锥束计算机断层扫描(CBCT)图像对U-Net、YOLOv5n和CariesDetectNet进行训练和验证。为了评估性能,利用一个额外的数据集来评估模型和两位牙科放射科医生对龋齿检测的准确性。当龋齿数量达到约250颗时,模型达到最佳性能水平。U-Net表现出卓越的性能,准确率、敏感度、特异度、F1分数和骰子相似系数分别达到0.9929、0.9307、0.9989、0.9590和0.9435。与牙科放射科医生相比,这三个模型在龋齿识别方面表现出更高的准确性。本研究表明,包含龋齿的CBCT图像的阳性样本量是可预测的,并且可以使用单臂OPC进行计算。