Department of Ophthalmology, Aerospace Medical Center, Republic of Korea Air Force, Cheongju, South Korea.
Department of Anesthesiology and Pain Medicine, Seoul Women's Hospital, Bucheon, South Korea.
PLoS One. 2020 Apr 9;15(4):e0231322. doi: 10.1371/journal.pone.0231322. eCollection 2020.
Wrong-site surgeries can occur due to the absence of an appropriate surgical time-out. However, during a time-out, surgical participants are unable to review the patient's charts due to their aseptic hands. To improve the conditions in surgical time-outs, we introduce a deep learning-based smart speaker to confirm the surgical information prior to cataract surgeries. This pilot study utilized the publicly available audio vocabulary dataset and recorded audio data published by the authors. The audio clips of the target words, such as left, right, cataract, phacoemulsification, and intraocular lens, were selected to determine and confirm surgical information in the time-out speech. A deep convolutional neural network model was trained and implemented in the smart speaker that was developed using a mini development board and commercial speakerphone. To validate our model in the consecutive speeches during time-outs, we generated 200 time-out speeches for cataract surgeries by randomly selecting the surgical statuses of the surgical participants. After the training process, the deep learning model achieved an accuracy of 96.3% for the validation dataset of short-word audio clips. Our deep learning-based smart speaker achieved an accuracy of 93.5% for the 200 time-out speeches. The surgical and procedural accuracy was 100%. Additionally, on validating the deep learning model by using web-generated time-out speeches and video clips for general surgery, the model exhibited a robust and good performance. In this pilot study, the proposed deep learning-based smart speaker was able to successfully confirm the surgical information during the time-out speech. Future studies should focus on collecting real-world time-out data and automatically connecting the device to electronic health records. Adopting smart speaker-assisted time-out phases will improve the patients' safety during cataract surgeries, particularly in relation to wrong-site surgeries.
手术错误部位的发生可能是由于缺乏适当的手术暂停。然而,在暂停期间,由于手术人员的手是无菌的,他们无法查看患者的图表。为了改善手术暂停期间的状况,我们引入了一种基于深度学习的智能扬声器,以在白内障手术前确认手术信息。这项初步研究利用了公开可用的音频词汇数据集和作者发布的录音数据。选择目标词(如左、右、白内障、超声乳化和人工晶状体)的音频片段,以确定和确认暂停期间的手术信息。我们使用迷你开发板和商用扬声器电话开发了一种深度卷积神经网络模型,并在智能扬声器中进行了训练和实现。为了在暂停期间的连续演讲中验证我们的模型,我们通过随机选择手术人员的手术状态生成了 200 个白内障手术的暂停演讲。在训练过程之后,深度学习模型在短词音频剪辑的验证数据集中达到了 96.3%的准确率。我们的基于深度学习的智能扬声器在 200 个暂停演讲中达到了 93.5%的准确率。手术和程序的准确率达到了 100%。此外,在使用网络生成的用于普通外科手术的暂停演讲和视频剪辑验证深度学习模型时,该模型表现出了稳健和良好的性能。在这项初步研究中,所提出的基于深度学习的智能扬声器能够成功地在暂停演讲中确认手术信息。未来的研究应集中于收集真实世界的暂停数据并自动将设备连接到电子健康记录。采用智能扬声器辅助的暂停阶段将提高白内障手术期间患者的安全性,特别是在避免手术错误部位方面。