Cheng Christopher P, Sicard Ryan, Vujovic Dragan, Vasan Vikram, Choi Chris, Lerner David K, Iloreta Alfred-Marc
Department of Medical Education Icahn School of Medicine at Mount Sinai New York New York USA.
Department of Otolaryngology Icahn School of Medicine at Mount Sinai New York New York USA.
World J Otorhinolaryngol Head Neck Surg. 2024 May 28;11(2):198-206. doi: 10.1002/wjo2.188. eCollection 2025 Jun.
Documentation and billing are important and time-consuming parts of an otolaryngologist's work. Given advancements in machine learning (ML), we evaluated the ability of ML algorithms to use operative notes to classify rhinology procedures by Current Procedural Terminology (CPT®) code. We aimed to assess the potential for ML to replicate rhinologists' completion of their administrative tasks.
Retrospective cohort study.
Urban tertiary hospital.
A total of 594 operative notes from rhinological procedures across six CPT codes performed from 3/2017 to 4/2022 were collected from 22 otolaryngologists. Text was preprocessed and then vectorized using CountVectorizer (CV), term frequency-inverse document frequency, and Word2Vec. The Decision Tree, Support Vector Machine, Logistic Regression and Naïve Bayes (NB) algorithms were used to train and test models on operative notes. Model-classified CPT codes were compared to codes assigned by operating surgeons. Model performance was evaluated by area under the receiver operating characteristic curve (ROC-AUC), precision, recall, and F1-score.
Performance varied across vectorizers and ML algorithms. Across all performance metrics, CV and NB was most overall the best combination of vectorizer and ML algorithm across CPT codes and produced the single best AUC, 0.984.
In otolaryngology applications, the performance of basic ML algorithms varies depending on the context in which they are used. All algorithms demonstrated their ability to classify CPT codes well as well as the potential for using ML to replicate rhinologists' completion of their administrative tasks.
文档记录和计费是耳鼻喉科医生工作中重要且耗时的部分。鉴于机器学习(ML)的进展,我们评估了ML算法利用手术记录按现行程序编码术语(CPT®)对鼻科手术进行分类的能力。我们旨在评估ML复制鼻科医生完成其行政任务的潜力。
回顾性队列研究。
城市三级医院。
从22名耳鼻喉科医生处收集了2017年3月至2022年4月期间进行的六个CPT编码的鼻科手术的594份手术记录。文本经过预处理,然后使用计数向量器(CV)、词频逆文档频率和词向量(Word2Vec)进行向量化。使用决策树、支持向量机、逻辑回归和朴素贝叶斯(NB)算法对手术记录进行模型训练和测试。将模型分类的CPT编码与手术医生指定的编码进行比较。通过受试者操作特征曲线下面积(ROC-AUC)、精确率、召回率和F1分数评估模型性能。
不同向量化器和ML算法的性能有所不同。在所有性能指标中,CV和NB总体上是跨CPT编码的向量化器和ML算法的最佳组合,产生了单个最佳AUC,即0.984。
在耳鼻喉科应用中,基本ML算法的性能因其使用环境而异。所有算法都展示了它们对CPT编码进行良好分类的能力以及利用ML复制鼻科医生完成其行政任务的潜力。