KITE | Toronto Rehabilitation Institute-University Health Network, Toronto, Canada.
Department of Otolaryngology/Head and Neck Surgery, Massachusetts Eye and Ear Infirmary and Harvard Medical School, Boston, Massachusetts.
Facial Plast Surg Aesthet Med. 2020 Jan/Feb;22(1):42-49. doi: 10.1089/fpsam.2019.29000.gua.
Quantitative assessment of facial function is challenging, and subjective grading scales such as House-Brackmann, Sunnybrook, and eFACE have well-recognized limitations. Machine learning (ML) approaches to facial landmark localization carry great clinical potential as they enable high-throughput automated quantification of relevant facial metrics from photographs and videos. However, the translation from research settings to clinical application still requires important improvements. To develop a novel ML algorithm for fast and accurate localization of facial landmarks in photographs of facial palsy patients and utilize this technology as part of an automated computer-aided diagnosis system. Portrait photographs of 8 expressions obtained from 200 facial palsy patients and 10 healthy participants were manually annotated by localizing 68 facial landmarks in each photograph and by 3 trained clinicians using a custom graphical user interface. A novel ML model for automated facial landmark localization was trained using this disease-specific database. Algorithm accuracy was compared with manual markings and the output of a model trained using a larger database consisting only of healthy subjects. Root mean square error normalized by the interocular distance (NRMSE) of facial landmark localization between prediction of ML algorithm and manually localized landmarks. Publicly available algorithms for facial landmark localization provide poor localization accuracy when applied to photographs of patients compared with photographs of healthy controls (NRMSE, 8.56 ± 2.16 vs. 7.09 ± 2.34, ≪ 0.01). We found significant improvement in facial landmark localization accuracy for the facial palsy patient population when using a model trained with a relatively small number photographs (1440) of patients compared with a model trained using several thousand more images of healthy faces (NRMSE, 6.03 ± 2.43 vs. 8.56 ± 2.16, ≪ 0.01). Retraining a computer vision facial landmark detection model with fewer than 1600 annotated images of patients significantly improved landmark detection performance in frontal view photographs of this population. The new annotated database and facial landmark localization model represent the first steps toward an automatic system for computer-aided assessment in facial palsy. 4.
对面部功能进行定量评估具有挑战性,而 House-Brackmann、Sunnybrook 和 eFACE 等主观分级量表存在明显的局限性。面部地标定位的机器学习 (ML) 方法具有很大的临床应用潜力,因为它们能够从照片和视频中实现相关面部指标的高通量自动量化。然而,从研究环境到临床应用的转化仍需要重要的改进。为了开发一种新的 ML 算法,用于快速准确地定位面瘫患者照片中的面部地标,并将该技术用作自动计算机辅助诊断系统的一部分。从 200 名面瘫患者和 10 名健康参与者中获得的 8 种表情的肖像照片,由 3 名经过培训的临床医生使用自定义图形用户界面在每张照片中定位 68 个面部地标,并手动进行标记。使用这个特定于疾病的数据库训练了一种用于自动面部地标定位的新型 ML 模型。算法的准确性与手动标记和仅使用健康受试者的大型数据库训练的模型的输出进行了比较。面部地标定位的均方根误差除以眼间距离 (NRMSE)。与健康对照组的照片相比,用于患者照片的公开面部地标定位算法提供的定位精度较差(NRMSE,8.56 ± 2.16 对 7.09 ± 2.34,≪0.01)。我们发现,与使用数千张健康面孔图像训练的模型相比,使用相对较少的患者照片(1440 张)训练的模型对面瘫患者人群的面部地标定位准确性有显著提高(NRMSE,6.03 ± 2.43 对 8.56 ± 2.16,≪0.01)。用少于 1600 张患者注释图像重新训练计算机视觉面部地标检测模型,显著提高了该人群正面照片中的地标检测性能。新的注释数据库和面部地标定位模型代表了实现面瘫自动计算机辅助评估系统的第一步。4.