基于智能手机视频，利用面部姿势估计增强的双交叉注意力建模对眼睑痉挛进行早期诊断。

Smartphone video-based early diagnosis of blepharospasm using dual cross-attention modeling enhanced by facial pose estimation.

作者信息

Huang Shenyu, Yang Boyuan, Huang Xiaoling, Zhang Huina, Luo Dong, Tong Guanchao, Wang Yijie, Shao Yongqing, Chen Menglu, Gao Qi, Ye Juan

机构信息

Zhejiang University, Eye Center of Second Affiliated Hospital, School of Medicine. Zhejiang Provincial Key Laboratory of Ophthalmology. Zhejiang Provincial Clinical Research Center for Eye Diseases. Zhejiang Provincial Engineering Institute on Eye Diseases, Hangzhou, China.

Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA.

出版信息

NPJ Digit Med. 2025 Aug 5;8(1):505. doi: 10.1038/s41746-025-01904-8.

DOI:10.1038/s41746-025-01904-8

PMID:40764679

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12325706/

Abstract

Blepharospasm is a focal dystonia characterized by involuntary eyelid contractions that impair vision and social function. The subtle clinical signs of blepharospasm make early and accurate diagnosis difficult, delaying timely intervention. In this study, we propose a dual cross-attention deep learning framework that integrates temporal video features and facial landmark dynamics to assess blepharospasm severity, frequency, and diagnosis from smartphone-recorded facial videos. A retrospective dataset of 847 patient videos collected from two hospitals (2016-2023) was used for model development. The model achieved high accuracy for severity (0.828) and frequency (0.82), and moderate performance for diagnosis (0.674).SHAP analysis identified case-specific video fragments contributing to predictions, enhancing interpretability. In a prospective evaluation on an independent dataset (N = 179), AI assistance improved junior ophthalmologist's diagnostic accuracy by up to 18.5%. These findings demonstrate the potential of an explainable, smartphone-compatible video model to support early detection and assessment of blepharospasm.

摘要

眼睑痉挛是一种局灶性肌张力障碍，其特征为不自主的眼睑收缩，会损害视力和社交功能。眼睑痉挛的细微临床体征使得早期准确诊断变得困难，从而延误了及时干预。在本研究中，我们提出了一种双交叉注意力深度学习框架，该框架整合了视频的时间特征和面部标志动态，以从智能手机录制的面部视频中评估眼睑痉挛的严重程度、频率和诊断情况。我们使用从两家医院收集的847例患者视频的回顾性数据集（2016 - 2023年）进行模型开发。该模型在严重程度（0.828）和频率（0.82）方面取得了较高的准确率，在诊断方面表现中等（0.674）。SHAP分析确定了有助于预测的特定病例视频片段，增强了可解释性。在对一个独立数据集（N = 179）的前瞻性评估中，人工智能辅助将初级眼科医生的诊断准确率提高了18.5%。这些发现证明了一种可解释的、与智能手机兼容的视频模型在支持眼睑痉挛早期检测和评估方面的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3ee/12325706/8189a49d90bc/41746_2025_1904_Fig1_HTML.jpg

相似文献

Smartphone video-based early diagnosis of blepharospasm using dual cross-attention modeling enhanced by facial pose estimation.基于智能手机视频，利用面部姿势估计增强的双交叉注意力建模对眼睑痉挛进行早期诊断。

NPJ Digit Med. 2025 Aug 5;8(1):505. doi: 10.1038/s41746-025-01904-8.

Facial Emotion Recognition of 16 Distinct Emotions From Smartphone Videos: Comparative Study of Machine Learning and Human Performance.基于智能手机视频的16种不同情绪的面部表情识别：机器学习与人类表现的对比研究

J Med Internet Res. 2025 Jul 2;27:e68942. doi: 10.2196/68942.

Improving reliability of movement assessment in Parkinson's disease using computer vision-based automated severity estimation.利用基于计算机视觉的自动严重程度估计提高帕金森病运动评估的可靠性。

J Parkinsons Dis. 2025 Mar;15(2):349-360. doi: 10.1177/1877718X241312605. Epub 2025 Feb 13.

Botulinum toxin type A therapy for blepharospasm.A型肉毒杆菌毒素治疗睑痉挛。

Cochrane Database Syst Rev. 2005 Jan 25(1):CD004900. doi: 10.1002/14651858.CD004900.pub2.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Artificial intelligence for detecting keratoconus.人工智能在圆锥角膜检测中的应用。

Cochrane Database Syst Rev. 2023 Nov 15;11(11):CD014911. doi: 10.1002/14651858.CD014911.pub2.

A Comprehensive and Modality Diverse Cervical Spine and Back Musculoskeletal Physical Exam Curriculum for Medical Students.面向医学生的全面且多模态的颈椎和背部肌肉骨骼物理检查课程

J Educ Teach Emerg Med. 2025 Jul 31;10(3):SG1-SG8. doi: 10.21980/J8RQ0N. eCollection 2025 Jul.

Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。

Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.

Emergency Medical Services Streaming Enabled Evaluation In Trauma: The SEE-IT Feasibility RCT.创伤中启用紧急医疗服务流的评估：SEE-IT可行性随机对照试验

Health Soc Care Deliv Res. 2025 May 28:1-38. doi: 10.3310/EUFS2314.

本文引用的文献

Head movement dynamics in dystonia: a multi-centre retrospective study using visual perceptive deep learning.肌张力障碍中的头部运动动力学：一项使用视觉感知深度学习的多中心回顾性研究。

NPJ Digit Med. 2024 Jun 18;7(1):160. doi: 10.1038/s41746-024-01140-6.

Facial emotion recognition deficits are associated with hypomimia and related brain correlates in Parkinson's disease.面部表情识别缺陷与帕金森病中的运动不能（hypomimia）及相关的大脑相关因素有关。

J Neural Transm (Vienna). 2024 Dec;131(12):1463-1469. doi: 10.1007/s00702-023-02725-3. Epub 2024 Jan 11.

PLoS One. 2023 Mar 15;18(3):e0283111. doi: 10.1371/journal.pone.0283111. eCollection 2023.

Automated extraction of clinical measures from videos of oculofacial disorders using machine learning: feasibility, validity and reliability.使用机器学习从眼面疾病视频中自动提取临床指标：可行性、有效性和可靠性。

Eye (Lond). 2023 Sep;37(13):2810-2816. doi: 10.1038/s41433-023-02424-z. Epub 2023 Feb 1.

A multi-feature deep learning system to enhance glaucoma severity diagnosis with high accuracy and fast speed.一种多特征深度学习系统，可实现高精度和快速的青光眼严重程度诊断。

J Biomed Inform. 2022 Dec;136:104233. doi: 10.1016/j.jbi.2022.104233. Epub 2022 Oct 21.

Telemedicine in Oculoplastics: The Real-Life Application of Video Consultation Clinics.眼整形外科学中的远程医疗：视频咨询诊所的实际应用。

Ophthalmic Plast Reconstr Surg. 2021;37(3S):S104-S108. doi: 10.1097/IOP.0000000000001852.

Risk of spread in adult-onset isolated focal dystonia: a prospective international cohort study.成人起病的局灶性肌张力障碍的传播风险：一项前瞻性国际队列研究。

J Neurol Neurosurg Psychiatry. 2020 Mar;91(3):314-320. doi: 10.1136/jnnp-2019-321794. Epub 2019 Dec 17.

A neural network-based software to recognise blepharospasm symptoms and to measure eye closure time.一种基于神经网络的软件，用于识别眼睑痉挛症状和测量闭眼时间。

Comput Biol Med. 2019 Sep;112:103376. doi: 10.1016/j.compbiomed.2019.103376. Epub 2019 Jul 31.

Disease progression in blepharospasm: a 5-year longitudinal study.眼睑痉挛的疾病进展：一项 5 年的纵向研究。

Eur J Neurol. 2019 Feb;26(2):268-273. doi: 10.1111/ene.13832. Epub 2018 Nov 12.

Dystonia.肌张力障碍

Nat Rev Dis Primers. 2018 Sep 20;4(1):25. doi: 10.1038/s41572-018-0023-6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于智能手机视频，利用面部姿势估计增强的双交叉注意力建模对眼睑痉挛进行早期诊断。

Smartphone video-based early diagnosis of blepharospasm using dual cross-attention modeling enhanced by facial pose estimation.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献