Wang Shaopan, He Xin, Jian Zhongquan, Li Jie, Xu Changsheng, Chen Yuguang, Liu Yuwen, Chen Han, Huang Caihong, Hu Jiaoyue, Liu Zuguo
Institute of Artificial Intelligence, Xiamen University, Xiamen, Fujian, China.
School of Informatics, Xiamen University, Xiamen, Fujian, China.
Eye Vis (Lond). 2024 Oct 1;11(1):38. doi: 10.1186/s40662-024-00405-1.
In recent years, ophthalmology has emerged as a new frontier in medical artificial intelligence (AI) with multi-modal AI in ophthalmology garnering significant attention across interdisciplinary research. This integration of various types and data models holds paramount importance as it enables the provision of detailed and precise information for diagnosing eye and vision diseases. By leveraging multi-modal ophthalmology AI techniques, clinicians can enhance the accuracy and efficiency of diagnoses, and thus reduce the risks associated with misdiagnosis and oversight while also enabling more precise management of eye and vision health. However, the widespread adoption of multi-modal ophthalmology poses significant challenges.
In this review, we first summarize comprehensively the concept of modalities in the field of ophthalmology, the forms of fusion between modalities, and the progress of multi-modal ophthalmic AI technology. Finally, we discuss the challenges of current multi-modal AI technology applications in ophthalmology and future feasible research directions.
In the field of ophthalmic AI, evidence suggests that when utilizing multi-modal data, deep learning-based multi-modal AI technology exhibits excellent diagnostic efficacy in assisting the diagnosis of various ophthalmic diseases. Particularly, in the current era marked by the proliferation of large-scale models, multi-modal techniques represent the most promising and advantageous solution for addressing the diagnosis of various ophthalmic diseases from a comprehensive perspective. However, it must be acknowledged that there are still numerous challenges associated with the application of multi-modal techniques in ophthalmic AI before they can be effectively employed in the clinical setting.
近年来,眼科已成为医学人工智能(AI)的一个新前沿领域,眼科中的多模态AI在跨学科研究中备受关注。各种类型和数据模型的这种整合至关重要,因为它能够为诊断眼部和视力疾病提供详细而精确的信息。通过利用多模态眼科AI技术,临床医生可以提高诊断的准确性和效率,从而降低误诊和漏诊的风险,同时还能更精确地管理眼部和视力健康。然而,多模态眼科的广泛应用带来了重大挑战。
在本综述中,我们首先全面总结眼科领域中模态的概念、模态之间的融合形式以及多模态眼科AI技术的进展。最后,我们讨论了当前多模态AI技术在眼科应用中的挑战以及未来可行的研究方向。
在眼科AI领域,有证据表明,在利用多模态数据时,基于深度学习的多模态AI技术在辅助诊断各种眼科疾病方面表现出优异的诊断效果。特别是,在当前以大规模模型激增为特征的时代,多模态技术从全面的角度来看是解决各种眼科疾病诊断问题最有前途和优势的解决方案。然而,必须承认,在多模态技术能够有效地应用于临床之前,其在眼科AI中的应用仍存在众多挑战。