Ucan Murat, Kaya Buket, Kaya Mehmet
Department of Computer Technologies, Vocational School of Technical Sciences, Dicle University, Diyarbakir 21200, Turkey.
Department of Electronics and Automation, Firat University, Elazig 23119, Turkey.
Diagnostics (Basel). 2025 Jul 17;15(14):1805. doi: 10.3390/diagnostics15141805.
: Extracting meaningful medical information from chest X-ray images and transcribing it into text is a complex task that requires a high level of expertise and directly affects clinical decision-making processes. Automatic reporting systems for this field in Turkish represent an important gap in scientific research, as they have not been sufficiently addressed in the existing literature. : A deep learning-based approach called Model-SEY was developed with the aim of automatically generating Turkish medical reports from chest X-ray images. The Swin Transformer structure was used in the encoder part of the model to extract image features, while the text generation process was carried out using the cosmosGPT architecture, which was adapted specifically for the Turkish language. : With the permission of the ethics committee, a new dataset was created using image-report pairs obtained from Elazıg Fethi Sekin City Hospital and Indiana University Chest X-Ray dataset and experiments were conducted on this new dataset. In the tests conducted within the scope of the study, scores of 0.6412, 0.5335, 0.4395, 0.4395, 0.3716, and 0.2240 were obtained in BLEU-1, BLEU-2, BLEU-3, BLEU-4, and ROUGE word overlap evaluation metrics, respectively. : Quantitative and qualitative analyses of medical reports autonomously generated by the proposed model have shown that they are meaningful and consistent. The proposed model is one of the first studies in the field of autonomous reporting using deep learning architectures specific to the Turkish language, representing an important step forward in this field. It will also reduce potential human errors during diagnosis by supporting doctors in their decision-making.
从胸部X光图像中提取有意义的医学信息并将其转录为文本是一项复杂的任务,需要高水平的专业知识,并且直接影响临床决策过程。土耳其语在该领域的自动报告系统在科学研究中存在重要空白,因为现有文献对此未作充分探讨。:开发了一种名为Model-SEY的基于深度学习的方法,旨在从胸部X光图像中自动生成土耳其语医学报告。模型的编码器部分使用Swin Transformer结构来提取图像特征,而文本生成过程则使用专门为土耳其语改编的cosmosGPT架构。:在伦理委员会的许可下,使用从埃拉泽省费提·塞金市医院和印第安纳大学胸部X光数据集获得的图像-报告对创建了一个新数据集,并在这个新数据集上进行了实验。在研究范围内进行的测试中,在BLEU-1、BLEU-2、BLEU-3、BLEU-4和ROUGE词重叠评估指标中分别获得了0.6412、0.5335、0.4395、0.4395、0.3716和0.2240的分数。:对所提出模型自动生成的医学报告进行的定量和定性分析表明,这些报告是有意义且一致的。所提出的模型是使用特定于土耳其语的深度学习架构进行自主报告领域的首批研究之一,代表了该领域向前迈出的重要一步。它还将通过支持医生决策来减少诊断过程中潜在的人为错误。