Yamagata Yuki, Yamada Ryota
R-IH, BioResource Research Center RIKEN, Tsukuba, 305-0074, Japan.
BioResource Research Center RIKEN, Tsukuba, 305-0074, Japan.
Genomics Inform. 2024 Jun 17;22(1):7. doi: 10.1186/s44342-024-00011-6.
This study evaluated large language models (LLMs), particularly the GPT-4 with vision (GPT-4 V) and GPT-4 Turbo, for annotating biomedical figures, focusing on cellular senescence. We assessed the ability of LLMs to categorize and annotate complex biomedical images to enhance their accuracy and efficiency. Our experiments employed prompt engineering with figures from review articles, achieving more than 70% accuracy for label extraction and approximately 80% accuracy for node-type classification. Challenges were noted in the correct annotation of the relationship between directionality and inhibitory processes, which were exacerbated as the number of nodes increased. Using figure legends was a more precise identification of sources and targets than using captions, but sometimes lacked pathway details. This study underscores the potential of LLMs in decoding biological mechanisms from text and outlines avenues for improving inhibitory relationship representations in biomedical informatics.
本研究评估了大语言模型(LLMs),特别是具有视觉功能的GPT-4(GPT-4 V)和GPT-4 Turbo,用于注释生物医学图像,重点是细胞衰老。我们评估了大语言模型对复杂生物医学图像进行分类和注释的能力,以提高其准确性和效率。我们的实验对综述文章中的图像采用了提示工程,标签提取准确率超过70%,节点类型分类准确率约为80%。在正确注释方向性和抑制过程之间的关系方面存在挑战,随着节点数量的增加,这些挑战会加剧。使用图注比使用标题更能精确识别来源和目标,但有时缺乏通路细节。本研究强调了大语言模型在从文本中解码生物学机制方面的潜力,并概述了改善生物医学信息学中抑制关系表示的途径。