• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GPT-4o 和 Gemini 1.5 Pro 在革兰氏染色和细菌形态识别方面的能力。

Capabilities of GPT-4o and Gemini 1.5 Pro in Gram stain and bacterial shape identification.

机构信息

Department of Internal Medicine, Cleveland Clinic, Cleveland, OH 44195, USA.

Department of Infectious Diseases, Cleveland Clinic, Cleveland, OH 44195, USA.

出版信息

Future Microbiol. 2024;19(15):1283-1292. doi: 10.1080/17460913.2024.2381967. Epub 2024 Jul 29.

DOI:10.1080/17460913.2024.2381967
PMID:39069960
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11486216/
Abstract

Assessing the visual accuracy of two large language models (LLMs) in microbial classification. GPT-4o and Gemini 1.5 Pro were evaluated in distinguishing Gram-positive from Gram-negative bacteria and classifying them as cocci or bacilli using 80 Gram stain images from a labeled database. GPT-4o achieved 100% accuracy in identifying simultaneously Gram stain and shape for , and . Gemini 1.5 Pro showed more variability for similar bacteria (45, 100 and 95%, respectively). Both LLMs failed to identify both Gram stain and bacterial shape for . Cumulative accuracy plots indicated that GPT-4o consistently performed equally or better in every identification, except for shape. These results suggest that these LLMs in their unprimed state are not ready to be implemented in clinical practice and highlight the need for more research with larger datasets to improve LLMs' effectiveness in clinical microbiology.

摘要

评估两种大型语言模型(LLMs)在微生物分类中的视觉准确性。使用来自标记数据库的 80 张革兰氏染色图像,评估 GPT-4o 和 Gemini 1.5 Pro 在区分革兰氏阳性菌和革兰氏阴性菌以及将其分类为球菌或杆菌方面的准确性。GPT-4o 在识别同时的革兰氏染色和形状方面达到了 100%的准确率,对于 、 和 。Gemini 1.5 Pro 对相似的细菌显示出更多的可变性(分别为 45%、100%和 95%)。这两种 LLM 都无法识别 和 的革兰氏染色和细菌形状。累积准确率图表明,GPT-4o 在每次识别中表现一致或更好,除了 形状。这些结果表明,这些未经过预训练的 LLM 还没有准备好在临床实践中实施,并强调需要使用更大的数据集进行更多研究,以提高 LLM 在临床微生物学中的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/137b2c3cf836/IFMB_A_2381967_F0006_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/29805ee00a9f/IFMB_A_2381967_F0001_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/5eba72df04a1/IFMB_A_2381967_F0002_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/eea25ddabc67/IFMB_A_2381967_F0003_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/0f3220dd4e5a/IFMB_A_2381967_F0004_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/1ccf6f17e745/IFMB_A_2381967_F0005_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/137b2c3cf836/IFMB_A_2381967_F0006_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/29805ee00a9f/IFMB_A_2381967_F0001_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/5eba72df04a1/IFMB_A_2381967_F0002_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/eea25ddabc67/IFMB_A_2381967_F0003_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/0f3220dd4e5a/IFMB_A_2381967_F0004_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/1ccf6f17e745/IFMB_A_2381967_F0005_C.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05d8/11486216/137b2c3cf836/IFMB_A_2381967_F0006_C.jpg

相似文献

1
Capabilities of GPT-4o and Gemini 1.5 Pro in Gram stain and bacterial shape identification.GPT-4o 和 Gemini 1.5 Pro 在革兰氏染色和细菌形态识别方面的能力。
Future Microbiol. 2024;19(15):1283-1292. doi: 10.1080/17460913.2024.2381967. Epub 2024 Jul 29.
2
Diagnostic performances of GPT-4o, Claude 3 Opus, and Gemini 1.5 Pro in "Diagnosis Please" cases.GPT-4o、Claude 3 Opus 和 Gemini 1.5 Pro 在“诊断请”案例中的诊断性能。
Jpn J Radiol. 2024 Nov;42(11):1231-1235. doi: 10.1007/s11604-024-01619-y. Epub 2024 Jul 1.
3
The accuracy of Gemini, GPT-4, and GPT-4o in ECG analysis: A comparison with cardiologists and emergency medicine specialists.Gemini、GPT-4 和 GPT-4o 在心电图分析中的准确性:与心脏病专家和急诊医学专家的比较。
Am J Emerg Med. 2024 Oct;84:68-73. doi: 10.1016/j.ajem.2024.07.043. Epub 2024 Jul 30.
4
Accuracy of Large Language Models for Infective Endocarditis Prophylaxis in Dental Procedures.大型语言模型在牙科手术中预防感染性心内膜炎的准确性。
Int Dent J. 2025 Feb;75(1):206-212. doi: 10.1016/j.identj.2024.09.033. Epub 2024 Oct 12.
5
Evaluating text and visual diagnostic capabilities of large language models on questions related to the Breast Imaging Reporting and Data System Atlas 5 edition.评估大语言模型在与《乳腺影像报告和数据系统》第5版相关问题上的文本和视觉诊断能力。
Diagn Interv Radiol. 2025 Mar 3;31(2):111-129. doi: 10.4274/dir.2024.242876. Epub 2024 Sep 9.
6
Evaluating AI Proficiency in Nuclear Cardiology: Large Language Models take on the Board Preparation Exam.评估人工智能在核心脏病学方面的能力:大型语言模型参加资格考试准备。
medRxiv. 2024 Jul 16:2024.07.16.24310297. doi: 10.1101/2024.07.16.24310297.
7
An optimized staining technique for the detection of Gram positive and Gram negative bacteria within tissue.一种用于检测组织内革兰氏阳性菌和革兰氏阴性菌的优化染色技术。
BMC Res Notes. 2016 Apr 12;9:216. doi: 10.1186/s13104-016-1902-0.
8
The accuracy of Gram stain of respiratory specimens in excluding Staphylococcus aureus in ventilator-associated pneumonia.呼吸道标本革兰氏染色在排除呼吸机相关性肺炎中金黄色葡萄球菌方面的准确性。
J Crit Care. 2014 Oct;29(5):739-42. doi: 10.1016/j.jcrc.2014.05.011. Epub 2014 May 28.
9
Evaluation of the Gram stain of sputa from cystic fibrosis patients to predict the presence of Staphylococcus aureus.评价囊性纤维化患者痰液的革兰氏染色,以预测金黄色葡萄球菌的存在。
Diagn Microbiol Infect Dis. 2013 Oct;77(2):99-100. doi: 10.1016/j.diagmicrobio.2013.06.022. Epub 2013 Jul 23.
10
The use of a differential fluorescent staining method to detect bacteriuria.使用差异荧光染色法检测菌尿症。
Clin Lab. 2004;50(11-12):685-8.

引用本文的文献

1
Do LLMs Have 'the Eye' for MRI? Evaluating GPT-4o, Grok, and Gemini on Brain MRI Performance: First Evaluation of Grok in Medical Imaging and a Comparative Analysis.大型语言模型对磁共振成像有“洞察力”吗?评估GPT-4o、Grok和Gemini在脑部磁共振成像性能方面的表现:Grok在医学成像中的首次评估及比较分析
Diagnostics (Basel). 2025 May 24;15(11):1320. doi: 10.3390/diagnostics15111320.

本文引用的文献

1
ChatGPT as a Tool for Medical Education and Clinical Decision-Making on the Wards: Case Study.ChatGPT作为病房医学教育和临床决策工具:案例研究
JMIR Form Res. 2024 May 8;8:e51346. doi: 10.2196/51346.
2
Vision-Language Models for Feature Detection of Macular Diseases on Optical Coherence Tomography.基于视觉-语言模型的光学相干断层扫描图像黄斑病变特征检测
JAMA Ophthalmol. 2024 Jun 1;142(6):573-576. doi: 10.1001/jamaophthalmol.2024.1165.
3
Interpretable predictions of chaotic dynamical systems using dynamical system deep learning.
使用动态系统深度学习对混沌动态系统进行可解释预测。
Sci Rep. 2024 Feb 7;14(1):3143. doi: 10.1038/s41598-024-53169-y.
4
The future landscape of large language models in medicine.医学领域大语言模型的未来前景。
Commun Med (Lond). 2023 Oct 10;3(1):141. doi: 10.1038/s43856-023-00370-1.
5
Rapid Convolutional Neural Networks for Gram-Stained Image Classification at Inference Time on Mobile Devices: Empirical Study from Transfer Learning to Optimization.用于移动设备推理时革兰氏染色图像分类的快速卷积神经网络:从迁移学习到优化的实证研究
Biomedicines. 2022 Nov 4;10(11):2808. doi: 10.3390/biomedicines10112808.
6
Deep Learning Frameworks for Rapid Gram Stain Image Data Interpretation: Protocol for a Retrospective Data Analysis.用于快速革兰氏染色图像数据解读的深度学习框架:一项回顾性数据分析方案
JMIR Res Protoc. 2020 Jul 13;9(7):e16843. doi: 10.2196/16843.
7
Image analysis and artificial intelligence in infectious disease diagnostics.传染病诊断中的图像分析和人工智能。
Clin Microbiol Infect. 2020 Oct;26(10):1318-1323. doi: 10.1016/j.cmi.2020.03.012. Epub 2020 Mar 22.
8
Sputum Gram Stain for Bacterial Pathogen Diagnosis in Community-acquired Pneumonia: A Systematic Review and Bayesian Meta-analysis of Diagnostic Accuracy and Yield.社区获得性肺炎细菌病原体诊断中的痰革兰氏染色:诊断准确性和收益的系统评价和贝叶斯荟萃分析。
Clin Infect Dis. 2020 Jul 27;71(3):499-513. doi: 10.1093/cid/ciz876.
9
Automated Interpretation of Blood Culture Gram Stains by Use of a Deep Convolutional Neural Network.利用深度卷积神经网络自动解读血培养革兰氏染色
J Clin Microbiol. 2018 Feb 22;56(3). doi: 10.1128/JCM.01521-17. Print 2018 Mar.
10
Deep learning approach to bacterial colony classification.用于细菌菌落分类的深度学习方法。
PLoS One. 2017 Sep 14;12(9):e0184554. doi: 10.1371/journal.pone.0184554. eCollection 2017.