Papuc Melissa, Scheffler Patrick
University of Arizona College of Medicine Phoenix, 475 N 5th St, Phoenix, AZ, 85004, USA.
Department of Otolaryngology, Cohen Children's Medical Center, New Hyde Park, NY, USA.
Eur Arch Otorhinolaryngol. 2025 Apr;282(4):2149-2153. doi: 10.1007/s00405-024-09148-0. Epub 2024 Dec 12.
To compare the quality and readability of patient education materials on myringotomy tubes generated by artificial intelligence chatbots with those retrieved through Google search.
Three questions were posed to ChatGPT and Google Gemini addressing "Condition," "Investigation," and "Treatment" domains. Google was queried for "Ear tubes," "Myringotomy and tubes," and "Tympanostomy tubes." Text quality was assessed using the DISCERN instrument. Readability was assessed using the Flesch-Kincaid Grade Level, Flesch-Kincaid Reading Ease scores, and the Fry Readability Graph.
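The two Flesch-based readability measures named above are simple functions of sentence, word, and syllable counts. The following is a minimal sketch of the standard published formulas, not the tool the authors used; the helper names (`count_syllables`, `text_counts`) are hypothetical, and the syllable counter is a rough vowel-group heuristic, so its scores will differ somewhat from those of dedicated readability software.

```python
import re


def count_syllables(word: str) -> int:
    """Rough heuristic (an assumption, not a dictionary lookup):
    count runs of consecutive vowels after dropping a likely silent final 'e'."""
    word = word.lower()
    if len(word) > 2 and word.endswith("e") and not word.endswith("le"):
        word = word[:-1]
    return max(1, len(re.findall(r"[aeiouy]+", word)))


def text_counts(text: str) -> tuple[int, int, int]:
    """Return (sentences, words, syllables) for a plain-text passage."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    return len(sentences), len(words), syllables


def flesch_reading_ease(sentences: int, words: int, syllables: int) -> float:
    """Flesch Reading Ease: higher scores indicate easier text."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(sentences: int, words: int, syllables: int) -> float:
    """Flesch-Kincaid Grade Level: approximate US school grade needed to read the text."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


if __name__ == "__main__":
    sample = "Ear tubes help fluid drain from the middle ear. They are placed during a short surgery."
    counts = text_counts(sample)
    print("Reading Ease:", round(flesch_reading_ease(*counts), 1))
    print("Grade Level:", round(flesch_kincaid_grade(*counts), 1))
```

Both formulas penalize long sentences and polysyllabic words, which is why a response can be accurate yet still score well above a 6th-grade reading level.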
The mean DISCERN score for websites was 52 out of 80 (SD = 13.1, median = 55.5). The mean Flesch-Kincaid Grade Level was 8 (SD = 3, median = 7.1), and the mean Flesch-Kincaid Reading Ease score was 55 (SD = 12.3, median = 57.7). For "Condition," ChatGPT and Google Gemini each had a DISCERN score of 46, with Grade Levels of 13.1 and 9.5 and Reading Ease scores of 41 and 61, respectively. For "Investigation," DISCERN scores were 46 (ChatGPT) and 66 (Google Gemini), Grade Levels were 13.9 and 12.4, and Reading Ease scores were 38.9 and 34.9. For "Treatment," ChatGPT and Google Gemini had DISCERN scores of 45 and 34, Grade Levels of 15.7 and 9.8, and Reading Ease scores of 36.2 and 53.9, respectively.
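To make the readability comparison easier to follow, the sketch below tabulates the chatbot scores reported above and compares each chatbot's average Grade Level with the website mean and the 6th-grade target. The per-chatbot averages are derived here from the three reported responses and do not appear in the abstract itself; the variable and function names are illustrative assumptions.

```python
from statistics import mean

# Values copied from the reported results: (DISCERN, Grade Level, Reading Ease)
REPORTED = {
    "ChatGPT": {
        "Condition": (46, 13.1, 41.0),
        "Investigation": (46, 13.9, 38.9),
        "Treatment": (45, 15.7, 36.2),
    },
    "Google Gemini": {
        "Condition": (46, 9.5, 61.0),
        "Investigation": (66, 12.4, 34.9),
        "Treatment": (34, 9.8, 53.9),
    },
}

WEBSITE_MEAN_GRADE = 8.0   # mean Grade Level reported for Google search results
RECOMMENDED_GRADE = 6.0    # recommended reading level cited in the conclusion


def mean_grade(source: str) -> float:
    """Average Grade Level across a chatbot's three responses (derived, not reported)."""
    return mean(scores[1] for scores in REPORTED[source].values())


def grades_above(threshold: float) -> list[tuple[str, str, float]]:
    """List every (source, domain, grade) whose Grade Level exceeds the threshold."""
    return [
        (source, domain, scores[1])
        for source, domains in REPORTED.items()
        for domain, scores in domains.items()
        if scores[1] > threshold
    ]


if __name__ == "__main__":
    for source in REPORTED:
        print(f"{source}: mean Grade Level {mean_grade(source):.1f} "
              f"(websites: {WEBSITE_MEAN_GRADE}, target: {RECOMMENDED_GRADE})")
    print("Responses above target:", len(grades_above(RECOMMENDED_GRADE)), "of 6")
```

With only three responses per chatbot, these averages are descriptive rather than a statistical comparison.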
Websites and artificial intelligence chatbots providing patient education material on myringotomy tubes are of "fair" quality, but their readability levels exceed the recommended 6th-grade level. Google search results were more readable than the artificial intelligence responses.