Unit of Infectious Diseases, Department of Medicine, Surgery, and Pharmacy, University of Sassari, Sassari, 07100, Italy.
PhD School in Biomedical Science, Biomedical Science Department, University of Sassari, Sassari, Italy.
AIDS Behav. 2024 Aug;28(8):2746-2754. doi: 10.1007/s10461-024-04391-2. Epub 2024 Jun 5.
With the advancement of artificial intelligence(AI), platforms like ChatGPT have gained traction in different fields, including Medicine. This study aims to evaluate the potential of ChatGPT in addressing questions related to HIV prevention and to assess its accuracy, completeness, and inclusivity. A team consisting of 15 physicians, six members from HIV communities, and three experts in gender and queer studies designed an assessment of ChatGPT. Queries were categorized into five thematic groups: general HIV information, behaviors increasing HIV acquisition risk, HIV and pregnancy, HIV testing, and the prophylaxis use. A team of medical doctors was in charge of developing questions to be submitted to ChatGPT. The other members critically assessed the generated responses regarding level of expertise, accuracy, completeness, and inclusivity. The median accuracy score was 5.5 out of 6, with 88.4% of responses achieving a score ≥ 5. Completeness had a median of 3 out of 3, while the median for inclusivity was 2 out of 3. Some thematic groups, like behaviors associated with HIV transmission and prophylaxis, exhibited higher accuracy, indicating variable performance across different topics. Issues of inclusivity were identified, notably the use of outdated terms and a lack of representation for some communities. ChatGPT demonstrates significant potential in providing accurate information on HIV-related topics. However, while responses were often scientifically accurate, they sometimes lacked the socio-political context and inclusivity essential for effective health communication. This underlines the importance of aligning AI-driven platforms with contemporary health communication strategies and ensuring the balance of accuracy and inclusivity.
随着人工智能(AI)的进步,像 ChatGPT 这样的平台在医学等不同领域得到了广泛应用。本研究旨在评估 ChatGPT 在回答与 HIV 预防相关问题方面的潜力,并评估其准确性、完整性和包容性。一个由 15 名医生、6 名来自 HIV 社区的成员以及 3 名性别和酷儿研究专家组成的团队设计了一个评估 ChatGPT 的方案。查询被分为五个主题组:一般 HIV 信息、增加 HIV 感染风险的行为、HIV 和怀孕、HIV 检测和预防用药。一组医生负责提出要提交给 ChatGPT 的问题。其他成员则批判性地评估了生成的回答在专业水平、准确性、完整性和包容性方面的表现。准确性的中位数评分为 6 分中的 5.5 分,88.4%的回答得分≥5。完整性的中位数为 3 分,包容性的中位数为 2 分。一些主题组,如与 HIV 传播和预防相关的行为,表现出更高的准确性,表明在不同主题上存在不同的表现。还发现了一些包容性问题,特别是使用过时的术语和一些社区代表性不足。ChatGPT 在提供 HIV 相关主题的准确信息方面显示出了显著的潜力。然而,虽然回复通常在科学上是准确的,但它们有时缺乏有效的健康沟通所必需的社会政治背景和包容性。这强调了将 AI 驱动的平台与当代健康沟通策略保持一致并确保准确性和包容性之间取得平衡的重要性。