Blind Brook High School, Rye Brook, NY, USA.
Department of Pediatrics, Boston Children's Hospital, Boston, MA, USA; Harvard Medical School, Boston, MA, USA.
J Biomed Inform. 2024 Sep;157:104702. doi: 10.1016/j.jbi.2024.104702. Epub 2024 Jul 29.
Although rare diseases individually have a low prevalence, they collectively affect nearly 400 million individuals around the world. On average, it takes five years for an accurate rare disease diagnosis, but many patients remain undiagnosed or misdiagnosed. As machine learning technologies have been used to aid diagnostics in the past, this study aims to test ChatGPT's suitability for rare disease diagnostic support with the enhancement provided by Retrieval Augmented Generation (RAG). RareDxGPT, our enhanced ChatGPT model, supplies ChatGPT with information about 717 rare diseases from an external knowledge resource, the RareDis Corpus, through RAG. In RareDxGPT, when a query is entered, the three documents most relevant to the query in the RareDis Corpus are retrieved. Along with the query, they are returned to ChatGPT to provide a diagnosis. Additionally, phenotypes for thirty different diseases were extracted from free text from PubMed's Case Reports. They were each entered with three different prompt types: "prompt", "prompt + explanation" and "prompt + role play." The accuracy of ChatGPT and RareDxGPT with each prompt was then measured. With "Prompt", RareDxGPT had a 40 % accuracy, while ChatGPT 3.5 got 37 % of the cases correct. With "Prompt + Explanation", RareDxGPT had a 43 % accuracy, while ChatGPT 3.5 got 23 % of the cases correct. With "Prompt + Role Play", RareDxGPT had a 40 % accuracy, while ChatGPT 3.5 got 23 % of the cases correct. To conclude, ChatGPT, especially when supplying extra domain specific knowledge, demonstrates early potential for rare disease diagnosis with adjustments.
虽然罕见病个体的患病率较低,但它们共同影响着全球近 4 亿人。平均而言,一个准确的罕见病诊断需要五年时间,但许多患者仍未被诊断或误诊。过去,机器学习技术已被用于辅助诊断,因此本研究旨在测试 ChatGPT 在罕见病诊断支持方面的适用性,以及通过检索增强生成(RAG)提供的增强功能。RareDxGPT 是我们增强的 ChatGPT 模型,通过 RAG 从外部知识资源 RareDis 语料库中为 ChatGPT 提供有关 717 种罕见病的信息。在 RareDxGPT 中,当输入查询时,通过 RAG 从 RareDis 语料库中检索与查询最相关的三个文档。与查询一起返回给 ChatGPT 以提供诊断。此外,还从 PubMed 的病例报告中提取了 30 种不同疾病的表型。将每种疾病分别输入三种不同的提示类型:“提示”、“提示+解释”和“提示+角色扮演”。然后测量 ChatGPT 和 RareDxGPT 对每种提示的准确性。对于“提示”,RareDxGPT 的准确率为 40%,而 ChatGPT 3.5 的准确率为 37%。对于“提示+解释”,RareDxGPT 的准确率为 43%,而 ChatGPT 3.5 的准确率为 23%。对于“提示+角色扮演”,RareDxGPT 的准确率为 40%,而 ChatGPT 3.5 的准确率为 23%。总之,ChatGPT 尤其在提供额外的领域特定知识时,展示了在罕见病诊断方面的早期潜力,需要进行调整。