Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin 14191, Germany.
Science of Intelligence, Research Cluster of Excellence, Berlin 10587, Germany.
Proc Natl Acad Sci U S A. 2023 Aug 22;120(34):e2221473120. doi: 10.1073/pnas.2221473120. Epub 2023 Aug 14.
Collective intelligence has emerged as a powerful mechanism to boost decision accuracy across many domains, such as geopolitical forecasting, investment, and medical diagnostics. However, collective intelligence has been mostly applied to relatively simple decision tasks (e.g., binary classifications). Applications in more open-ended tasks with a much larger problem space, such as emergency management or general medical diagnostics, are largely lacking, due to the challenge of integrating unstandardized inputs from different crowd members. Here, we present a fully automated approach for harnessing collective intelligence in the domain of general medical diagnostics. Our approach leverages semantic knowledge graphs, natural language processing, and the SNOMED CT medical ontology to overcome a major hurdle to collective intelligence in open-ended medical diagnostics, namely to identify the intended diagnosis from unstructured text. We tested our method on 1,333 medical cases diagnosed on a medical crowdsourcing platform: The Human Diagnosis Project. Each case was independently rated by ten diagnosticians. Comparing the diagnostic accuracy of single diagnosticians with the collective diagnosis of differently sized groups, we find that our method substantially increases diagnostic accuracy: While single diagnosticians achieved 46% accuracy, pooling the decisions of ten diagnosticians increased this to 76%. Improvements occurred across medical specialties, chief complaints, and diagnosticians' tenure levels. Our results show the life-saving potential of tapping into the collective intelligence of the global medical community to reduce diagnostic errors and increase patient safety.
集体智慧已成为提高多个领域决策准确性的有力机制,例如地缘政治预测、投资和医疗诊断。然而,集体智慧主要应用于相对简单的决策任务(例如,二进制分类)。由于难以整合来自不同人群成员的非标准化输入,在更开放的任务(例如应急管理或一般医疗诊断)中的应用则相对较少。在这里,我们提出了一种在一般医疗诊断领域利用集体智慧的全自动方法。我们的方法利用语义知识图谱、自然语言处理和 SNOMED CT 医疗本体,克服了在开放式医疗诊断中利用集体智慧的一个主要障碍,即从非结构化文本中识别预期的诊断。我们在一个医疗众包平台上对 1333 个医疗病例进行了测试:人类诊断项目。每个病例都由十位诊断员进行独立评分。将单个诊断员的诊断准确性与不同大小群组的集体诊断进行比较,我们发现我们的方法大大提高了诊断准确性:虽然单个诊断员的准确率为 46%,但将十个诊断员的决策汇总后,准确率提高到了 76%。改进发生在多个医学专业、主要投诉和诊断员的任期水平。我们的结果表明,利用全球医疗社区的集体智慧来减少诊断错误并提高患者安全性具有挽救生命的潜力。