Yale University School of Medicine, United States.
Department of Emergency Medicine, Yale University School of Medicine, United States.
Am J Emerg Med. 2021 Jul;45:213-220. doi: 10.1016/j.ajem.2020.08.036. Epub 2020 Aug 28.
Topic identification can facilitate knowledge curation, discover thematic relationships, trends, and predict future direction. We aimed to determine through an unsupervised, machine learning approach to topic modeling the most common research themes in emergency medicine over the last 40 years and summarize their trends and characteristics.
We retrieved the complete reference entries including article abstracts from Ovid for all original research articles from 1980 to 2019 within emergency medicine for six widely-cited journals. Abstracts were processed through a natural language pipeline and analyzed by a latent Dirichlet allocation topic modeling algorithm for unsupervised topic discovery. Topics were further examined through trend analysis, word associations, co-occurrence metrics, and two-dimensional embeddings.
We retrieved 47,158 articles during the defined time period that were filtered to 20,528 articles for further analysis. Forty topics covering methodologic and clinical areas were discovered. These topics separated into distinct clusters when embedded in two-dimensional space and exhibited consistent patterns of interaction. We observed the greatest increase in popularity in research themes involving risk factors (0.4% to 5.2%), health utilization (1.2% to 5.0%), and ultrasound (0.7% to 3.3%), and a relative decline in research involving basic science (8.9% to 1.1%), cardiac arrest (6.5% to 2.2%), and vitals (6.3% to 1.3%) over the past 40 years. Our data show only very modest growth in mental health and substance abuse research (1.0% to 1.6%), despite ongoing crises.
Topic modeling via unsupervised machine learning applied to emergency medicine abstracts discovered coherent topics, trends, and patterns of interaction.
主题识别可以促进知识整理,发现主题关系、趋势,并预测未来方向。我们旨在通过无监督的机器学习方法对主题建模,确定过去 40 年来急诊医学中最常见的研究主题,并总结其趋势和特征。
我们从 Ovid 检索了 1980 年至 2019 年六份广泛引用的期刊中所有原始研究文章的完整参考文献条目,包括文章摘要。通过自然语言处理管道对摘要进行处理,并通过无监督主题发现的潜在狄利克雷分配主题建模算法进行分析。通过趋势分析、词联想、共现指标和二维嵌入进一步检查主题。
在定义的时间段内,我们检索到 47158 篇文章,经过过滤后,有 20528 篇文章进行了进一步分析。发现了 40 个涵盖方法学和临床领域的主题。这些主题在二维空间中嵌入时分为不同的簇,并表现出一致的相互作用模式。我们观察到涉及危险因素(0.4%至 5.2%)、卫生利用(1.2%至 5.0%)和超声(0.7%至 3.3%)的研究主题的受欢迎程度显著增加,而涉及基础科学(8.9%至 1.1%)、心搏骤停(6.5%至 2.2%)和生命体征(6.3%至 1.3%)的研究相对减少。我们的数据表明,尽管存在持续的危机,但心理健康和药物滥用研究仅略有增长(1.0%至 1.6%)。
通过无监督机器学习应用于急诊医学摘要的主题建模发现了一致的主题、趋势和相互作用模式。