Department of Software, Sejong University, Seoul, Republic of Korea.
Department of Computer Science & Engineering, School of Engineering and Computer Science, Oakland University, Rochester, MI, United States.
J Med Internet Res. 2020 Oct 23;22(10):e19810. doi: 10.2196/19810.
Automatic text summarization (ATS) enables users to retrieve meaningful evidence from big data of biomedical repositories to make complex clinical decisions. Deep neural and recurrent networks outperform traditional machine-learning techniques in areas of natural language processing and computer vision; however, they are yet to be explored in the ATS domain, particularly for medical text summarization.
Traditional approaches in ATS for biomedical text suffer from fundamental issues such as an inability to capture clinical context, quality of evidence, and purpose-driven selection of passages for the summary. We aimed to circumvent these limitations through achieving precise, succinct, and coherent information extraction from credible published biomedical resources, and to construct a simplified summary containing the most informative content that can offer a review particular to clinical needs.
In our proposed approach, we introduce a novel framework, termed Biomed-Summarizer, that provides quality-aware Patient/Problem, Intervention, Comparison, and Outcome (PICO)-based intelligent and context-enabled summarization of biomedical text. Biomed-Summarizer integrates the prognosis quality recognition model with a clinical context-aware model to locate text sequences in the body of a biomedical article for use in the final summary. First, we developed a deep neural network binary classifier for quality recognition to acquire scientifically sound studies and filter out others. Second, we developed a bidirectional long-short term memory recurrent neural network as a clinical context-aware classifier, which was trained on semantically enriched features generated using a word-embedding tokenizer for identification of meaningful sentences representing PICO text sequences. Third, we calculated the similarity between query and PICO text sequences using Jaccard similarity with semantic enrichments, where the semantic enrichments are obtained using medical ontologies. Last, we generated a representative summary from the high-scoring PICO sequences aggregated by study type, publication credibility, and freshness score.
Evaluation of the prognosis quality recognition model using a large dataset of biomedical literature related to intracranial aneurysm showed an accuracy of 95.41% (2562/2686) in terms of recognizing quality articles. The clinical context-aware multiclass classifier outperformed the traditional machine-learning algorithms, including support vector machine, gradient boosted tree, linear regression, K-nearest neighbor, and naïve Bayes, by achieving 93% (16127/17341) accuracy for classifying five categories: aim, population, intervention, results, and outcome. The semantic similarity algorithm achieved a significant Pearson correlation coefficient of 0.61 (0-1 scale) on a well-known BIOSSES dataset (with 100 pair sentences) after semantic enrichment, representing an improvement of 8.9% over baseline Jaccard similarity. Finally, we found a highly positive correlation among the evaluations performed by three domain experts concerning different metrics, suggesting that the automated summarization is satisfactory.
By employing the proposed method Biomed-Summarizer, high accuracy in ATS was achieved, enabling seamless curation of research evidence from the biomedical literature to use for clinical decision-making.
自动文本摘要(ATS)使用户能够从生物医学知识库的大数据中检索有意义的证据,从而做出复杂的临床决策。深度神经网络和循环网络在自然语言处理和计算机视觉领域超越了传统的机器学习技术;然而,它们在 ATS 领域尚未得到探索,特别是在医学文本摘要方面。
生物医学文本的 ATS 传统方法存在无法捕获临床上下文、证据质量以及为摘要有针对性地选择段落等基本问题。我们旨在通过从可靠的已发表生物医学资源中精确、简洁和连贯地提取信息,并构建包含最具信息量的内容的简化摘要,从而克服这些限制,该摘要可提供特定于临床需求的综述。
在我们提出的方法中,我们引入了一种名为 Biomed-Summarizer 的新框架,该框架提供基于质量感知的患者/问题(P)、干预(I)、比较(C)和结果(O)(PICO)的智能和上下文感知的生物医学文本摘要。 Biomed-Summarizer 将预后质量识别模型与临床上下文感知模型集成,以便在生物医学文章的正文中定位用于最终摘要的文本序列。首先,我们开发了一种深度神经网络二进制分类器用于质量识别,以获取科学合理的研究并过滤掉其他研究。其次,我们开发了一个双向长短时记忆循环神经网络作为临床上下文感知分类器,该分类器在使用词嵌入标记器生成的语义丰富特征上进行训练,以识别表示 PICO 文本序列的有意义的句子。第三,我们使用 Jaccard 相似性和语义丰富度来计算查询和 PICO 文本序列之间的相似性,其中语义丰富度是使用医学本体获得的。最后,我们根据研究类型、出版可信度和新鲜度评分,从高得分的 PICO 序列中生成代表性摘要。
使用与颅内动脉瘤相关的大型生物医学文献数据集评估预后质量识别模型,在识别高质量文章方面的准确率为 95.41%(2562/2686)。临床上下文感知多类分类器的性能优于传统的机器学习算法,包括支持向量机、梯度提升树、线性回归、K-最近邻和朴素贝叶斯,对五类的分类准确率达到 93%(16127/17341):目标、人群、干预、结果和结局。语义相似性算法在经过语义丰富处理后,在著名的 BIOSSES 数据集(100 对句子)上实现了 0.61(0-1 标度)的显著皮尔逊相关系数,比基线 Jaccard 相似性提高了 8.9%。最后,我们发现三位领域专家对不同指标的评估之间存在高度正相关,这表明自动化摘要令人满意。
通过使用提出的方法 Biomed-Summarizer,实现了 ATS 的高精度,从而能够从生物医学文献中无缝地整理研究证据,用于临床决策。