Sarker Abeed, Yang Yuan-Chi, Al-Garadi Mohammed Ali, Abbas Aamir
Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, GA, United States.
Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, United States.
Front Digit Health. 2020 Dec 4;2:585559. doi: 10.3389/fdgth.2020.585559. eCollection 2020.
As the volume of published medical research continues to grow rapidly, staying up-to-date with the best-available research evidence regarding specific topics is becoming an increasingly challenging problem for medical experts and researchers. The current COVID19 pandemic is a good example of a topic on which research evidence is rapidly evolving. Automatic query-focused text summarization approaches may help researchers to swiftly review research evidence by presenting salient and query-relevant information from newly-published articles in a condensed manner. Typical medical text summarization approaches require domain knowledge, and the performances of such systems rely on resource-heavy medical domain-specific knowledge sources and pre-processing methods (e.g., text classification) for deriving semantic information. Consequently, these systems are often difficult to speedily customize, extend, or deploy in low-resource settings, and they are often operationally slow. In this paper, we propose a fast and simple extractive summarization approach that can be easily deployed and run, and may thus aid medical experts and researchers obtain fast access to the latest research evidence. At runtime, our system utilizes similarity measurements derived from pre-trained medical domain-specific word embeddings in addition to simple features, rather than computationally-expensive pre-processing and resource-heavy knowledge bases. Automatic evaluation using ROUGE-a summary evaluation tool-on a public dataset for evidence-based medicine shows that our system's performance, despite the simple implementation, is statistically comparable with the state-of-the-art. Extrinsic manual evaluation based on recently-released COVID19 articles demonstrates that the summarizer performance is close to human agreement, which is generally low, for extractive summarization.
Front Digit Health. 2020-12-4
Comput Methods Programs Biomed. 2020-2
Comput Methods Programs Biomed. 2017-5-27
J Med Internet Res. 2020-10-23
BMC Med Inform Decis Mak. 2020-12-15
J Biomed Inform. 2018-6-15
BMC Bioinformatics. 2024-4-16
Artif Intell Med. 2017-12-6
Int J Prev Med. 2024-10-18
Lancet. 2017-2-17
PLoS One. 2016-12-9
J Biomed Inform. 2016-2
BMC Bioinformatics. 2015-1-16
J Biomed Inform. 2014-12
J Biomed Inform. 2014-12