Ramprasad Sanjana, Marshall Iain J, McInerney Denis Jered, Wallace Byron C
Northeastern University.
King's College London.
Proc Conf Assoc Comput Linguist Meet. 2023 May;2023:236-247.
We present , a system that aims to automatically summarize evidence presented in the set of randomized controlled trials most relevant to a given query. Building on prior work (Marshall et al., 2020), the system retrieves trial publications matching a query specifying a combination of condition, intervention(s), and outcome(s), and ranks these according to sample size and estimated study quality. The top- such studies are passed through a neural multi-document summarization system, yielding a synopsis of these trials. We consider two architectures: A standard sequence-to-sequence model based on BART (Lewis et al., 2019), and a multi-headed architecture intended to provide greater transparency to end-users. Both models produce fluent and relevant summaries of evidence retrieved for queries, but their tendency to introduce unsupported statements render them inappropriate for use in this domain at present. The proposed architecture may help users verify outputs allowing users to trace generated tokens back to inputs. The demonstration video is available at: https://vimeo.com/735605060 The prototype, source code, and model weights are available at: https://sanjanaramprasad.github.io/trials-summarizer/.
我们展示了一个系统,其旨在自动总结与给定查询最相关的一组随机对照试验中所呈现的证据。基于先前的工作(马歇尔等人,2020年),该系统检索与指定疾病、干预措施和结果组合的查询相匹配的试验出版物,并根据样本量和估计的研究质量对这些出版物进行排名。排名靠前的此类研究将通过一个神经多文档摘要系统,生成这些试验的概要。我们考虑了两种架构:一种基于BART的标准序列到序列模型(刘易斯等人,2019年),以及一种旨在为最终用户提供更高透明度的多头架构。这两种模型都能生成针对查询检索到的证据的流畅且相关的摘要,但它们引入无根据陈述的倾向使得它们目前不适用于该领域。所提出的架构可能有助于用户验证输出,允许用户将生成的令牌追溯到输入。演示视频可在以下网址获取:https://vimeo.com/735605060 原型、源代码和模型权重可在以下网址获取:https://sanjanaramprasad.github.io/trials-summarizer/ 。