Valsci: an open-source, self-hostable literature review utility for automated large-batch scientific claim verification using large language models.

Authors

Edelman Brice, Skolnick Jeffrey

Affiliation

Georgia Tech Center for the Study of Systems Biology, Atlanta, GA, USA.

Publication

BMC Bioinformatics. 2025 May 28;26(1):140. doi: 10.1186/s12859-025-06159-4.

DOI: 10.1186/s12859-025-06159-4
PMID: 40437377

Abstract

BACKGROUND

The exponential growth of scientific publications poses a formidable challenge for researchers seeking to validate emerging hypotheses or synthesize existing evidence. In this paper, we introduce Valsci, an open-source, self-hostable utility that automates large-batch scientific claim verification using any OpenAI-compatible large language model. Valsci unites retrieval-augmented generation with structured bibliometric scoring and chain-of-thought prompting, enabling users to efficiently search, evaluate, and summarize evidence from the Semantic Scholar database and other academic sources. Unlike conventional standalone LLMs, which often suffer from hallucinations and unreliable citations, Valsci grounds its analyses in verifiable published findings. A guided prompt-flow approach is employed to generate query expansions, retrieve relevant excerpts, and synthesize coherent, evidence-based reports.
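
The guided prompt flow described above (query expansion, excerpt retrieval, evidence-based synthesis) can be pictured with a minimal sketch. This is not Valsci's actual code: `verify_claim`, `Excerpt`, and the prompt wording are hypothetical, and the language model and the literature search are passed in as plain callables so that no particular OpenAI-compatible client or Semantic Scholar endpoint is assumed.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Excerpt:
    """A retrieved passage and the paper it came from (hypothetical type)."""
    paper_title: str
    text: str


def verify_claim(
    claim: str,
    llm: Callable[[str], str],                # assumed: prompt in, completion out
    search: Callable[[str], List[Excerpt]],   # assumed: query in, excerpts out
    max_queries: int = 3,
) -> dict:
    """Run one claim through expand -> retrieve -> synthesize."""
    # Step 1: query expansion. The model is asked for one query per line.
    reply = llm(
        f"List up to {max_queries} search queries, one per line, "
        f"for the claim: {claim}"
    )
    queries = [q.strip() for q in reply.splitlines() if q.strip()][:max_queries]

    # Step 2: retrieval, de-duplicated by (title, text) across queries.
    seen = set()
    excerpts: List[Excerpt] = []
    for query in queries:
        for ex in search(query):
            key = (ex.paper_title, ex.text)
            if key not in seen:
                seen.add(key)
                excerpts.append(ex)

    # Step 3: synthesis. The model only sees numbered, retrieved excerpts,
    # which is what grounds the report in published text.
    evidence = "\n".join(
        f"[{i}] {ex.paper_title}: {ex.text}" for i, ex in enumerate(excerpts, 1)
    )
    report = llm(
        f"Claim: {claim}\nEvidence:\n{evidence}\n"
        "Assess the claim, citing only [n] markers from the evidence above."
    )
    return {"claim": claim, "queries": queries, "excerpts": excerpts, "report": report}
```

Injecting `llm` and `search` keeps the sketch testable with stubs; a real deployment would wrap an OpenAI-compatible client and a literature search service behind the same two signatures.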

RESULTS

Preliminary evaluations across claims from the SciFact benchmark dataset reveal that Valsci significantly outperforms base GPT-4o outputs in citation hallucination rate while maintaining a low misclassification rate. The system is highly scalable, processing hundreds of claims per hour through asynchronous parallelization.
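
The abstract attributes the throughput to asynchronous parallelization but gives no implementation details. One common way to get that behavior in Python is bounded concurrency with `asyncio`; the sketch below is an illustration under that assumption, and `verify_batch`, `verify_one`, and `max_concurrency` are hypothetical names rather than Valsci's API.

```python
import asyncio
from typing import Awaitable, Callable, List


async def verify_batch(
    claims: List[str],
    verify_one: Callable[[str], Awaitable[dict]],  # assumed per-claim coroutine
    max_concurrency: int = 8,
) -> List[dict]:
    """Verify many claims concurrently, at most max_concurrency at a time."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def guarded(claim: str) -> dict:
        # The semaphore caps in-flight requests so that LLM and search
        # rate limits are respected.
        async with semaphore:
            try:
                return await verify_one(claim)
            except Exception as exc:
                # One failing claim is recorded instead of aborting the batch.
                return {"claim": claim, "error": str(exc)}

    # gather returns results in the same order as the input claims.
    return await asyncio.gather(*(guarded(c) for c in claims))
```

Because each claim spends most of its time waiting on network calls, a single event loop can keep many claims in flight; the concurrency cap is the main tuning knob.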

CONCLUSIONS

By providing an open and transparent platform for large-batch literature verification, Valsci substantially lowers the barrier to comprehensive evidence-based reviews and fosters a more reproducible research ecosystem.
