• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

揭示语言模型在化学研究问题解答中的力量。

Unveiling the power of language models in chemical research question answering.

作者信息

Chen Xiuying, Wang Tairan, Guo Taicheng, Guo Kehan, Zhou Juexiao, Li Haoyang, Song Zirui, Gao Xin, Zhang Xiangliang

机构信息

Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, UAE.

King Abdullah University of Science and Technology, Jeddah, Saudi Arabia.

出版信息

Commun Chem. 2025 Jan 5;8(1):4. doi: 10.1038/s42004-024-01394-x.

DOI:10.1038/s42004-024-01394-x
PMID:39757259
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11701117/
Abstract

While the abilities of language models are thoroughly evaluated in areas like general domains and biomedicine, academic chemistry remains less explored. Chemical QA tools also play a crucial role in both education and research by effectively translating complex chemical information into an understandable format. Addressing this gap, we introduce ScholarChemQA, a large-scale QA dataset constructed from chemical papers. Specifically, the questions are from paper titles with a question mark, and the multi-choice answers are reasoned out based on the corresponding abstracts. This dataset reflects typical real-world challenges, including an imbalanced data distribution and a substantial amount of unlabeled data that can be potentially useful. Correspondingly, we introduce a ChemMatch model, specifically designed to effectively answer chemical questions by fully leveraging our collected data. Experiments show that Large Language Models (LLMs) still have significant room for improvement in the field of chemistry. Moreover, ChemMatch significantly outperforms recent similar-scale baselines: https://github.com/iriscxy/chemmatch .

摘要

虽然语言模型的能力在通用领域和生物医学等领域得到了充分评估,但学术化学领域的探索仍较少。化学问答工具通过有效地将复杂的化学信息转化为可理解的格式,在教育和研究中也发挥着关键作用。为了填补这一空白,我们引入了ScholarChemQA,这是一个从化学论文构建的大规模问答数据集。具体来说,问题来自带有问号的论文标题,多项选择题答案是根据相应摘要推理得出的。该数据集反映了典型的现实世界挑战,包括数据分布不均衡以及大量可能有用的未标记数据。相应地,我们引入了ChemMatch模型,专门设计用于通过充分利用我们收集的数据来有效回答化学问题。实验表明,大语言模型(LLMs)在化学领域仍有很大的改进空间。此外,ChemMatch显著优于最近类似规模的基线:https://github.com/iriscxy/chemmatch 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/e1d10054578d/42004_2024_1394_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/dedf7ab08c9c/42004_2024_1394_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/3e27deaac510/42004_2024_1394_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/41cdee8b2b7f/42004_2024_1394_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/e153c8fc2d47/42004_2024_1394_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/310578893afb/42004_2024_1394_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/7da5991026a7/42004_2024_1394_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/e1d10054578d/42004_2024_1394_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/dedf7ab08c9c/42004_2024_1394_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/3e27deaac510/42004_2024_1394_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/41cdee8b2b7f/42004_2024_1394_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/e153c8fc2d47/42004_2024_1394_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/310578893afb/42004_2024_1394_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/7da5991026a7/42004_2024_1394_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c45d/11701117/e1d10054578d/42004_2024_1394_Fig7_HTML.jpg

相似文献

1
Unveiling the power of language models in chemical research question answering.揭示语言模型在化学研究问题解答中的力量。
Commun Chem. 2025 Jan 5;8(1):4. doi: 10.1038/s42004-024-01394-x.
2
MedChatZH: A tuning LLM for traditional Chinese medicine consultations.医聊 ChatZH:一个用于中医咨询的调优大语言模型。
Comput Biol Med. 2024 Apr;172:108290. doi: 10.1016/j.compbiomed.2024.108290. Epub 2024 Mar 13.
3
SemBioNLQA: A semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions.SemBioNLQA:一个语义生物医学问答系统,用于检索自然语言问题的准确和理想答案。
Artif Intell Med. 2020 Jan;102:101767. doi: 10.1016/j.artmed.2019.101767. Epub 2019 Nov 28.
4
A question-entailment approach to question answering.问题蕴涵方法在问答中的应用。
BMC Bioinformatics. 2019 Oct 22;20(1):511. doi: 10.1186/s12859-019-3119-4.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Stratified Evaluation of GPT's Question Answering in Surgery Reveals Artificial Intelligence (AI) Knowledge Gaps.对GPT在外科手术中问答的分层评估揭示了人工智能(AI)的知识差距。
Cureus. 2023 Nov 14;15(11):e48788. doi: 10.7759/cureus.48788. eCollection 2023 Nov.
7
Bridging the Gap Between Consumers' Medication Questions and Trusted Answers.弥合消费者用药问题与可靠答案之间的差距。
Stud Health Technol Inform. 2019 Aug 21;264:25-29. doi: 10.3233/SHTI190176.
8
Optimizing biomedical information retrieval with a keyword frequency-driven prompt enhancement strategy.基于关键词频率驱动的提示增强策略优化生物医学信息检索
BMC Bioinformatics. 2024 Aug 27;25(1):281. doi: 10.1186/s12859-024-05902-7.
9
Learning to Answer Visual Questions From Web Videos.学习从网络视频中回答视觉问题。
IEEE Trans Pattern Anal Mach Intell. 2025 May;47(5):3202-3218. doi: 10.1109/TPAMI.2022.3173208. Epub 2025 Apr 8.
10
A question-answering framework for automated abstract screening using large language models.基于大语言模型的自动文摘筛选的问答框架。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1939-1952. doi: 10.1093/jamia/ocae166.

引用本文的文献

1
Recent Progress of Artificial Intelligence Application in Polymer Materials.人工智能在高分子材料中的应用研究进展
Polymers (Basel). 2025 Jun 16;17(12):1667. doi: 10.3390/polym17121667.

本文引用的文献

1
Exhaustive local chemical space exploration using a transformer model.使用变压器模型进行详尽的局部化学空间探索。
Nat Commun. 2024 Aug 25;15(1):7315. doi: 10.1038/s41467-024-51672-4.
2
Hidden flaws behind expert-level accuracy of multimodal GPT-4 vision in medicine.医学领域多模态GPT-4视觉专家级准确性背后的隐藏缺陷。
NPJ Digit Med. 2024 Jul 23;7(1):190. doi: 10.1038/s41746-024-01185-7.
3
OpenMedLM: prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models.OpenMedLM:在使用开源大语言模型进行医学问答时,基于提示的工程学可以胜过微调。
Sci Rep. 2024 Jun 19;14(1):14156. doi: 10.1038/s41598-024-64827-6.
4
Augmenting large language models with chemistry tools.用化学工具增强大语言模型。
Nat Mach Intell. 2024;6(5):525-535. doi: 10.1038/s42256-024-00832-8. Epub 2024 May 8.
5
Emerging opportunities of using large language models for translation between drug molecules and indications.利用大型语言模型在药物分子和适应症之间进行翻译的新兴机会。
Sci Rep. 2024 May 10;14(1):10738. doi: 10.1038/s41598-024-61124-0.
6
Question-answering system extracts information on injection drug use from clinical notes.问答系统从临床记录中提取有关注射吸毒的信息。
Commun Med (Lond). 2024 Apr 3;4(1):61. doi: 10.1038/s43856-024-00470-6.
7
Large language model for molecular chemistry.用于分子化学的大语言模型。
Nat Comput Sci. 2023 Jan;3(1):5. doi: 10.1038/s43588-023-00399-1.
8
Improving model fairness in image-based computer-aided diagnosis.提高基于图像的计算机辅助诊断模型的公平性。
Nat Commun. 2023 Oct 6;14(1):6261. doi: 10.1038/s41467-023-41974-4.
9
The SciQA Scientific Question Answering Benchmark for Scholarly Knowledge.SciQA 学术知识科学问答基准
Sci Rep. 2023 May 4;13(1):7240. doi: 10.1038/s41598-023-33607-z.
10
BioASQ-QA: A manually curated corpus for Biomedical Question Answering.BioASQ-QA:用于生物医学问答的人工策论文本语料库。
Sci Data. 2023 Mar 27;10(1):170. doi: 10.1038/s41597-023-02068-4.