• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于大语言模型的代谢组学研究图谱。

A Large Language Model-Powered Map of Metabolomics Research.

作者信息

Bifarin Olatomiwa O, Yelluru Varun S, Simhadri Aditya, Fernández Facundo M

机构信息

School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States.

School of Computer Science, Georgia Institute of Technology, Atlanta, Georgia 30332, United States.

出版信息

bioRxiv. 2025 Mar 19:2025.03.18.643696. doi: 10.1101/2025.03.18.643696.

DOI:10.1101/2025.03.18.643696
PMID:40166287
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11957067/
Abstract

We present a comprehensive map of the metabolomics research landscape, synthesizing insights from over 80,000 publications. Using PubMedBERT, we transformed abstracts into 768-dimensional embeddings that capture the nuanced thematic structure of the field. Dimensionality reduction with t-SNE revealed distinct clusters corresponding to key domains such as analytical chemistry, plant biology, pharmacology, and clinical diagnostics. In addition, a neural topic modeling pipeline refined with GPT-4o mini reclassified the corpus into 20 distinct topics-ranging from "Plant Stress Response Mechanisms" and "NMR Spectroscopy Innovations" to "COVID-19 Metabolomic and Immune Responses." Temporal analyses further highlight trends including the rise of deep learning methods post-2015 and a continued focus on biomarker discovery. Integration of metadata such as publication statistics and sample sizes provide additional context to these evolving research dynamics. An interactive web application (https://metascape.streamlit.app/) enables dynamic exploration of these insights. Overall, this study offers a robust framework for literature synthesis that empowers researchers, clinicians, and policymakers to identify emerging research trajectories and address critical challenges in metabolomics, while also sharing our perspectives on key trends shaping the field.

摘要

我们展示了代谢组学研究领域的全面图谱,综合了来自8万多篇出版物的见解。利用PubMedBERT,我们将摘要转化为768维嵌入向量,以捕捉该领域细微的主题结构。通过t-SNE进行降维,揭示了与分析化学、植物生物学、药理学和临床诊断等关键领域相对应的不同聚类。此外,用GPT-4o mini优化的神经主题建模管道将语料库重新分类为20个不同的主题,从“植物应激反应机制”和“核磁共振光谱创新”到“COVID-19代谢组学和免疫反应”。时间分析进一步突出了一些趋势,包括2015年后深度学习方法的兴起以及对生物标志物发现的持续关注。诸如出版物统计和样本量等元数据的整合为这些不断发展的研究动态提供了更多背景信息。一个交互式网络应用程序(https://metascape.streamlit.app/)能够动态探索这些见解。总体而言,本研究提供了一个强大的文献综合框架,使研究人员、临床医生和政策制定者能够识别代谢组学中新兴的研究轨迹并应对关键挑战,同时我们也分享了对塑造该领域的关键趋势的看法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/a19c7af694f3/nihpp-2025.03.18.643696v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/f584f6a97a1d/nihpp-2025.03.18.643696v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/f79f1717a5b7/nihpp-2025.03.18.643696v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/e1a47c585719/nihpp-2025.03.18.643696v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/8bc9b19c0442/nihpp-2025.03.18.643696v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/1d0cbb9447ea/nihpp-2025.03.18.643696v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/a19c7af694f3/nihpp-2025.03.18.643696v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/f584f6a97a1d/nihpp-2025.03.18.643696v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/f79f1717a5b7/nihpp-2025.03.18.643696v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/e1a47c585719/nihpp-2025.03.18.643696v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/8bc9b19c0442/nihpp-2025.03.18.643696v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/1d0cbb9447ea/nihpp-2025.03.18.643696v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa9d/11957067/a19c7af694f3/nihpp-2025.03.18.643696v1-f0006.jpg

相似文献

1
A Large Language Model-Powered Map of Metabolomics Research.基于大语言模型的代谢组学研究图谱。
bioRxiv. 2025 Mar 19:2025.03.18.643696. doi: 10.1101/2025.03.18.643696.
2
A Large Language Model-Powered Map of Metabolomics Research.一个由大语言模型驱动的代谢组学研究图谱。
Anal Chem. 2025 Jul 15;97(27):14088-14096. doi: 10.1021/acs.analchem.5c01672. Epub 2025 Jul 3.
3
Short-Term Memory Impairment短期记忆障碍
4
Improving the usability of open health service delivery simulation models using Python and web apps.使用Python和网络应用程序提高开放式医疗服务提供模拟模型的可用性。
NIHR Open Res. 2023 Dec 15;3:48. doi: 10.3310/nihropenres.13467.1. eCollection 2023.
5
Factors that impact on the use of mechanical ventilation weaning protocols in critically ill adults and children: a qualitative evidence-synthesis.影响重症成人和儿童机械通气撤机方案使用的因素:一项定性证据综合分析
Cochrane Database Syst Rev. 2016 Oct 4;10(10):CD011812. doi: 10.1002/14651858.CD011812.pub2.
6
Comparison of self-administered survey questionnaire responses collected using mobile apps versus other methods.使用移动应用程序与其他方法收集的自我管理调查问卷回复的比较。
Cochrane Database Syst Rev. 2015 Jul 27;2015(7):MR000042. doi: 10.1002/14651858.MR000042.pub2.
7
A Spectrum of Understanding: A Qualitative Exploration of Autistic Adults' Understandings and Perceptions of Friendship(s).理解的光谱:对自闭症成年人对友谊的理解与认知的质性探索
Autism Adulthood. 2024 Dec 2;6(4):438-450. doi: 10.1089/aut.2023.0051. eCollection 2024 Dec.
8
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
9
A bibliometric analysis of research trends in mesenchymal stem cell therapy for neonatal bronchopulmonary dysplasia: 2004-2024.2004 - 2024年新生儿支气管肺发育不良间充质干细胞治疗研究趋势的文献计量分析
Front Pediatr. 2025 Jun 3;13:1558301. doi: 10.3389/fped.2025.1558301. eCollection 2025.
10
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.

本文引用的文献

1
The landscape of biomedical research.生物医学研究的全景
Patterns (N Y). 2024 Apr 9;5(6):100968. doi: 10.1016/j.patter.2024.100968. eCollection 2024 Jun 14.
2
Metabolomics for Clinical Biomarker Discovery and Therapeutic Target Identification.代谢组学在临床生物标志物发现和治疗靶点鉴定中的应用。
Molecules. 2024 May 8;29(10):2198. doi: 10.3390/molecules29102198.
3
Challenges in the Metabolomics-Based Biomarker Validation Pipeline.基于代谢组学的生物标志物验证流程中的挑战。
Metabolites. 2024 Apr 3;14(4):200. doi: 10.3390/metabo14040200.
4
Plant Metabolomics: Current Initiatives and Future Prospects.植物代谢组学:当前进展与未来展望
Curr Issues Mol Biol. 2023 Nov 8;45(11):8894-8906. doi: 10.3390/cimb45110558.
5
Small molecule metabolites: discovery of biomarkers and therapeutic targets.小分子代谢物:生物标志物和治疗靶点的发现。
Signal Transduct Target Ther. 2023 Mar 20;8(1):132. doi: 10.1038/s41392-023-01399-3.
6
Metabolomics as a powerful tool for diagnostic, pronostic and drug intervention analysis in COVID-19.代谢组学作为COVID-19诊断、预后和药物干预分析的有力工具。
Front Mol Biosci. 2023 Feb 15;10:1111482. doi: 10.3389/fmolb.2023.1111482. eCollection 2023.
7
Meta-Analysis Reveals Both the Promises and the Challenges of Clinical Metabolomics.荟萃分析揭示了临床代谢组学的前景与挑战。
Cancers (Basel). 2022 Aug 18;14(16):3992. doi: 10.3390/cancers14163992.
8
Advances in mass spectrometry-based metabolomics for investigation of metabolites.基于质谱的代谢组学在代谢物研究中的进展。
RSC Adv. 2018 Jun 19;8(40):22335-22350. doi: 10.1039/c8ra01574k.
9
Deep learning meets metabolomics: a methodological perspective.深度学习与代谢组学的交汇:方法学视角。
Brief Bioinform. 2021 Mar 22;22(2):1531-1542. doi: 10.1093/bib/bbaa204.
10
NMR Spectroscopy for Metabolomics Research.用于代谢组学研究的核磁共振光谱法
Metabolites. 2019 Jun 27;9(7):123. doi: 10.3390/metabo9070123.