• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

普马:PubMed基因/细胞类型关系图谱。

PuMA: PubMed gene/cell type-relation Atlas.

作者信息

Bickmann Lucas, Sandmann Sarah, Walter Carolin, Varghese Julian

机构信息

Institute of Medical Data Science, Otto-von-Guericke University Magdeburg, Magdeburg, Germany.

Institute of Medical Informatics, University of Münster, Münster, Germany.

出版信息

BMC Bioinformatics. 2025 Jul 29;26(1):201. doi: 10.1186/s12859-025-06236-8.

DOI:10.1186/s12859-025-06236-8
PMID:40731310
Abstract

BACKGROUND

Rapid extraction and visualization of cell-specific gene expression is important for automatic cell type annotation, e.g. in single cell analysis. There is an emerging field in which tools such as curated databases or machine learning methods are used to support cell type annotation. However, complementing approaches to efficiently incorporate the latest knowledge of free-text articles from literature databases, such as PubMed, are understudied.

RESULTS

This work introduces the PubMed Gene/Cell type-Relation Atlas (PuMA) which provides a local, easy-to-use web-interface to facilitate literature-driven cell type annotation. It utilizes a pretrained machine learning based named entity recognition model in order to extract gene and cell type concepts from PubMed, links biomedical ontologies, and suggests gene to cell type relations based on a ranking score. It includes a search tool for genes and cell types, additionally providing an interactive graph visualization for exploring cross-relations. Each result is fully traceable by linking the relevant PubMed articles.

CONCLUSIONS

This work enables researchers to analyse and automatize cell type annotation based on PubMed articles. It complements manual curated marker gene databases and enables interactive visualizations. The evaluation shows that PuMA is competitive against an extensive manual curated database across three gold standard datasets and two species-mouse and human. The software framework is freely available and enables regular article imports for incremental knowledge updates.GitLab: https://imigitlab.uni-muenster.de/published/PuMA/.

摘要

背景

快速提取和可视化细胞特异性基因表达对于自动细胞类型注释非常重要,例如在单细胞分析中。目前有一个新兴领域,使用诸如经过整理的数据库或机器学习方法等工具来支持细胞类型注释。然而,对于如何有效整合来自文献数据库(如PubMed)的自由文本文章的最新知识的补充方法,研究还不够充分。

结果

这项工作引入了PubMed基因/细胞类型关系图谱(PuMA),它提供了一个本地的、易于使用的网络界面,以促进基于文献的细胞类型注释。它利用一个基于预训练机器学习的命名实体识别模型,从PubMed中提取基因和细胞类型概念,链接生物医学本体,并根据排名分数建议基因与细胞类型的关系。它包括一个基因和细胞类型搜索工具,还提供一个交互式图形可视化工具,用于探索交叉关系。通过链接相关的PubMed文章,每个结果都可以完全追溯。

结论

这项工作使研究人员能够基于PubMed文章分析和自动化细胞类型注释。它补充了手动整理的标记基因数据库,并实现了交互式可视化。评估表明,在三个金标准数据集以及小鼠和人类这两个物种上,PuMA与一个广泛的手动整理数据库相比具有竞争力。该软件框架可免费获得,并支持定期导入文章以进行增量知识更新。GitLab:https://imigitlab.uni-muenster.de/published/PuMA/ 。

相似文献

1
PuMA: PubMed gene/cell type-relation Atlas.普马:PubMed基因/细胞类型关系图谱。
BMC Bioinformatics. 2025 Jul 29;26(1):201. doi: 10.1186/s12859-025-06236-8.
2
Short-Term Memory Impairment短期记忆障碍
3
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
4
PDF Entity Annotation Tool (PEAT).PDF实体注释工具(PEAT)。
J Open Source Softw. 2025 Apr 8;10(108):5336. doi: 10.21105/joss.05336.
5
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
6
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
7
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
8
Regional cerebral blood flow single photon emission computed tomography for detection of Frontotemporal dementia in people with suspected dementia.用于检测疑似痴呆患者额颞叶痴呆的局部脑血流单光子发射计算机断层扫描
Cochrane Database Syst Rev. 2015 Jun 23;2015(6):CD010896. doi: 10.1002/14651858.CD010896.pub2.
9
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
10
The measurement of collaboration within healthcare settings: a systematic review of measurement properties of instruments.医疗机构内协作的测量:对测量工具属性的系统评价
JBI Database System Rev Implement Rep. 2016 Apr;14(4):138-97. doi: 10.11124/JBISRIR-2016-2159.

本文引用的文献

1
PubTator 3.0: an AI-powered literature resource for unlocking biomedical knowledge.PubTator 3.0:一款人工智能驱动的文献资源,用于解锁生物医学知识。
Nucleic Acids Res. 2024 Jul 5;52(W1):W540-W546. doi: 10.1093/nar/gkae235.
2
ChatGPT: The transformative influence of generative AI on science and healthcare.生成式 AI 对科学和医疗保健的变革性影响。
J Hepatol. 2024 Jun;80(6):977-980. doi: 10.1016/j.jhep.2023.07.028. Epub 2023 Aug 5.
3
MarkerGenie: an NLP-enabled text-mining system for biomedical entity relation extraction.MarkerGenie:一个用于生物医学实体关系提取的支持自然语言处理的文本挖掘系统。
Bioinform Adv. 2022 May 13;2(1):vbac035. doi: 10.1093/bioadv/vbac035. eCollection 2022.
4
CellMarker 2.0: an updated database of manually curated cell markers in human/mouse and web tools based on scRNA-seq data.CellMarker 2.0:一个更新的数据库,包含基于 scRNA-seq 数据的人类/小鼠细胞标志物的人工注释和网络工具。
Nucleic Acids Res. 2023 Jan 6;51(D1):D870-D876. doi: 10.1093/nar/gkac947.
5
BERN2: an advanced neural biomedical named entity recognition and normalization tool.BERN2:一种先进的神经生物医学命名实体识别和标准化工具。
Bioinformatics. 2022 Oct 14;38(20):4837-4839. doi: 10.1093/bioinformatics/btac598.
6
CellMeSH: probabilistic cell-type identification using indexed literature.CellMeSH:基于索引文献的概率细胞类型识别。
Bioinformatics. 2022 Feb 7;38(5):1393-1402. doi: 10.1093/bioinformatics/btab834.
7
Automated methods for cell type annotation on scRNA-seq data.单细胞RNA测序(scRNA-seq)数据细胞类型注释的自动化方法。
Comput Struct Biotechnol J. 2021 Jan 19;19:961-969. doi: 10.1016/j.csbj.2021.01.015. eCollection 2021.
8
Named Entity Recognition and Relation Detection for Biomedical Information Extraction.用于生物医学信息提取的命名实体识别与关系检测
Front Cell Dev Biol. 2020 Aug 28;8:673. doi: 10.3389/fcell.2020.00673. eCollection 2020.
9
Conserved NPPB+ Border Zone Switches From MEF2- to AP-1-Driven Gene Program.保守的 NPPB+ 边缘区从 MEF2 驱动的基因程序切换为 AP-1 驱动的基因程序。
Circulation. 2019 Sep 9;140(10):864-879. doi: 10.1161/CIRCULATIONAHA.118.038944. Epub 2019 Jul 1.
10
PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data.PanglaoDB:一个用于探索小鼠和人类单细胞 RNA 测序数据的网络服务器。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz046.