• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

1997 - 2024年蛋白质组学中机器学习的进展与趋势:文献计量分析

Progress and trends on machine learning in proteomics during 1997-2024: a bibliometric analysis.

作者信息

Tan Chao, Liu Hao, Zhang Zhen, Liu Xinyu, Ai Yinquan, Wu Xiumin, Jian Enlin, Song Yongyan, Yang Jin

机构信息

Clinical Medical College & Affiliated Hospital & College of Basic Medicine, Chengdu University, Chengdu, China.

出版信息

Front Med (Lausanne). 2025 Aug 15;12:1594442. doi: 10.3389/fmed.2025.1594442. eCollection 2025.

DOI:10.3389/fmed.2025.1594442
PMID:40901512
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12401104/
Abstract

OBJECTIVE

Despite growing interest in the application of machine learning (ML) in proteomics, a comprehensive and systematic mapping of this research domain has been lacking. This study addresses this gap by conducting the first large-scale bibliometric analysis focused exclusively on ML-driven proteomics, aiming to elucidate its knowledge structure, development trajectory, and emerging research trends.

METHODS

A total of 5,156 publications from the Web of Science Core Collection (1997-2024) were retrieved and analyzed. Bibliometric tools including CiteSpace 6.4.R1, VOSviewer 1.6.18, Scimago Graphica, and the R package bibliometrix were used to extract and visualize key bibliometric indicators. After data cleaning and de-duplication, analyses were conducted on keyword co-occurrence, citation networks, leading journals, influential authors, and institutional collaboration patterns to construct a comprehensive landscape of ML applications in proteomics.

RESULTS

The number of publications has grown exponentially since 2010, with an average annual growth rate of 12.53% and a notable surge of 65.14% occurring between 2019 and 2020. The United States emerged as the most productive country, while the Chinese Academy of Sciences led among institutions. AlphaFold2-related research received the highest citations, reflecting the transformative role of deep learning in protein structure prediction. Thematic clustering revealed key research foci, including deep learning algorithms, protein-protein interaction prediction, and integrative multi-omics analysis. The field is characterized by strong interdisciplinary convergence, involving computer science, molecular biology, and clinical research. High-impact journals and influential authors were also identified, providing benchmarks for academic influence and collaboration.

CONCLUSION

This study offers the first comprehensive bibliometric analysis of ML in proteomics, revealing key themes such as deep learning, pretrained models, and multi-omics integration. Future efforts should focus on building interpretable models, enhancing cross-disciplinary collaboration, and ensuring secure, standardized data use to advance precision medicine.

SYSTEMATIC REVIEW REGISTRATION

https://doi.org/10.17605/OSF.IO/F4WUG.

摘要

目的

尽管机器学习(ML)在蛋白质组学中的应用越来越受到关注,但该研究领域仍缺乏全面系统的图谱。本研究通过开展首次专门针对ML驱动的蛋白质组学的大规模文献计量分析来填补这一空白,旨在阐明其知识结构、发展轨迹和新兴研究趋势。

方法

从科学引文索引核心合集(1997 - 2024年)中检索并分析了总共5156篇出版物。使用包括CiteSpace 6.4.R1、VOSviewer 1.6.18、Scimago Graphica和R包bibliometrix在内的文献计量工具来提取和可视化关键文献计量指标。在数据清理和去重后,对关键词共现、引文网络、领先期刊、有影响力的作者和机构合作模式进行了分析,以构建ML在蛋白质组学中应用的全面图景。

结果

自2010年以来,出版物数量呈指数增长,年均增长率为12.53%,2019年至2020年间显著增长了65.14%。美国是产出最多的国家,而中国科学院在机构中领先。与AlphaFold2相关的研究获得的引用最多,反映了深度学习在蛋白质结构预测中的变革性作用。主题聚类揭示了关键研究重点,包括深度学习算法、蛋白质 - 蛋白质相互作用预测和综合多组学分析。该领域的特点是具有很强的跨学科融合,涉及计算机科学、分子生物学和临床研究。还确定了高影响力期刊和有影响力的作者,为学术影响力和合作提供了基准。

结论

本研究首次对蛋白质组学中的ML进行了全面的文献计量分析,揭示了深度学习、预训练模型和多组学整合等关键主题。未来的工作应集中在构建可解释模型、加强跨学科合作以及确保安全、标准化的数据使用,以推进精准医学。

系统评价注册

https://doi.org/10.17605/OSF.IO/F4WUG。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/3c913db26b6f/fmed-12-1594442-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/4d85525ddf95/fmed-12-1594442-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/5e2fcabdcd30/fmed-12-1594442-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/2689bf2674ee/fmed-12-1594442-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/c2092a883d56/fmed-12-1594442-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/a355c02dfa5c/fmed-12-1594442-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/293893fc6108/fmed-12-1594442-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/fb240bee90f3/fmed-12-1594442-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/942d2c509dd8/fmed-12-1594442-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/9079ea767966/fmed-12-1594442-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/3c913db26b6f/fmed-12-1594442-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/4d85525ddf95/fmed-12-1594442-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/5e2fcabdcd30/fmed-12-1594442-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/2689bf2674ee/fmed-12-1594442-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/c2092a883d56/fmed-12-1594442-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/a355c02dfa5c/fmed-12-1594442-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/293893fc6108/fmed-12-1594442-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/fb240bee90f3/fmed-12-1594442-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/942d2c509dd8/fmed-12-1594442-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/9079ea767966/fmed-12-1594442-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f1/12401104/3c913db26b6f/fmed-12-1594442-g010.jpg

相似文献

1
Progress and trends on machine learning in proteomics during 1997-2024: a bibliometric analysis.1997 - 2024年蛋白质组学中机器学习的进展与趋势:文献计量分析
Front Med (Lausanne). 2025 Aug 15;12:1594442. doi: 10.3389/fmed.2025.1594442. eCollection 2025.
2
Driving innovations in cancer research through spatial metabolomics: a bibliometric review of trends and hotspot.通过空间代谢组学推动癌症研究创新:趋势与热点的文献计量学综述
Front Immunol. 2025 Jun 10;16:1589943. doi: 10.3389/fimmu.2025.1589943. eCollection 2025.
3
Data-driven trends in critical care informatics: a bibliometric analysis of global collaborations using the MIMIC database (2004-2024).重症监护信息学中数据驱动的趋势:使用MIMIC数据库(2004 - 2024年)对全球合作的文献计量分析
Comput Biol Med. 2025 Sep;195:110670. doi: 10.1016/j.compbiomed.2025.110670. Epub 2025 Jun 27.
4
Machine learning based insights into cardiomyopathy and heart failure research: a bibliometric analysis from 2005 to 2024.基于机器学习的心肌病和心力衰竭研究洞察:2005年至2024年的文献计量分析
Front Med (Lausanne). 2025 Jul 25;12:1602077. doi: 10.3389/fmed.2025.1602077. eCollection 2025.
5
Research status, hotspots and perspectives of artificial intelligence applied to pain management: a bibliometric and visual analysis.人工智能应用于疼痛管理的研究现状、热点与展望:一项文献计量学与可视化分析
Updates Surg. 2025 Jun 28. doi: 10.1007/s13304-025-02296-w.
6
A bibliometric analysis of research trends in mesenchymal stem cell therapy for neonatal bronchopulmonary dysplasia: 2004-2024.2004 - 2024年新生儿支气管肺发育不良间充质干细胞治疗研究趋势的文献计量分析
Front Pediatr. 2025 Jun 3;13:1558301. doi: 10.3389/fped.2025.1558301. eCollection 2025.
7
Artificial intelligence in joint arthroplasty: A bibliometric analysis of global research trends (2001-2025).关节置换术中的人工智能:全球研究趋势的文献计量分析(2001 - 2025年)
Medicine (Baltimore). 2025 Aug 29;104(35):e44136. doi: 10.1097/MD.0000000000044136.
8
Research status and hotspots of oral frailty in older adults: a bibliometric analysis from 2013 to 2024.老年人口腔脆弱的研究现状与热点:2013年至2024年的文献计量分析
Front Oral Health. 2025 Jul 14;6:1533159. doi: 10.3389/froh.2025.1533159. eCollection 2025.
9
Application of non-invasive imaging in myocardial infarction: a bibliometric analysis from January 2003 to December 2022.非侵入性成像在心肌梗死中的应用:2003年1月至2022年12月的文献计量分析
Quant Imaging Med Surg. 2025 Jul 1;15(7):6340-6359. doi: 10.21037/qims-24-878. Epub 2025 Jun 30.
10
Knowledge graph and bibliometric analysis of inflammatory indicators in ovarian cancer.卵巢癌炎症指标的知识图谱与文献计量分析
Front Oncol. 2025 Jun 30;15:1533537. doi: 10.3389/fonc.2025.1533537. eCollection 2025.

本文引用的文献

1
Applying machine learning to high-dimensional proteomics datasets for the identification of Alzheimer's disease biomarkers.将机器学习应用于高维蛋白质组学数据集以鉴定阿尔茨海默病生物标志物。
Fluids Barriers CNS. 2025 Mar 3;22(1):23. doi: 10.1186/s12987-025-00634-z.
2
Mass-spectrometry-based proteomics: from single cells to clinical applications.基于质谱的蛋白质组学:从单细胞到临床应用
Nature. 2025 Feb;638(8052):901-911. doi: 10.1038/s41586-025-08584-0. Epub 2025 Feb 26.
3
Proteomics-on-a-Chip - Microfluidics meets proteomics.芯片蛋白质组学——微流控技术与蛋白质组学的交汇
Biosens Bioelectron. 2025 Apr 1;273:117122. doi: 10.1016/j.bios.2024.117122. Epub 2025 Jan 9.
4
Application of machine learning for mass spectrometry-based multi-omics in thyroid diseases.机器学习在基于质谱的甲状腺疾病多组学中的应用。
Front Mol Biosci. 2024 Dec 17;11:1483326. doi: 10.3389/fmolb.2024.1483326. eCollection 2024.
5
ProteoNet: A CNN-based framework for analyzing proteomics MS-RGB images.ProteoNet:一种基于卷积神经网络的蛋白质组学MS-RGB图像分析框架。
iScience. 2024 Nov 12;27(12):111362. doi: 10.1016/j.isci.2024.111362. eCollection 2024 Dec 20.
6
The STRING database in 2025: protein networks with directionality of regulation.2025年的STRING数据库:具有调控方向性的蛋白质网络。
Nucleic Acids Res. 2025 Jan 6;53(D1):D730-D737. doi: 10.1093/nar/gkae1113.
7
Prediction of viral oncoproteins through the combination of generative adversarial networks and machine learning techniques.通过生成对抗网络和机器学习技术的结合来预测病毒致癌蛋白。
Sci Rep. 2024 Nov 7;14(1):27108. doi: 10.1038/s41598-024-77028-y.
8
Machine learning optimized DriverDetect software for high precision prediction of deleterious mutations in human cancers.机器学习优化的 DriverDetect 软件,用于高精度预测人类癌症中的有害突变。
Sci Rep. 2024 Sep 30;14(1):22618. doi: 10.1038/s41598-024-71422-2.
9
Transforming Clinical Research: The Power of High-Throughput Omics Integration.变革临床研究:高通量组学整合的力量
Proteomes. 2024 Sep 6;12(3):25. doi: 10.3390/proteomes12030025.
10
Application of Proteomics and Machine Learning Methods to Study the Pathogenesis of Diabetic Nephropathy and Screen Urinary Biomarkers.应用蛋白质组学和机器学习方法研究糖尿病肾病的发病机制及筛选尿生物标志物
J Proteome Res. 2024 Aug 2;23(8):3612-3625. doi: 10.1021/acs.jproteome.4c00267. Epub 2024 Jul 1.