• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

OmixLitMiner:一种生物信息学工具,用于通过文献检索和数据挖掘从 'Omics 数据中优先考虑生物学线索。

OmixLitMiner: A Bioinformatics Tool for Prioritizing Biological Leads from 'Omics Data Using Literature Retrieval and Data Mining.

机构信息

Bowel Cancer and Biomarker Research, Kolling Institute, The University of Sydney, Sydney 2065, Australia.

Department of Molecular Sciences, Macquarie University, Sydney 2109, Australia.

出版信息

Int J Mol Sci. 2020 Feb 19;21(4):1374. doi: 10.3390/ijms21041374.

DOI:10.3390/ijms21041374
PMID:32092871
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7073124/
Abstract

Proteomics and genomics discovery experiments generate increasingly large result tables, necessitating more researcher time to convert the biological data into new knowledge. Literature review is an important step in this process and can be tedious for large scale experiments. An informed and strategic decision about which biomolecule targets should be pursued for follow-up experiments thus remains a considerable challenge. To streamline and formalise this process of literature retrieval and analysis of discovery based 'omics data and as a decision-facilitating support tool for follow-up experiments we present OmixLitMiner, a package written in the computational language R. The tool automates the retrieval of literature from PubMed based on UniProt protein identifiers, gene names and their synonyms, combined with user defined contextual keyword search (i.e., gene ontology based). The search strategy is programmed to allow either strict or more lenient literature retrieval and the outputs are assigned to three categories describing how well characterized a regulated gene or protein is. The category helps to meet a decision, regarding which gene/protein follow-up experiments may be performed for gaining new knowledge and to exclude following already known biomarkers. We demonstrate the tool's usefulness in this retrospective study assessing three cancer proteomics and one cancer genomics publication. Using the tool, we were able to corroborate most of the decisions in these papers as well as detect additional biomolecule leads that may be valuable for future research.

摘要

蛋白质组学和基因组学发现实验产生的结果表越来越大,需要研究人员花费更多时间将生物数据转化为新知识。文献综述是这个过程中的一个重要步骤,但对于大规模实验来说可能很繁琐。因此,对于哪些生物分子靶标应该进行后续实验,做出明智和战略性的决策仍然是一个相当大的挑战。为了简化和规范化基于发现的“omics”数据的文献检索和分析过程,并为后续实验提供决策支持工具,我们提出了 OmixLitMiner,这是一个用计算语言 R 编写的软件包。该工具可以根据 UniProt 蛋白质标识符、基因名称及其同义词自动从 PubMed 中检索文献,同时结合用户定义的上下文关键字搜索(即基于基因本体论的搜索)。搜索策略被编程为允许严格或更宽松的文献检索,输出结果分为三个类别,描述调控基因或蛋白质的特征描述程度。该类别有助于做出决策,即选择哪些基因/蛋白质进行后续实验,以获得新知识,并排除已有的生物标志物。我们在这项回顾性研究中评估了三篇癌症蛋白质组学和一篇癌症基因组学出版物,展示了该工具的实用性。使用该工具,我们能够证实这些论文中的大多数决策,并发现其他可能对未来研究有价值的生物分子线索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72e7/7073124/11197c1e8eb8/ijms-21-01374-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72e7/7073124/e17a30bd9262/ijms-21-01374-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72e7/7073124/11197c1e8eb8/ijms-21-01374-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72e7/7073124/e17a30bd9262/ijms-21-01374-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72e7/7073124/11197c1e8eb8/ijms-21-01374-g002.jpg

相似文献

1
OmixLitMiner: A Bioinformatics Tool for Prioritizing Biological Leads from 'Omics Data Using Literature Retrieval and Data Mining.OmixLitMiner:一种生物信息学工具,用于通过文献检索和数据挖掘从 'Omics 数据中优先考虑生物学线索。
Int J Mol Sci. 2020 Feb 19;21(4):1374. doi: 10.3390/ijms21041374.
2
MILANO--custom annotation of microarray results using automatic literature searches.米兰——使用自动文献检索对微阵列结果进行定制注释。
BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12.
3
Data mining methods in Omics-based biomarker discovery.基于组学的生物标志物发现中的数据挖掘方法。
Methods Mol Biol. 2011;719:511-26. doi: 10.1007/978-1-61779-027-0_24.
4
Text mining in livestock animal science: introducing the potential of text mining to animal sciences.文本挖掘在畜牧动物科学中的应用:介绍文本挖掘在动物科学中的应用潜力。
J Anim Sci. 2012 Oct;90(10):3666-76. doi: 10.2527/jas.2011-4841. Epub 2012 Jun 4.
5
bioNerDS: exploring bioinformatics' database and software use through literature mining.生物信息学数据库和软件的文献挖掘研究。
BMC Bioinformatics. 2013 Jun 15;14:194. doi: 10.1186/1471-2105-14-194.
6
Bioinformatics Methods to Deduce Biological Interpretation from Proteomics Data.从蛋白质组学数据推导生物学解释的生物信息学方法
Methods Mol Biol. 2017;1549:147-161. doi: 10.1007/978-1-4939-6740-7_12.
7
Text mining tools for extracting information about microbial biodiversity in food.用于从食品中提取微生物生物多样性信息的文本挖掘工具。
Food Microbiol. 2019 Aug;81:63-75. doi: 10.1016/j.fm.2018.04.011. Epub 2018 Apr 21.
8
Textpresso: an ontology-based information retrieval and extraction system for biological literature.Textpresso:一个基于本体的生物文献信息检索与提取系统。
PLoS Biol. 2004 Nov;2(11):e309. doi: 10.1371/journal.pbio.0020309. Epub 2004 Sep 21.
9
Bioinformatics Tools for Proteomics Data Interpretation.用于蛋白质组学数据解读的生物信息学工具
Adv Exp Med Biol. 2016;919:281-341. doi: 10.1007/978-3-319-41448-5_16.
10
Platforms and Pipelines for Proteomics Data Analysis and Management.蛋白质组学数据分析与管理的平台和流程
Adv Exp Med Biol. 2016;919:203-215. doi: 10.1007/978-3-319-41448-5_9.

引用本文的文献

1
Spatial proteomics reveals distinct protein patterns in cortical migration disorders caused by LIN28A overexpression and WNT activation.空间蛋白质组学揭示了由LIN28A过表达和WNT激活引起的皮质迁移障碍中不同的蛋白质模式。
Mol Cell Proteomics. 2025 Jul 16;24(9):101037. doi: 10.1016/j.mcpro.2025.101037.
2
Proteomic Landscape of Colorectal Cancer Derived Liver Metastasis Reveals Three Distinct Phenotypes With Specific Signaling and Enhanced Survival.结直肠癌肝转移的蛋白质组学图谱揭示了具有特定信号传导和延长生存期的三种不同表型。
Mol Cell Proteomics. 2025 Jul 4;24(8):101026. doi: 10.1016/j.mcpro.2025.101026.
3
An Evaluation Model for the Influence Factors of Interest in Literature Courses Based on Data Analysis and Association Rules in a Small-Sample Environment.

本文引用的文献

1
Identifying High-Priority Proteins Across the Human Diseasome Using Semantic Similarity.利用语义相似性识别人类疾病组中的高优先级蛋白质。
J Proteome Res. 2018 Dec 7;17(12):4267-4278. doi: 10.1021/acs.jproteome.8b00393. Epub 2018 Oct 9.
2
Differential Proteome Analysis of Human Neuroblastoma Xenograft Primary Tumors and Matched Spontaneous Distant Metastases.人神经母细胞瘤异种移植原发肿瘤与配对自发性远处转移的差异蛋白质组分析。
Sci Rep. 2018 Sep 18;8(1):13986. doi: 10.1038/s41598-018-32236-1.
3
Systematic Protein Prioritization for Targeted Proteomics Studies through Literature Mining.
基于数据分析和关联规则的小样本环境下文学课程兴趣影响因素评价模型
J Environ Public Health. 2022 Sep 9;2022:1900509. doi: 10.1155/2022/1900509. eCollection 2022.
4
Differential regulation of extracellular matrix proteins in three recurrent liver metastases of a single patient with colorectal cancer.在一位结直肠癌患者的三个复发性肝转移灶中,细胞外基质蛋白的差异调节。
Clin Exp Metastasis. 2020 Dec;37(6):649-656. doi: 10.1007/s10585-020-10058-8. Epub 2020 Oct 24.
通过文献挖掘进行靶向蛋白质组学研究的系统蛋白质优先级排序。
J Proteome Res. 2018 Apr 6;17(4):1383-1396. doi: 10.1021/acs.jproteome.7b00772. Epub 2018 Mar 15.
4
Successful identification of a predictive biomarker for lymph node metastasis in colorectal cancer using a proteomic approach.采用蛋白质组学方法成功鉴定结直肠癌淋巴结转移的预测生物标志物。
Oncotarget. 2017 Oct 30;8(63):106935-106947. doi: 10.18632/oncotarget.22149. eCollection 2017 Dec 5.
5
Recurrence-associated gene signature optimizes recurrence-free survival prediction of colorectal cancer.复发相关基因特征可优化结直肠癌无复发生存预测。
Mol Oncol. 2017 Nov;11(11):1544-1560. doi: 10.1002/1878-0261.12117. Epub 2017 Sep 23.
6
A Review of Recent Advancement in Integrating Omics Data with Literature Mining towards Biomedical Discoveries.整合组学数据与文献挖掘以促进生物医学发现的最新进展综述
Int J Genomics. 2017;2017:6213474. doi: 10.1155/2017/6213474. Epub 2017 Feb 26.
7
Proteomics of thyroid tumours provides new insights into their molecular composition and changes associated with malignancy.甲状腺肿瘤的蛋白质组学为其分子组成及与恶性肿瘤相关的变化提供了新的见解。
Sci Rep. 2016 Mar 30;6:23660. doi: 10.1038/srep23660.
8
Database resources of the National Center for Biotechnology Information.美国国立生物技术信息中心的数据库资源。
Nucleic Acids Res. 2016 Jan 4;44(D1):D7-19. doi: 10.1093/nar/gkv1290. Epub 2015 Nov 28.
9
Silencing GPX3 Expression Promotes Tumor Metastasis in Human Thyroid Cancer.沉默GPX3表达促进人类甲状腺癌的肿瘤转移。
Curr Protein Pept Sci. 2015;16(4):316-21. doi: 10.2174/138920371604150429154840.
10
A genome-wide MeSH-based literature mining system predicts implicit gene-to-gene relationships and networks.一个基于全基因组医学主题词表的文献挖掘系统可预测隐含的基因与基因之间的关系及网络。
BMC Syst Biol. 2013 Oct 16;7 Suppl 3(Suppl 3):S9. doi: 10.1186/1752-0509-7-S3-S9.