The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks.

Authors

Hsing Michael, Byler Kendall Grant, Cherkasov Artem

Affiliation

Faculty of Graduate Studies, Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC, Canada.

Publication

BMC Syst Biol. 2008 Sep 16;2:80. doi: 10.1186/1752-0509-2-80.

DOI:10.1186/1752-0509-2-80
PMID:18796161
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2553323/
Abstract

BACKGROUND

Protein-protein interactions mediate a wide range of cellular functions and responses and have been studied rigorously through recent large-scale proteomics experiments and bioinformatics analyses. One of the most important findings of those endeavours was the observation that 'hub' proteins participate in significant numbers of protein interactions and play critical roles in the organization and function of cellular protein interaction networks (PINs) [1,2]. It has also been demonstrated that such hub proteins may constitute an important pool of attractive drug targets. Thus, it is crucial to be able to identify hub proteins based not only on experimental data but also by means of bioinformatics predictions.

RESULTS

A hub protein classifier has been developed based on the available interaction data and Gene Ontology (GO) annotations for proteins in the Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens genomes. In particular, by utilizing the machine learning method of boosting trees we were able to create a predictive bioinformatics tool for the identification of proteins that are likely to play the role of a hub in protein interaction networks. Testing the developed hub classifier on external sets of experimental protein interaction data in Methicillin-resistant Staphylococcus aureus (MRSA) 252 and Caenorhabditis elegans demonstrated that our approach can predict hub proteins with a high degree of accuracy. A practical application of the developed bioinformatics method has been illustrated by the effective protein bait selection for large-scale pull-down experiments that aim to map complete protein-protein interaction networks for several species.
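
The idea in the paragraph above can be illustrated with a small sketch. The abstract does not give the implementation details, so everything here is an assumption made for illustration: hubs are taken to be the top fraction of proteins by interaction count, GO annotations are reduced to binary term-presence features, and the boosting step is plain AdaBoost over single-term decision stumps (depth-1 trees) rather than the authors' actual boosting-trees setup. The function names (`label_hubs`, `train_adaboost`, `predict`), the protein IDs, and the `T1`..`T3` term labels are all hypothetical.

```python
import math
from collections import Counter


def label_hubs(interactions, top_fraction=0.1):
    """Label proteins +1 (hub) or -1 (non-hub) by degree.

    The top-fraction cutoff is an assumed definition of "hub".
    """
    # Deduplicate undirected pairs and ignore self-interactions.
    pairs = {frozenset(p) for p in interactions if p[0] != p[1]}
    degree = Counter()
    for a, b in pairs:
        degree[a] += 1
        degree[b] += 1
    ranked = sorted(degree, key=lambda p: (-degree[p], p))
    n_hubs = max(1, int(len(ranked) * top_fraction))
    hubs = set(ranked[:n_hubs])
    return {p: (1 if p in hubs else -1) for p in degree}


def stump(annotations, protein, term, polarity):
    """Depth-1 tree: vote `polarity` if the protein carries `term`, else the opposite."""
    return polarity if term in annotations.get(protein, ()) else -polarity


def train_adaboost(annotations, labels, rounds=10):
    """Fit AdaBoost over one-term stumps; returns a list of (alpha, term, polarity)."""
    proteins = sorted(labels)
    terms = sorted({t for p in proteins for t in annotations.get(p, ())})
    weights = {p: 1.0 / len(proteins) for p in proteins}
    model = []
    for _ in range(rounds):
        best = None
        for term in terms:
            for polarity in (1, -1):
                err = sum(weights[p] for p in proteins
                          if stump(annotations, p, term, polarity) != labels[p])
                if best is None or err < best[0]:
                    best = (err, term, polarity)
        if best is None or best[0] >= 0.5:
            break  # no terms available, or no stump better than chance
        raw_err, term, polarity = best
        err = max(raw_err, 1e-10)  # avoid division by zero for a perfect stump
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((alpha, term, polarity))
        if raw_err == 0:
            break  # a perfect stump leaves nothing to reweight
        # Increase the weight of misclassified proteins, then renormalize.
        for p in proteins:
            weights[p] *= math.exp(-alpha * labels[p] * stump(annotations, p, term, polarity))
        total = sum(weights.values())
        for p in proteins:
            weights[p] /= total
    return model


def predict(model, annotations, protein):
    """Weighted vote of all stumps; +1 means predicted hub."""
    score = sum(alpha * stump(annotations, protein, term, polarity)
                for alpha, term, polarity in model)
    return 1 if score > 0 else -1


# Toy data, invented for illustration only.
interactions = [("P1", "P2"), ("P1", "P3"), ("P1", "P4"), ("P1", "P5"),
                ("P6", "P2"), ("P6", "P3"), ("P6", "P4"), ("P7", "P8")]
annotations = {"P1": {"T1", "T2"}, "P6": {"T1", "T3"}, "P2": {"T2"},
               "P3": {"T3"}, "P4": {"T2", "T3"}, "P5": {"T3"},
               "P7": {"T2"}, "P8": set()}

labels = label_hubs(interactions, top_fraction=0.25)
model = train_adaboost(annotations, labels)
new_annotations = {"Q1": {"T1"}, "Q2": {"T2"}}
print(predict(model, new_annotations, "Q1"), predict(model, new_annotations, "Q2"))
```

Because prediction only needs a protein's GO annotations, a model trained on organisms with interaction data could in principle be applied to an organism that has annotations but no interaction map, which is the use case the abstract describes.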

CONCLUSION

The successful development of an accurate hub classifier demonstrated that highly-connected proteins tend to share certain relevant functional properties reflected in their Gene Ontology annotations. It is anticipated that the developed bioinformatics hub classifier will represent a useful tool for the theoretical prediction of highly-interacting proteins, the study of cellular network organizations, and the identification of prospective drug targets, even in those organisms that currently lack large-scale protein interaction data.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da44/2553323/27499a004461/1752-0509-2-80-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da44/2553323/8c3f74c11299/1752-0509-2-80-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da44/2553323/fe1204809b4c/1752-0509-2-80-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da44/2553323/2d8e8d5b22e7/1752-0509-2-80-4.jpg

Similar articles

1
The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks.
BMC Syst Biol. 2008 Sep 16;2:80. doi: 10.1186/1752-0509-2-80.
2
Predicting highly-connected hubs in protein interaction networks by QSAR and biological data descriptors.
Bioinformation. 2009 Oct 15;4(4):164-8. doi: 10.6026/97320630004164.
3
Filtering high-throughput protein-protein interaction data using a combination of genomic features.
BMC Bioinformatics. 2005 Apr 18;6:100. doi: 10.1186/1471-2105-6-100.
4
Information flow analysis of interactome networks.
PLoS Comput Biol. 2009 Apr;5(4):e1000350. doi: 10.1371/journal.pcbi.1000350. Epub 2009 Apr 10.
5
Predicting whole genome protein interaction networks from primary sequence data in model and non-model organisms using ENTS.
BMC Genomics. 2013 Sep 10;14:608. doi: 10.1186/1471-2164-14-608.
6
Exploring the relationship between hub proteins and drug targets based on GO and intrinsic disorder.
Comput Biol Chem. 2015 Jun;56:41-8. doi: 10.1016/j.compbiolchem.2015.03.003. Epub 2015 Mar 23.
7
Difference in gene duplicability may explain the difference in overall structure of protein-protein interaction networks among eukaryotes.
BMC Evol Biol. 2010 Nov 18;10:358. doi: 10.1186/1471-2148-10-358.
8
Mapping the protein interaction network in methicillin-resistant Staphylococcus aureus.
J Proteome Res. 2011 Mar 4;10(3):1139-50. doi: 10.1021/pr100918u. Epub 2011 Jan 28.
9
Essential protein identification based on essential protein-protein interaction prediction by Integrated Edge Weights.
Methods. 2015 Jul 15;83:51-62. doi: 10.1016/j.ymeth.2015.04.013. Epub 2015 Apr 16.
10
AVID: an integrative framework for discovering functional relationships among proteins.
BMC Bioinformatics. 2005 Jun 1;6:136. doi: 10.1186/1471-2105-6-136.

Cited by

1
Comprehensive bioinformatics and machine learning analyses for breast cancer staging using TCGA dataset.
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae628.
2
Exploring gene regulatory interaction networks and predicting therapeutic molecules for hypopharyngeal cancer and EGFR-mutated lung adenocarcinoma.
FEBS Open Bio. 2024 Jul;14(7):1166-1191. doi: 10.1002/2211-5463.13807. Epub 2024 May 23.
3
Rule-Based Pruning and In Silico Identification of Essential Proteins in Yeast PPIN.
Cells. 2022 Aug 25;11(17):2648. doi: 10.3390/cells11172648.
4
Identification of the shared gene signatures and pathways between sarcopenia and type 2 diabetes mellitus.
PLoS One. 2022 Mar 10;17(3):e0265221. doi: 10.1371/journal.pone.0265221. eCollection 2022.
5
Assessment of colon cancer molecular mechanism: a system biology approach.
Gastroenterol Hepatol Bed Bench. 2021 Fall;14(Suppl1):S51-S57.
6
Integrative Multi-Omics Analysis Reveals Candidate Biomarkers for Oral Squamous Cell Carcinoma.
Front Oncol. 2022 Jan 14;11:794146. doi: 10.3389/fonc.2021.794146. eCollection 2021.
7
Deep Modeling of Regulating Effects of Small Molecules on Longevity-Associated Genes.
Pharmaceuticals (Basel). 2021 Sep 22;14(10):948. doi: 10.3390/ph14100948.
8
The long non-coding RNA plays critical roles in the pathogenesis of cholesterol gallstone.
PeerJ. 2021 Feb 23;9:e10803. doi: 10.7717/peerj.10803. eCollection 2021.
9
Identification of biomarkers and pathways for the SARS-CoV-2 infections that make complexities in pulmonary arterial hypertension patients.
Brief Bioinform. 2021 Mar 22;22(2):1451-1465. doi: 10.1093/bib/bbab026.
10
A Novel Computational Approach for Identifying Essential Proteins From Multiplex Biological Networks.
Front Genet. 2020 Apr 21;11:343. doi: 10.3389/fgene.2020.00343. eCollection 2020.

References

1
Use and misuse of the gene ontology annotations.
Nat Rev Genet. 2008 Jul;9(7):509-15. doi: 10.1038/nrg2363. Epub 2008 May 13.
2
KEGG for linking genomes to life and the environment.
Nucleic Acids Res. 2008 Jan;36(Database issue):D480-4. doi: 10.1093/nar/gkm882. Epub 2007 Dec 12.
3
NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.
Nucleic Acids Res. 2007 Jan;35(Database issue):D61-5. doi: 10.1093/nar/gkl842. Epub 2006 Nov 27.
4
Intrinsic disorder is a common feature of hub proteins from four eukaryotic interactomes.
PLoS Comput Biol. 2006 Aug 4;2(8):e100. doi: 10.1371/journal.pcbi.0020100. Epub 2006 Jun 23.
5
Why do hubs tend to be essential in protein networks?
PLoS Genet. 2006 Jun 2;2(6):e88. doi: 10.1371/journal.pgen.0020088. Epub 2006 Apr 26.
6
Global landscape of protein complexes in the yeast Saccharomyces cerevisiae.
Nature. 2006 Mar 30;440(7084):637-43. doi: 10.1038/nature04670. Epub 2006 Mar 22.
7
Evaluation of different biological data and computational classification methods for use in protein interaction prediction.
Proteins. 2006 May 15;63(3):490-500. doi: 10.1002/prot.20865.
8
Proteome survey reveals modularity of the yeast cell machinery.
Nature. 2006 Mar 30;440(7084):631-6. doi: 10.1038/nature04532. Epub 2006 Jan 22.
9
Pfam: clans, web tools and services.
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D247-51. doi: 10.1093/nar/gkj149.
10
Scale-free networks in cell biology.
J Cell Sci. 2005 Nov 1;118(Pt 21):4947-57. doi: 10.1242/jcs.02714.