• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于超图表示的疾病基因分类。

Disease gene classification with metagraph representations.

机构信息

Computer Science and Engineering, Nanyang Technological University, Singapore.

Data Analytics Department, Institute for Infocomm Research, Singapore.

出版信息

Methods. 2017 Dec 1;131:83-92. doi: 10.1016/j.ymeth.2017.06.036. Epub 2017 Jul 8.

DOI:10.1016/j.ymeth.2017.06.036
PMID:28694066
Abstract

Protein-protein interaction (PPI) networks play an important role in studying the functional roles of proteins, including their association with diseases. However, protein interaction networks are not sufficient without the support of additional biological knowledge for proteins such as their molecular functions and biological processes. To complement and enrich PPI networks, we propose to exploit biological properties of individual proteins. More specifically, we integrate keywords describing protein properties into the PPI network, and construct a novel PPI-Keywords (PPIK) network consisting of both proteins and keywords as two different types of nodes. As disease proteins tend to have a similar topological characteristics on the PPIK network, we further propose to represent proteins with metagraphs. Different from a traditional network motif or subgraph, a metagraph can capture a particular topological arrangement involving the interactions/associations between both proteins and keywords. Based on the novel metagraph representations for proteins, we further build classifiers for disease protein classification through supervised learning. Our experiments on three different PPI databases demonstrate that the proposed method consistently improves disease protein prediction across various classifiers, by 15.3% in AUC on average. It outperforms the baselines including the diffusion-based methods (e.g., RWR) and the module-based methods by 13.8-32.9% for overall disease protein prediction. For predicting breast cancer genes, it outperforms RWR, PRINCE and the module-based baselines by 6.6-14.2%. Finally, our predictions also turn out to have better correlations with literature findings from PubMed.

摘要

蛋白质-蛋白质相互作用(PPI)网络在研究蛋白质的功能角色方面起着重要作用,包括它们与疾病的关联。然而,如果没有蛋白质的其他生物知识(如它们的分子功能和生物过程)的支持,蛋白质相互作用网络是不充分的。为了补充和丰富 PPI 网络,我们建议利用单个蛋白质的生物特性。更具体地说,我们将描述蛋白质特性的关键字集成到 PPI 网络中,并构建一个由蛋白质和关键字作为两种不同类型节点组成的新型 PPI-关键字(PPIK)网络。由于疾病蛋白在 PPIK 网络上往往具有相似的拓扑特征,我们进一步提出用超图来表示蛋白。与传统的网络模式或子图不同,超图可以捕获涉及蛋白质和关键字之间相互作用/关联的特定拓扑排列。基于蛋白质的新超图表示,我们通过监督学习进一步构建用于疾病蛋白分类的分类器。我们在三个不同的 PPI 数据库上的实验表明,该方法通过平均提高 15.3%的 AUC,一致地提高了各种分类器的疾病蛋白预测能力。它比基于扩散的方法(例如 RWR)和基于模块的方法的基线提高了 13.8-32.9%,用于整体疾病蛋白预测。对于预测乳腺癌基因,它比 RWR、PRINCE 和基于模块的基线高出 6.6-14.2%。最后,我们的预测与 PubMed 中的文献发现也具有更好的相关性。

相似文献

1
Disease gene classification with metagraph representations.基于超图表示的疾病基因分类。
Methods. 2017 Dec 1;131:83-92. doi: 10.1016/j.ymeth.2017.06.036. Epub 2017 Jul 8.
2
Disease Gene Classification with Metagraph Representations.基于元图表示的疾病基因分类
Methods Mol Biol. 2018;1807:211-224. doi: 10.1007/978-1-4939-8561-6_16.
3
Potential Schizophrenia Disease-Related Genes Prediction Using Metagraph Representations Based on a Protein-Protein Interaction Keyword Network: Framework Development and Validation.基于蛋白质-蛋白质相互作用关键词网络的元图表示法预测潜在的精神分裂症相关基因:框架开发与验证
JMIR Form Res. 2023 Nov 15;7:e50998. doi: 10.2196/50998.
4
Incorporating topological information for predicting robust cancer subnetwork markers in human protein-protein interaction network.整合拓扑信息以预测人类蛋白质-蛋白质相互作用网络中稳健的癌症子网标志物。
BMC Bioinformatics. 2016 Oct 6;17(Suppl 13):351. doi: 10.1186/s12859-016-1224-1.
5
Protein Complexes Detection Based on Semi-Supervised Network Embedding Model.基于半监督网络嵌入模型的蛋白质复合物检测。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Mar-Apr;18(2):797-803. doi: 10.1109/TCBB.2019.2944809. Epub 2021 Apr 8.
6
Network-based ranking methods for prediction of novel disease associated microRNAs.基于网络的新型疾病相关微小RNA预测排序方法。
Comput Biol Chem. 2015 Oct;58:139-48. doi: 10.1016/j.compbiolchem.2015.07.003. Epub 2015 Jul 21.
7
Prediction of Protein-Protein Interaction via co-occurring Aligned Pattern Clusters.通过共现比对模式簇预测蛋白质-蛋白质相互作用
Methods. 2016 Nov 1;110:26-34. doi: 10.1016/j.ymeth.2016.07.018. Epub 2016 Jul 27.
8
Predicting diabetes mellitus genes via protein-protein interaction and protein subcellular localization information.通过蛋白质-蛋白质相互作用和蛋白质亚细胞定位信息预测糖尿病基因。
BMC Genomics. 2016 Aug 18;17 Suppl 4(Suppl 4):433. doi: 10.1186/s12864-016-2795-y.
9
Protein2Vec: Aligning Multiple PPI Networks with Representation Learning.蛋白质2向量:通过表征学习对齐多个蛋白质-蛋白质相互作用网络
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):240-249. doi: 10.1109/TCBB.2019.2937771. Epub 2021 Feb 3.
10
Identification of protein complexes from multi-relationship protein interaction networks.从多重关系蛋白质相互作用网络中识别蛋白质复合物。
Hum Genomics. 2016 Jul 25;10 Suppl 2(Suppl 2):17. doi: 10.1186/s40246-016-0069-z.

引用本文的文献

1
Intelligent Protein Design and Molecular Characterization Techniques: A Comprehensive Review.智能蛋白质设计与分子特征技术:全面综述。
Molecules. 2023 Nov 30;28(23):7865. doi: 10.3390/molecules28237865.
2
HetIG-PreDiG: A Heterogeneous Integrated Graph Model for Predicting Human Disease Genes based on gene expression.HetIG-PreDiG:一种基于基因表达的用于预测人类疾病基因的异构集成图模型。
PLoS One. 2023 Feb 15;18(2):e0280839. doi: 10.1371/journal.pone.0280839. eCollection 2023.
3
Integrating node embeddings and biological annotations for genes to predict disease-gene associations.
整合基因的节点嵌入和生物学注释以预测疾病-基因关联。
BMC Syst Biol. 2018 Dec 31;12(Suppl 9):138. doi: 10.1186/s12918-018-0662-y.