• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于检测蛋白质家族的网络聚类算法。

A network clustering algorithm for detection of protein families.

作者信息

Xie Jiang, Wang Minchao, Dai Dongbo, Zhang Huiran, Zhang Wu

机构信息

School of Computer Engineering and Science, Shanghai University, Shanghai 200072, China.

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2012;2012:6329-32. doi: 10.1109/EMBC.2012.6347441.

DOI:10.1109/EMBC.2012.6347441
PMID:23367376
Abstract

Detection of protein families in large scale database is a difficult but important biological problem. Computational clustering methods can effectively address the problem. Although there exist many clustering algorithms, most of them are just based on the threshold. Their computational performances are affected by the weight distribution greatly, and they are only valid for some special networks. A new network clustering algorithm, Markov Finding and Clustering (MFC), is proposed to cluster the proteins into their functionally specific families accurately in this paper. The MFC algorithm makes an improvement in the random walk process and reduces the affection of the noise on the clustering result. It has a good performance on these networks which are not well addressed by existing algorithms sensitive to the noise. Finally, experiments on the protein sequence datasets demonstrate that the algorithm is effective in the detection of protein families and has a better performance than the current algorithms.

摘要

在大规模数据库中检测蛋白质家族是一个困难但重要的生物学问题。计算聚类方法可以有效地解决这个问题。尽管存在许多聚类算法,但它们大多仅基于阈值。它们的计算性能受权重分布的影响很大,并且仅对某些特殊网络有效。本文提出了一种新的网络聚类算法——马尔可夫查找与聚类(MFC),以将蛋白质准确地聚类到其功能特定的家族中。MFC算法在随机游走过程中进行了改进,减少了噪声对聚类结果的影响。对于现有对噪声敏感的算法处理效果不佳的网络,它具有良好的性能。最后,在蛋白质序列数据集上的实验表明,该算法在蛋白质家族检测中是有效的,并且比当前算法具有更好的性能。

相似文献

1
A network clustering algorithm for detection of protein families.一种用于检测蛋白质家族的网络聚类算法。
Annu Int Conf IEEE Eng Med Biol Soc. 2012;2012:6329-32. doi: 10.1109/EMBC.2012.6347441.
2
Markov clustering versus affinity propagation for the partitioning of protein interaction graphs.用于蛋白质相互作用图划分的马尔可夫聚类与亲和传播算法
BMC Bioinformatics. 2009 Mar 30;10:99. doi: 10.1186/1471-2105-10-99.
3
Network Community Detection Based on the Physarum-Inspired Computational Framework.基于黏菌启发式计算框架的网络社区检测。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Nov-Dec;15(6):1916-1928. doi: 10.1109/TCBB.2016.2638824. Epub 2016 Dec 13.
4
Improving the quality of protein similarity network clustering algorithms using the network edge weight distribution.利用网络边权重分布改进蛋白质相似性网络聚类算法的质量。
Bioinformatics. 2011 Feb 1;27(3):326-33. doi: 10.1093/bioinformatics/btq655. Epub 2010 Nov 29.
5
Predicting protein complexes from weighted protein-protein interaction graphs with a novel unsupervised methodology: Evolutionary enhanced Markov clustering.利用一种新颖的无监督方法从加权蛋白质 - 蛋白质相互作用图预测蛋白质复合物:进化增强的马尔可夫聚类。
Artif Intell Med. 2015 Mar;63(3):181-9. doi: 10.1016/j.artmed.2014.12.012. Epub 2015 Feb 18.
6
HipMCL: a high-performance parallel implementation of the Markov clustering algorithm for large-scale networks.HipMCL:一种用于大规模网络的马尔可夫聚类算法的高性能并行实现。
Nucleic Acids Res. 2018 Apr 6;46(6):e33. doi: 10.1093/nar/gkx1313.
7
Evaluation and improvements of clustering algorithms for detecting remote homologous protein families.用于检测远程同源蛋白家族的聚类算法的评估与改进
BMC Bioinformatics. 2015 Feb 5;16:34. doi: 10.1186/s12859-014-0445-4.
8
Searching remote homology with spectral clustering with symmetry in neighborhood cluster kernels.基于邻域聚类核的谱聚类对称性搜索远程同源性。
PLoS One. 2013;8(2):e46468. doi: 10.1371/journal.pone.0046468. Epub 2013 Feb 15.
9
PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks.PS-MCL:基于并行鸟枪法的蛋白质相互作用网络粗化马尔可夫聚类。
BMC Bioinformatics. 2019 Jul 24;20(Suppl 13):381. doi: 10.1186/s12859-019-2856-8.
10
A hybrid clustering approach to recognition of protein families in 114 microbial genomes.一种用于识别114个微生物基因组中蛋白质家族的混合聚类方法。
BMC Bioinformatics. 2004 Apr 29;5:45. doi: 10.1186/1471-2105-5-45.

引用本文的文献

1
CNNcon: improved protein contact maps prediction using cascaded neural networks.CNNcon:使用级联神经网络改进蛋白质接触图预测。
PLoS One. 2013 Apr 23;8(4):e61533. doi: 10.1371/journal.pone.0061533. Print 2013.