• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

FAMCS: finding all maximal common substructures in proteins.

作者信息

Yao Zhen, Xiao Juan, Tung Anthony K H, Sung Wing Kin

机构信息

Department of Computer Science, National University of Singapore.

出版信息

Genomics Proteomics Bioinformatics. 2005 May;3(2):107-19. doi: 10.1016/s1672-0229(05)03015-9.

DOI:10.1016/s1672-0229(05)03015-9
PMID:16393147
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5172543/
Abstract

Finding the common substructures shared by two proteins is considered as one of the central issues in computational biology because of its usefulness in understanding the structure-function relationship and application in drug and vaccine design. In this paper, we propose a novel algorithm called FAMCS (Finding All Maximal Common Substructures) for the common substructure identification problem. Our method works initially at the protein secondary structural element (SSE) level and starts with the identification of all structurally similar SSE pairs. These SSE pairs are then merged into sets using a modified Apriori algorithm, which will test the similarity of various sets of SSE pairs incrementally until all the maximal sets of SSE pairs that deemed to be similar are found. The maximal common substructures of the two proteins will be formed from these maximal sets. A refinement algorithm is also proposed to fine tune the alignment from the SSE level to the residue level. Comparison of FAMCS with other methods on various proteins shows that FAMCS can address all four requirements and infer interesting biological discoveries.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4650/5172543/5622126adeba/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4650/5172543/cdf494ef1988/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4650/5172543/0a122b7650f7/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4650/5172543/b7ce32f9e203/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4650/5172543/954f2df45afc/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4650/5172543/5622126adeba/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4650/5172543/cdf494ef1988/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4650/5172543/0a122b7650f7/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4650/5172543/b7ce32f9e203/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4650/5172543/954f2df45afc/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4650/5172543/5622126adeba/gr5.jpg

相似文献

1
FAMCS: finding all maximal common substructures in proteins.
Genomics Proteomics Bioinformatics. 2005 May;3(2):107-19. doi: 10.1016/s1672-0229(05)03015-9.
2
Protein similarity from knot theory: geometric convolution and line weavings.纽结理论中的蛋白质相似性:几何卷积与线编织
J Comput Biol. 2005 Jul-Aug;12(6):609-37. doi: 10.1089/cmb.2005.12.609.
3
A relational extension of the notion of motifs: application to the common 3D protein substructures searching problem.基序概念的关系扩展:在常见三维蛋白质子结构搜索问题中的应用。
J Comput Biol. 2009 Dec;16(12):1635-60. doi: 10.1089/cmb.2008.0019.
4
MUSTANG-MR structural sieving server: applications in protein structural analysis and crystallography. MUSTANG-MR 结构筛析服务器:在蛋白质结构分析和晶体学中的应用。
PLoS One. 2010 Apr 6;5(4):e10048. doi: 10.1371/journal.pone.0010048.
5
Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions.萨米拉-可变位置比对法:一种通过重新检查字母向量位置的简单蛋白质比对方法。
J Bioinform Comput Biol. 2017 Apr;15(2):1750004. doi: 10.1142/S0219720017500044. Epub 2017 Jan 26.
6
Extraction of geometrically similar substructures: least-squares and Chebyshev fitting and the difference distance matrix.几何相似子结构的提取:最小二乘法与切比雪夫拟合以及差异距离矩阵
Proteins. 1998 Nov 15;33(3):320-8.
7
Efficient substructure RMSD query algorithms.高效子结构均方根偏差查询算法。
J Comput Biol. 2007 Nov;14(9):1201-7. doi: 10.1089/cmb.2007.0079.
8
Detection of distant structural similarities in a set of proteins using a fast graph-based method.使用一种基于快速图形的方法检测一组蛋白质中的远距离结构相似性。
Proc Int Conf Intell Syst Mol Biol. 1997;5:167-78.
9
Efficient RNA structure comparison algorithms.高效的RNA结构比较算法。
J Bioinform Comput Biol. 2017 Dec;15(6):1740009. doi: 10.1142/S0219720017400091. Epub 2017 Oct 19.
10
Angle-distance image matching techniques for protein structure comparison.用于蛋白质结构比较的角度-距离图像匹配技术
J Mol Recognit. 2008 Nov-Dec;21(6):442-52. doi: 10.1002/jmr.914.

本文引用的文献

1
FAST: a novel protein structure alignment algorithm.FAST:一种新型蛋白质结构比对算法。
Proteins. 2005 Feb 15;58(3):618-27. doi: 10.1002/prot.20331.
2
Non-sequential structure-based alignments reveal topology-independent core packing arrangements in proteins.基于非序列结构的比对揭示了蛋白质中与拓扑结构无关的核心堆积排列。
Bioinformatics. 2005 Apr 1;21(7):1010-9. doi: 10.1093/bioinformatics/bti128. Epub 2004 Nov 5.
3
Multiple structural alignment by secondary structures: algorithm and applications.基于二级结构的多重结构比对:算法与应用
Protein Sci. 2003 Nov;12(11):2492-507. doi: 10.1110/ps.03200603.
4
Common Structural Cliques: a tool for protein structure and function analysis.常见结构团:一种用于蛋白质结构与功能分析的工具。
Protein Eng. 2003 Aug;16(8):543-52. doi: 10.1093/protein/gzg080.
5
Human cystatin C, an amyloidogenic protein, dimerizes through three-dimensional domain swapping.人胱抑素C是一种淀粉样蛋白,通过三维结构域交换形成二聚体。
Nat Struct Biol. 2001 Apr;8(4):316-20. doi: 10.1038/86188.
6
Protein structure alignment using a genetic algorithm.使用遗传算法进行蛋白质结构比对。
Proteins. 2000 Mar 1;38(4):428-40. doi: 10.1002/(sici)1097-0134(20000301)38:4<428::aid-prot8>3.0.co;2-n.
7
The Protein Data Bank.蛋白质数据库。
Nucleic Acids Res. 2000 Jan 1;28(1):235-42. doi: 10.1093/nar/28.1.235.
8
Fast detection of common geometric substructure in proteins.蛋白质中常见几何子结构的快速检测
J Comput Biol. 1999 Fall-Winter;6(3-4):313-25. doi: 10.1089/106652799318292.
9
Finding local structural similarities among families of unrelated protein structures: a generic non-linear alignment algorithm.寻找不相关蛋白质结构家族之间的局部结构相似性:一种通用的非线性比对算法。
Proteins. 1999 Feb 15;34(3):341-55. doi: 10.1002/(sici)1097-0134(19990215)34:3<341::aid-prot7>3.0.co;2-z.
10
Protein structure alignment by incremental combinatorial extension (CE) of the optimal path.通过最优路径的增量组合扩展(CE)进行蛋白质结构比对。
Protein Eng. 1998 Sep;11(9):739-47. doi: 10.1093/protein/11.9.739.