• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从蛋白质三维结构图谱中高效挖掘循环子结构

Efficiently Mining Recurrent Substructures from Protein Three-Dimensional Structure Graphs.

作者信息

Saidi Rabie, Dhifli Wajdi, Maddouri Mondher, Mephu Nguifo Engelbert

机构信息

1 European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom.

2 University of Lille, Faculty of Pharmaceutical and Biological Sciences, EA2694, F-59000 Lille, France.

出版信息

J Comput Biol. 2019 Jun;26(6):561-571. doi: 10.1089/cmb.2018.0171. Epub 2018 Dec 5.

DOI:10.1089/cmb.2018.0171
PMID:30517022
Abstract

Studying protein structures is a major asset for understanding the molecular mechanisms of life. The number of publicly available protein structures has increasingly become extremely large. Yet, the classification of a protein structure remains a difficult, costly, and time-consuming task. Exploring spatial information on protein structures can provide important functional and structural insights. In this context, spatial motifs may correspond to relevant fragments, which might be very useful for a better understanding of proteins. In this article, we propose AntMot, a fast algorithm, to find spatial motifs from protein three-dimensional structures by extending the Karp-Miller-Rosenberg repetition finder, originally dedicated to sequences. The extracted motifs, termed ant-motifs, follow an ant-like shape that is composed of a backbone fragment from the primary structure, enriched with spatial refinements. We show that these motifs are biologically sound, and we used them as descriptors in the classification of several benchmark datasets. Experimental results show that our approach presents a trade-off between sequential motifs and subgraph motifs in terms of the number of extracted substructures, while providing a significant enhancement in the classification accuracy over sequential and frequent-subgraph motifs as well as alignment-based approaches.

摘要

研究蛋白质结构是理解生命分子机制的一项重要内容。公开可用的蛋白质结构数量已变得极其庞大。然而,蛋白质结构的分类仍然是一项困难、昂贵且耗时的任务。探索蛋白质结构的空间信息可以提供重要的功能和结构见解。在这种背景下,空间基序可能对应于相关片段,这对于更好地理解蛋白质可能非常有用。在本文中,我们提出了AntMot,一种快速算法,通过扩展最初用于序列的Karp-Miller-Rosenberg重复查找器,从蛋白质三维结构中找到空间基序。提取的基序,称为蚁基序,呈类似蚂蚁的形状,由一级结构中的主链片段组成,并富含空间细化信息。我们表明这些基序在生物学上是合理的,并且我们将它们用作几个基准数据集分类中的描述符。实验结果表明,我们的方法在提取的子结构数量方面在序列基序和子图基序之间进行了权衡,同时在分类准确性方面比序列和频繁子图基序以及基于比对的方法有显著提高。

相似文献

1
Efficiently Mining Recurrent Substructures from Protein Three-Dimensional Structure Graphs.从蛋白质三维结构图谱中高效挖掘循环子结构
J Comput Biol. 2019 Jun;26(6):561-571. doi: 10.1089/cmb.2018.0171. Epub 2018 Dec 5.
2
Smoothing 3D protein structure motifs through graph mining and amino acid similarities.通过图挖掘和氨基酸相似性平滑三维蛋白质结构基序
J Comput Biol. 2014 Feb;21(2):162-72. doi: 10.1089/cmb.2013.0092. Epub 2013 Oct 12.
3
Discovery of Functional Motifs from the Interface Region of Oligomeric Proteins Using Frequent Subgraph Mining.利用频繁子图挖掘技术从寡聚蛋白界面区域发现功能基序。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Sep-Oct;16(5):1537-1549. doi: 10.1109/TCBB.2017.2756879. Epub 2017 Sep 26.
4
Comparing graph representations of protein structure for mining family-specific residue-based packing motifs.比较蛋白质结构的图形表示以挖掘基于家族特异性残基的堆积基序。
J Comput Biol. 2005 Jul-Aug;12(6):657-71. doi: 10.1089/cmb.2005.12.657.
5
Functional neighbors: inferring relationships between nonhomologous protein families using family-specific packing motifs.功能邻域:利用家族特异性包装基序推断非同源蛋白质家族之间的关系。
IEEE Trans Inf Technol Biomed. 2010 Sep;14(5):1137-43. doi: 10.1109/TITB.2010.2053550. Epub 2010 Jun 21.
6
Distance-based identification of structure motifs in proteins using constrained frequent subgraph mining.使用受限频繁子图挖掘基于距离的蛋白质结构基序识别
Comput Syst Bioinformatics Conf. 2006:227-38.
7
Discovering interesting molecular substructures for molecular classification.发现分子分类的有趣分子亚结构。
IEEE Trans Nanobioscience. 2010 Jun;9(2):77-89. doi: 10.1109/TNB.2010.2042609.
8
Mining protein loops using a structural alphabet and statistical exceptionality.使用结构字母表和统计异常挖掘蛋白质环。
BMC Bioinformatics. 2010 Feb 4;11:75. doi: 10.1186/1471-2105-11-75.
9
Efficient mining gapped sequential patterns for motifs in biological sequences.用于挖掘生物序列中基序的带间隙序列模式的高效挖掘方法。
BMC Syst Biol. 2013;7 Suppl 4(Suppl 4):S7. doi: 10.1186/1752-0509-7-S4-S7. Epub 2013 Oct 23.
10
Accurate classification of protein structural families using coherent subgraph analysis.使用相干子图分析对蛋白质结构家族进行准确分类。
Pac Symp Biocomput. 2004:411-22. doi: 10.1142/9789812704856_0039.