Initial Cluster Analysis.

Authors

Altschul Stephen F, Neuwald Andrew F

Affiliations

1 National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland.

2 Department of Biochemistry and Molecular Biology, Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland.

Publication information

J Comput Biol. 2018 Feb;25(2):121-129. doi: 10.1089/cmb.2017.0050. Epub 2017 Aug 3.

DOI: 10.1089/cmb.2017.0050
PMID: 28771374
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5806593/
Abstract

We study a simple abstract problem motivated by a variety of applications in protein sequence analysis. Consider a string of 0s and 1s of length L, and containing D 1s. If we believe that some or all of the 1s may be clustered near the start of the sequence, which subset is the most significantly so clustered, and how significant is this clustering? We approach this question using the minimum description length principle and illustrate its application by analyzing residues that distinguish translational initiation and elongation factor guanosine triphosphatases (GTPases) from other P-loop GTPases. Within a structure of yeast elongation factor 1α, these residues form a significant cluster centered on a region implicated in guanine nucleotide exchange. Various biomedical questions may be cast as the abstract problem considered here.
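
The abstract problem can be illustrated with a minimal sketch. This is not the authors' encoding, only a simplified two-part code written under stated assumptions: the null model spends log2 C(L, D) bits to place D 1s among L positions; the alternative model first states a cut point x and the number d of 1s before it (an assumed overhead of log2 L + log2(D + 1) bits), then places the 1s separately on each side of the cut. The function name `initial_cluster` and the overhead term are hypothetical choices made for illustration.

```python
from math import comb, log2


def initial_cluster(bits):
    """Return (cut, ones_before_cut, bits_saved) for the best initial cluster.

    Simplified two-part description-length comparison (an assumption,
    not the encoding used in the paper). Returns (0, 0, 0.0) when no
    cut shortens the description.
    """
    L = len(bits)
    D = sum(bits)
    if L < 2:
        return (0, 0, 0.0)

    # Null model: one segment, D ones placed anywhere among L positions.
    null_bits = log2(comb(L, D))
    # Assumed cost of stating the cut point and the count of 1s before it.
    overhead = log2(L) + log2(D + 1)

    best = (0, 0, 0.0)
    d = 0  # running count of 1s in the prefix bits[:x]
    for x in range(1, L):
        d += bits[x - 1]
        alt_bits = overhead + log2(comb(x, d)) + log2(comb(L - x, D - d))
        saved = null_bits - alt_bits
        if saved > best[2]:
            best = (x, d, saved)
    return best


# Five 1s packed at the start of a length-50 string.
cut, ones, saved = initial_cluster([1] * 5 + [0] * 45)
print(cut, ones, round(saved, 2))
```

A positive number of bits saved suggests that describing the string with an initial cluster is more economical than describing it without one; how the saving translates into a significance statement depends on the encoding, which the paper treats more carefully than this sketch does.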


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbf4/5806593/272f4d83dabd/fig-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbf4/5806593/b68aa2fc708c/fig-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbf4/5806593/2e3515ca4bc8/fig-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bbf4/5806593/b5b3237a22a7/fig-3.jpg
