• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从蛋白质序列中快速生成突变数据矩阵。

The rapid generation of mutation data matrices from protein sequences.

作者信息

Jones D T, Taylor W R, Thornton J M

机构信息

Department of Biochemistry and Molecular Biology, University College, London, UK.

出版信息

Comput Appl Biosci. 1992 Jun;8(3):275-82. doi: 10.1093/bioinformatics/8.3.275.

DOI:10.1093/bioinformatics/8.3.275
PMID:1633570
Abstract

An efficient means for generating mutation data matrices from large numbers of protein sequences is presented here. By means of an approximate peptide-based sequence comparison algorithm, the set sequences are clustered at the 85% identity level. The closest relating pairs of sequences are aligned, and observed amino acid exchanges tallied in a matrix. The raw mutation frequency matrix is processed in a similar way to that described by Dayhoff et al. (1978), and so the resulting matrices may be easily used in current sequence analysis applications, in place of the standard mutation data matrices, which have not been updated for 13 years. The method is fast enough to process the entire SWISS-PROT databank in 20 h on a Sun SPARCstation 1, and is fast enough to generate a matrix from a specific family or class of proteins in minutes. Differences observed between our 250 PAM mutation data matrix and the matrix calculated by Dayhoff et al. are briefly discussed.

摘要

本文介绍了一种从大量蛋白质序列生成突变数据矩阵的有效方法。借助基于近似肽的序列比较算法,将序列集在85%同一性水平上进行聚类。对最相近的序列对进行比对,并在矩阵中统计观察到的氨基酸交换情况。原始突变频率矩阵的处理方式与Dayhoff等人(1978年)描述的类似,因此所得矩阵可轻松用于当前的序列分析应用中,以替代已13年未更新的标准突变数据矩阵。该方法速度足够快,在Sun SPARCstation 1上20小时内可处理整个SWISS-PROT数据库,并且在几分钟内就能从特定的蛋白质家族或类别中生成矩阵。我们简要讨论了250 PAM突变数据矩阵与Dayhoff等人计算的矩阵之间观察到的差异。

相似文献

1
The rapid generation of mutation data matrices from protein sequences.从蛋白质序列中快速生成突变数据矩阵。
Comput Appl Biosci. 1992 Jun;8(3):275-82. doi: 10.1093/bioinformatics/8.3.275.
2
PR2ALIGN: a stand-alone software program and a web-server for protein sequence alignment using weighted biochemical properties of amino acids.PR2ALIGN:一个用于利用氨基酸加权生化特性进行蛋白质序列比对的独立软件程序和网络服务器。
BMC Res Notes. 2015 May 7;8:187. doi: 10.1186/s13104-015-1152-6.
3
A bank of protein family patterns for rapid identification of possible functions of amino acid sequences.一组用于快速识别氨基酸序列可能功能的蛋白质家族模式。
Comput Appl Biosci. 1997 Apr;13(2):115-22. doi: 10.1093/bioinformatics/13.2.115.
4
A new approach for displaying identities and differences among aligned amino acid sequences.一种展示比对后的氨基酸序列之间的相同性和差异的新方法。
Comput Appl Biosci. 1992 Jun;8(3):261-5. doi: 10.1093/bioinformatics/8.3.261.
5
A set-theoretic approach to database searching and clustering.一种用于数据库搜索和聚类的集合论方法。
Bioinformatics. 1998 Jun;14(5):430-8. doi: 10.1093/bioinformatics/14.5.430.
6
Amino acid substitution during functionally constrained divergent evolution of protein sequences.蛋白质序列功能受限的趋异进化过程中的氨基酸替换。
Protein Eng. 1994 Nov;7(11):1323-32. doi: 10.1093/protein/7.11.1323.
7
Amino acid substitution matrices from protein blocks.来自蛋白质模块的氨基酸替换矩阵。
Proc Natl Acad Sci U S A. 1992 Nov 15;89(22):10915-9. doi: 10.1073/pnas.89.22.10915.
8
The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions.用于比较具有非标准组成的蛋白质的氨基酸替换矩阵的构建。
Bioinformatics. 2005 Apr 1;21(7):902-11. doi: 10.1093/bioinformatics/bti070. Epub 2004 Oct 27.
9
Alignment-free comparison of protein sequences based on reduced amino acid alphabets.基于简化氨基酸字母表的蛋白质序列无比对比较。
J Biomol Struct Dyn. 2009 Jun;26(6):763-9. doi: 10.1080/07391102.2009.10507288.
10
Protein database searches using compositionally adjusted substitution matrices.使用成分调整替代矩阵进行蛋白质数据库搜索。
FEBS J. 2005 Oct;272(20):5101-9. doi: 10.1111/j.1742-4658.2005.04945.x.

引用本文的文献

1
Progestogens and androgens influence root morphology of angiosperms in a brassinosteroid-independent manner.孕激素和雄激素以一种不依赖油菜素内酯的方式影响被子植物的根系形态。
Plant J. 2025 Sep;123(5):e70459. doi: 10.1111/tpj.70459.
2
Woody Host-Specific Type III Effector HopBL2 Is Essential for Pseudomonas savastanoi Virulence and Associates With Plasmodesmata.木质宿主特异性III型效应蛋白HopBL2对丁香假单胞菌的致病性至关重要,并与胞间连丝相关。
Mol Plant Pathol. 2025 Sep;26(9):e70142. doi: 10.1111/mpp.70142.
3
Golgi_traff phylogeny reveals ancient eukaryotic genes with recent surprises: replication and diversification of HID1 domain-containing protein unique to Schizosaccharomyces.
高尔基体运输系统的系统发育揭示了古老的真核基因,其中有近期令人惊讶的发现:裂殖酵母特有的含HID1结构域蛋白的复制与多样化。
FEMS Microbiol Lett. 2025 Jan 10;372. doi: 10.1093/femsle/fnaf088.
4
Freshwater-adapted polychaetes exhibit a complete enzymatic machinery for synthesizing long-chain polyunsaturated fatty acids.适应淡水环境的多毛纲动物展现出一套完整的用于合成长链多不饱和脂肪酸的酶机制。
Open Biol. 2025 Sep;15(9):250159. doi: 10.1098/rsob.250159. Epub 2025 Sep 3.
5
Chelation-based iron uptake mitigates the effects of prolonged high-temperature stress in cool-season grasses.基于螯合作用的铁吸收减轻了冷季型草坪草长期高温胁迫的影响。
Nat Commun. 2025 Aug 28;16(1):7709. doi: 10.1038/s41467-025-63005-0.
6
Phylogenetic Reclassification of and Species Complex.某物种复合体的系统发育重新分类
Pathogens. 2025 Jul 29;14(8):745. doi: 10.3390/pathogens14080745.
7
Structural disorder and distinctive motifs in the C-terminal region of the MADS-domain transcription factors are conserved across diverse taxa.MADS结构域转录因子C末端区域的结构紊乱和独特基序在不同分类群中是保守的。
PLoS One. 2025 Aug 22;20(8):e0330098. doi: 10.1371/journal.pone.0330098. eCollection 2025.
8
Characterization and transcriptomic analysis provide novel insights into the sexual differentiation of the dioecious plant Bama Huoma (Cannabis sativa).特征鉴定和转录组分析为雌雄异株植物巴马火麻(大麻)的性别分化提供了新见解。
BMC Plant Biol. 2025 Aug 21;25(1):1108. doi: 10.1186/s12870-025-07096-3.
9
The path-label reconciliation (PLR) dissimilarity measure for gene trees.用于基因树的路径标签协调(PLR)差异度量。
Algorithms Mol Biol. 2025 Aug 19;20(1):16. doi: 10.1186/s13015-025-00284-8.
10
Characterization of the Avian Mitochondrial-Derived Peptide MOTS-c and Its Potential Role as a Metabolic Regulator.禽源线粒体衍生肽MOTS-c的特性及其作为代谢调节剂的潜在作用。
Animals (Basel). 2025 Jul 29;15(15):2230. doi: 10.3390/ani15152230.