Search for Highly Divergent Tandem Repeats in Amino Acid Sequences.

Affiliations

Center of Bioengineering, Research Center of Biotechnology RAS, 119071 Moscow, Russia.

Moscow Engineering Physics Institute, National Research Nuclear University MEPhI, 115409 Moscow, Russia.

Publication information

Int J Mol Sci. 2021 Jul 1;22(13):7096. doi: 10.3390/ijms22137096.

DOI:10.3390/ijms22137096
PMID:34281150
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8269118/
Abstract

We report a Method to Search for Highly Divergent Tandem Repeats (MSHDTR) in protein sequences which considers pairwise correlations between adjacent residues. MSHDTR was compared with some previously developed methods for searching for tandem repeats (TRs) in amino acid sequences, such as T-REKS and XSTREAM, which focus on the identification of TRs with significant sequence similarity, whereas MSHDTR detects repeats that significantly diverged during evolution, accumulating deletions, insertions, and substitutions. The application of MSHDTR to a search of the Swiss-Prot databank revealed over 15 thousand TR-containing amino acid sequences that were difficult to find using the other methods. Among the detected TRs, the most representative were those with consensus lengths of two and seven residues; these TRs were subjected to cluster analysis and the classes of patterns were identified. All TRs detected in this study have been combined into a databank accessible over the WWW.
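To make the notion of a tandem repeat with a given consensus length concrete, the sketch below scores a sequence against itself shifted by a candidate period. This is a naive identity-based illustration and is not MSHDTR: it ignores insertions, deletions, and residue correlations, so it would miss exactly the divergent repeats the abstract describes. The function names `period_score` and `best_period` are hypothetical and chosen only for this example.

```python
def period_score(seq: str, period: int) -> float:
    """Fraction of positions i where seq[i] equals seq[i + period]."""
    pairs = len(seq) - period
    if period < 1 or pairs < 1:
        return 0.0
    matches = sum(seq[i] == seq[i + period] for i in range(pairs))
    return matches / pairs


def best_period(seq: str, max_period: int) -> tuple[int, float]:
    """Return (period, score) with the highest score.

    Python's max() keeps the first maximal item, so ties go to the
    shortest period.
    """
    scored = [(p, period_score(seq, p)) for p in range(1, max_period + 1)]
    return max(scored, key=lambda item: item[1])


if __name__ == "__main__":
    # A perfect dipeptide repeat and a perfect heptad repeat (toy inputs).
    print(best_period("AGAGAGAGAG", 5))
    print(best_period("LKELEEK" * 4, 10))
```

On perfect repeats such as these toy inputs the score reaches 1.0 at the true period. Substitutions lower the score gradually, while a single insertion or deletion shifts the register and breaks the comparison for the rest of the sequence, which is why methods aimed at divergent repeats need alignment-based scoring instead.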


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9c0/8269118/ffc7a9c65ba7/ijms-22-07096-g001a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9c0/8269118/5c653124ea78/ijms-22-07096-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9c0/8269118/3b0d513cda97/ijms-22-07096-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9c0/8269118/6665c77545f7/ijms-22-07096-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9c0/8269118/2a1bc11d43b3/ijms-22-07096-g005.jpg
