• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HMMerThread:通过将宽松的序列数据库搜索与折叠识别相结合,在整个基因组中检测远程、功能保守的结构域。

HMMerThread: detecting remote, functional conserved domains in entire genomes by combining relaxed sequence-database searches with fold recognition.

机构信息

Bioinformatics Laboratory, Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Saxony, Germany.

出版信息

PLoS One. 2011 Mar 10;6(3):e17568. doi: 10.1371/journal.pone.0017568.

DOI:10.1371/journal.pone.0017568
PMID:21423752
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3053371/
Abstract

Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs) are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes. We find ∼4500 cross-species validated, remotely conserved domain predictions in the human proteome alone. As an example, we find a DNA-binding domain in the C-terminal part of the A-kinase anchor protein 10 (AKAP10), a PKA adaptor that has been implicated in cardiac arrhythmias and premature cardiac death, which upon stress likely translocates from mitochondria to the nucleus/nucleolus. Based on our prediction, we propose that with this HLH-domain, AKAP10 is involved in the transcriptional control of stress response. Further remotely conserved domains we discuss are examples from areas such as sporulation, chromosome segregation and signalling during immune response. The HMMerThread algorithm is able to automatically detect the presence of remotely conserved domains in proteins based on weak sequence similarity. Our predictions open up new avenues for biological and medical studies. Genome-wide HMMerThread domains are available at http://vm1-hmmerthread.age.mpg.de.

摘要

蛋白质中的保守结构域是实验设计和全基因组注释的主要功能信息来源之一。尽管像隐马尔可夫模型(HMM)这样的保守结构域数据库搜索工具在蛋白质具有足够的序列相似性时,能够敏感地检测到保守结构域,但它们往往会错过更多具有差异的家族成员,因为它们缺乏可靠的统计框架来检测低序列相似性。我们开发了一种大大改进的 HMMerThread 算法,该算法可以在高度差异的序列中检测到远程保守结构域。HMMerThread 将宽松的保守结构域搜索与折叠识别相结合,以消除假阳性的基于序列的鉴定。我们的软件准确性达到 90%,能够自动预测具有相关三维结构的保守结构域家族的高度差异成员。通过跨物种验证,我们为预测提供了额外的置信度。我们在包括人类在内的八个蛋白质组上运行了 HMMerThread 搜索,并提供了丰富的远程保守结构域资源,这极大地增加了整个蛋白质组的功能注释。我们仅在人类蛋白质组中就发现了约 4500 个跨物种验证的、远程保守结构域预测。例如,我们在 A-激酶锚定蛋白 10(AKAP10)的 C 端部分发现了一个 DNA 结合结构域,AKAP10 是一种 PKA 接头,与心脏心律失常和心脏性猝死有关,在应激下可能从线粒体易位到核/核仁。基于我们的预测,我们提出 AKAP10 通过这个 HLH 结构域参与应激反应的转录控制。我们还讨论了其他一些远程保守结构域,例如孢子形成、染色体分离和免疫反应期间的信号转导。HMMerThread 算法能够基于弱序列相似性自动检测蛋白质中远程保守结构域的存在。我们的预测为生物学和医学研究开辟了新的途径。全基因组 HMMerThread 结构域可在 http://vm1-hmmerthread.age.mpg.de 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/2b18901c1d38/pone.0017568.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/03f0e12d9237/pone.0017568.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/f1c56b595020/pone.0017568.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/df81ac6c4c10/pone.0017568.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/703712e4fac0/pone.0017568.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/567a65cf3b36/pone.0017568.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/2b18901c1d38/pone.0017568.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/03f0e12d9237/pone.0017568.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/f1c56b595020/pone.0017568.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/df81ac6c4c10/pone.0017568.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/703712e4fac0/pone.0017568.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/567a65cf3b36/pone.0017568.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/215c/3053371/2b18901c1d38/pone.0017568.g006.jpg

相似文献

1
HMMerThread: detecting remote, functional conserved domains in entire genomes by combining relaxed sequence-database searches with fold recognition.HMMerThread:通过将宽松的序列数据库搜索与折叠识别相结合,在整个基因组中检测远程、功能保守的结构域。
PLoS One. 2011 Mar 10;6(3):e17568. doi: 10.1371/journal.pone.0017568.
2
ProFAT: a web-based tool for the functional annotation of protein sequences.ProFAT:一个用于蛋白质序列功能注释的基于网络的工具。
BMC Bioinformatics. 2006 Oct 23;7:466. doi: 10.1186/1471-2105-7-466.
3
Improving protein structure similarity searches using domain boundaries based on conserved sequence information.基于保守序列信息利用结构域边界改进蛋白质结构相似性搜索。
BMC Struct Biol. 2009 May 19;9:33. doi: 10.1186/1472-6807-9-33.
4
Fitting hidden Markov models of protein domains to a target species: application to Plasmodium falciparum.将蛋白质结构域的隐马尔可夫模型拟合到目标物种上:在疟原虫中的应用。
BMC Bioinformatics. 2012 May 1;13:67. doi: 10.1186/1471-2105-13-67.
5
eBLOCKs: enumerating conserved protein blocks to achieve maximal sensitivity and specificity.eBLOCKs:枚举保守蛋白质模块以实现最大灵敏度和特异性。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D178-82. doi: 10.1093/nar/gki060.
6
Mechanical stability and differentially conserved physical-chemical properties of titin Ig-domains.肌联蛋白免疫球蛋白结构域的机械稳定性和差异保守的物理化学性质。
Proteins. 2009 May 15;75(3):706-18. doi: 10.1002/prot.22281.
7
The identification of complete domains within protein sequences using accurate E-values for semi-global alignment.使用精确的E值进行半全局比对,以识别蛋白质序列中的完整结构域。
Nucleic Acids Res. 2007;35(14):4678-85. doi: 10.1093/nar/gkm414. Epub 2007 Jun 27.
8
morFeus: a web-based program to detect remotely conserved orthologs using symmetrical best hits and orthology network scoring.MorFeus:一个基于网络的程序,使用对称最佳匹配和同源网络评分来检测远程保守的直系同源物。
BMC Bioinformatics. 2014 Aug 6;15(1):263. doi: 10.1186/1471-2105-15-263.
9
Iterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases.用于蛋白质同源物的迭代序列/二级结构搜索:与氨基酸序列比对的比较及在基因组数据库中折叠识别的应用
Bioinformatics. 2000 Nov;16(11):988-1002. doi: 10.1093/bioinformatics/16.11.988.
10
TMB-Hunt: an amino acid composition based method to screen proteomes for beta-barrel transmembrane proteins.TMB-Hunt:一种基于氨基酸组成的方法,用于在蛋白质组中筛选β-桶状跨膜蛋白。
BMC Bioinformatics. 2005 Mar 15;6:56. doi: 10.1186/1471-2105-6-56.

引用本文的文献

1
Diversity and prevalence of ANTAR RNAs across actinobacteria.放线菌中 ANTAR RNA 的多样性和普遍性。
BMC Microbiol. 2021 May 29;21(1):159. doi: 10.1186/s12866-021-02234-x.

本文引用的文献

1
The Pfam protein families database.Pfam 蛋白质家族数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D211-22. doi: 10.1093/nar/gkp985. Epub 2009 Nov 17.
2
D-AKAP2 interacts with Rab4 and Rab11 through its RGS domains and regulates transferrin receptor recycling.D-AKAP2通过其RGS结构域与Rab4和Rab11相互作用,并调节转铁蛋白受体的循环利用。
J Biol Chem. 2009 Nov 20;284(47):32869-80. doi: 10.1074/jbc.M109.022582. Epub 2009 Sep 21.
3
Human Protein Reference Database and Human Proteinpedia as discovery tools for systems biology.
人类蛋白质参考数据库和人类蛋白质百科全书作为系统生物学的发现工具。
Methods Mol Biol. 2009;577:67-79. doi: 10.1007/978-1-60761-232-2_6.
4
Fast and accurate automatic structure prediction with HHpred.HHpred:快速准确的自动结构预测。
Proteins. 2009;77 Suppl 9:128-32. doi: 10.1002/prot.22499.
5
Beyond the Twilight Zone: automated prediction of structural properties of proteins by recursive neural networks and remote homology information.超越模糊地带:利用递归神经网络和远程同源信息自动预测蛋白质的结构特性
Proteins. 2009 Oct;77(1):181-90. doi: 10.1002/prot.22429.
6
Cohesin acetylation: from antiestablishment to establishment.黏连蛋白乙酰化:从反确立到确立
Mol Cell. 2009 Apr 10;34(1):1-2. doi: 10.1016/j.molcel.2009.03.011.
7
Building sister chromatid cohesion: smc3 acetylation counteracts an antiestablishment activity.建立姐妹染色单体黏连:SMC3乙酰化抵消一种抗建立活性。
Mol Cell. 2009 Mar 27;33(6):763-74. doi: 10.1016/j.molcel.2009.02.028.
8
A functional genomic screen identifies cellular cofactors of hepatitis C virus replication.一项功能基因组筛选鉴定出丙型肝炎病毒复制的细胞辅助因子。
Cell Host Microbe. 2009 Mar 19;5(3):298-307. doi: 10.1016/j.chom.2009.02.001.
9
Budding yeast Wpl1(Rad61)-Pds5 complex counteracts sister chromatid cohesion-establishing reaction.出芽酵母Wpl1(Rad61)-Pds5复合物对抗姐妹染色单体凝聚建立反应。
Curr Biol. 2009 Mar 24;19(6):492-7. doi: 10.1016/j.cub.2009.01.062. Epub 2009 Mar 5.
10
Protein structure prediction on the Web: a case study using the Phyre server.网络上的蛋白质结构预测:使用Phyre服务器的案例研究
Nat Protoc. 2009;4(3):363-71. doi: 10.1038/nprot.2009.2.