HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment.

Affiliation

Gene Center and Center for Integrated Protein Science Munich, Ludwig-Maximilians Universität München, Munich, Germany.

Publication information

Nat Methods. 2011 Dec 25;9(2):173-5. doi: 10.1038/nmeth.1818.

DOI: 10.1038/nmeth.1818
PMID: 22198341

Abstract

Sequence-based protein function and structure prediction depends crucially on sequence-search sensitivity and accuracy of the resulting sequence alignments. We present an open-source, general-purpose tool that represents both query and database sequences by profile hidden Markov models (HMMs): 'HMM-HMM-based lightning-fast iterative sequence search' (HHblits; http://toolkit.genzentrum.lmu.de/hhblits/). Compared to the sequence-search tool PSI-BLAST, HHblits is faster owing to its discretized-profile prefilter, has 50-100% higher sensitivity and generates more accurate alignments.
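
The abstract credits the speed-up to a "discretized-profile prefilter" without spelling out what discretizing a profile means. As a rough illustration only, the general idea can be sketched as mapping each profile column (a probability vector over residues) to the letter of its closest prototype column, so that a profile becomes a plain string that fast string-comparison methods can scan. The four-residue alphabet, the prototype table, and the use of Kullback-Leibler divergence as the closeness measure are all assumptions made for this toy example; they are not taken from the HHblits implementation.

```python
import math

# Hypothetical prototype columns over a toy 4-residue alphabet.
# Real protein profiles have 20 amino-acid probabilities per column.
PROTOTYPES = {
    "a": [0.7, 0.1, 0.1, 0.1],
    "b": [0.1, 0.7, 0.1, 0.1],
    "c": [0.1, 0.1, 0.7, 0.1],
    "d": [0.1, 0.1, 0.1, 0.7],
    "x": [0.25, 0.25, 0.25, 0.25],  # uninformative column
}


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q); q must have no zero entries."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def discretize(profile):
    """Replace each profile column by the letter of its nearest prototype."""
    letters = []
    for column in profile:
        best = min(PROTOTYPES, key=lambda k: kl_divergence(column, PROTOTYPES[k]))
        letters.append(best)
    return "".join(letters)


# A three-column toy profile: strongly residue 0, nearly uniform, strongly residue 3.
profile = [
    [0.8, 0.1, 0.05, 0.05],
    [0.2, 0.3, 0.25, 0.25],
    [0.05, 0.05, 0.1, 0.8],
]
print(discretize(profile))
```

Once profiles are reduced to strings like this, a cheap string-level comparison can discard most database entries before any full HMM-HMM alignment is attempted, which is the kind of saving the abstract attributes to the prefilter.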

