• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

主爆破手:一种远程相关蛋白质敏感识别方法。

Master Blaster: an approach to sensitive identification of remotely related proteins.

机构信息

Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India.

Centre for Development of Advanced Computing, Knowledge Park, Byappanahalli, Bangalore, 560038, India.

出版信息

Sci Rep. 2021 Apr 22;11(1):8746. doi: 10.1038/s41598-021-87833-4.

DOI:10.1038/s41598-021-87833-4
PMID:33888741
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8062480/
Abstract

Genome sequencing projects unearth sequences of all the protein sequences encoded in a genome. As the first step, homology detection is employed to obtain clues to structure and function of these proteins. However, high evolutionary divergence between homologous proteins challenges our ability to detect distant relationships. In the past, an approach involving multiple Position Specific Scoring Matrices (PSSMs) was found to be more effective than traditional single PSSMs. Cascaded search is another successful approach where hits of a search are queried to detect more homologues. We propose a protocol, 'Master Blaster', which combines the principles adopted in these two approaches to enhance our ability to detect remote homologues even further. Assessment of the approach was performed using known relationships available in the SCOP70 database, and the results were compared against that of PSI-BLAST and HHblits, a hidden Markov model-based method. Compared to PSI-BLAST, Master Blaster resulted in 10% improvement with respect to detection of cross superfamily connections, nearly 35% improvement in cross family and more than 80% improvement in intra family connections. From the results it was observed that HHblits is more sensitive in detecting remote homologues compared to Master Blaster. However, there are true hits from 46-folds for which Master Blaster reported homologs that are not reported by HHblits even using the optimal parameters indicating that for detecting remote homologues, use of multiple methods employing a combination of different approaches can be more effective in detecting remote homologs. Master Blaster stand-alone code is available for download in the supplementary archive.

摘要

基因组测序项目揭示了基因组中所有蛋白质序列的序列。作为第一步,同源性检测用于获得这些蛋白质的结构和功能线索。然而,同源蛋白之间的高度进化分歧挑战了我们检测远缘关系的能力。过去,发现涉及多个位置特异性评分矩阵(PSSM)的方法比传统的单个 PSSM 更有效。级联搜索是另一种成功的方法,其中搜索的命中被查询以检测更多的同源物。我们提出了一种名为“Master Blaster”的协议,该协议结合了这两种方法中采用的原理,以进一步提高我们检测远程同源物的能力。使用 SCOP70 数据库中可用的已知关系对该方法进行了评估,并将结果与 PSI-BLAST 和 HHblits(基于隐马尔可夫模型的方法)进行了比较。与 PSI-BLAST 相比,Master Blaster 在检测跨超家族连接方面提高了 10%,在跨家族方面提高了近 35%,在家族内连接方面提高了 80%以上。从结果中可以看出,与 Master Blaster 相比,HHblits 在检测远程同源物方面更敏感。然而,有 46 倍的真实命中,Master Blaster 报告了 HHblits 没有报告的同源物,即使使用最优参数也表明,对于检测远程同源物,使用多种方法结合不同方法的组合可以更有效地检测远程同源物。Master Blaster 独立代码可在补充档案中下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/a0e9f6bc789d/41598_2021_87833_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/5121bb12f444/41598_2021_87833_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/6cf3c9286afd/41598_2021_87833_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/a189ec34b655/41598_2021_87833_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/0a050762902f/41598_2021_87833_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/858b01d68222/41598_2021_87833_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/f759ae91de9b/41598_2021_87833_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/a0e9f6bc789d/41598_2021_87833_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/5121bb12f444/41598_2021_87833_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/6cf3c9286afd/41598_2021_87833_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/a189ec34b655/41598_2021_87833_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/0a050762902f/41598_2021_87833_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/858b01d68222/41598_2021_87833_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/f759ae91de9b/41598_2021_87833_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944a/8062480/a0e9f6bc789d/41598_2021_87833_Fig7_HTML.jpg

相似文献

1
Master Blaster: an approach to sensitive identification of remotely related proteins.主爆破手:一种远程相关蛋白质敏感识别方法。
Sci Rep. 2021 Apr 22;11(1):8746. doi: 10.1038/s41598-021-87833-4.
2
Assessment of a rigorous transitive profile based search method to detect remotely similar proteins.一种用于检测远距离相似蛋白质的严格传递轮廓搜索方法的评估。
J Biomol Struct Dyn. 2005 Dec;23(3):283-98. doi: 10.1080/07391102.2005.10507066.
3
Use of multiple profiles corresponding to a sequence alignment enables effective detection of remote homologues.使用与序列比对相对应的多个图谱能够有效地检测远源同源物。
Bioinformatics. 2005 Jun 15;21(12):2821-6. doi: 10.1093/bioinformatics/bti432. Epub 2005 Apr 7.
4
PL-search: a profile-link-based search method for protein remote homology detection.PL-search:一种基于轮廓链接的蛋白质远程同源性检测搜索方法。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa051.
5
Rapid and enhanced remote homology detection by cascading hidden Markov model searches in sequence space.通过在序列空间中级联隐马尔可夫模型搜索实现快速且增强的远程同源检测。
Bioinformatics. 2016 Feb 1;32(3):338-44. doi: 10.1093/bioinformatics/btv538. Epub 2015 Oct 10.
6
MulPSSM: a database of multiple position-specific scoring matrices of protein domain families.MulPSSM:蛋白质结构域家族的多位置特异性评分矩阵数据库。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D243-6. doi: 10.1093/nar/gkj043.
7
Improved detection of remote homologues using cascade PSI-BLAST: influence of neighbouring protein families on sequence coverage.利用级联 PSI-BLAST 提高远程同源物的检测:邻近蛋白质家族对序列覆盖度的影响。
PLoS One. 2013;8(2):e56449. doi: 10.1371/journal.pone.0056449. Epub 2013 Feb 20.
8
Filling-in void and sparse regions in protein sequence space by protein-like artificial sequences enables remarkable enhancement in remote homology detection capability.通过类似蛋白质的人工序列填补蛋白质序列空间中的空白和稀疏区域,可以显著提高远程同源检测能力。
J Mol Biol. 2014 Feb 20;426(4):962-79. doi: 10.1016/j.jmb.2013.11.026. Epub 2013 Dec 4.
9
SMI-BLAST: a novel supervised search framework based on PSI-BLAST for protein remote homology detection.SMI-BLAST:一种基于 PSI-BLAST 的新型监督搜索框架,用于蛋白质远程同源检测。
Bioinformatics. 2021 May 17;37(7):913-920. doi: 10.1093/bioinformatics/btaa772.
10
ProClust: improved clustering of protein sequences with an extended graph-based approach.ProClust:基于扩展的图形方法改进蛋白质序列聚类
Bioinformatics. 2002;18 Suppl 2:S182-91. doi: 10.1093/bioinformatics/18.suppl_2.s182.

引用本文的文献

1
Comparative genomics of reveals genomic adaptations in bovine genital strains.[具体研究对象]的比较基因组学揭示了牛生殖系统菌株的基因组适应性。 (原文中“of”后缺少具体内容,这里补充为“[具体研究对象]”使译文更完整)
Front Microbiol. 2025 Jan 7;15:1517151. doi: 10.3389/fmicb.2024.1517151. eCollection 2024.
2
Quantifying attention in children with intellectual and developmental disabilities through multicenter electrooculogram signal analysis.通过多中心眼动电图信号分析量化智障和发育障碍儿童的注意力。
Sci Rep. 2024 Sep 27;14(1):22186. doi: 10.1038/s41598-024-70304-x.

本文引用的文献

1
Pfam: The protein families database in 2021.Pfam:2021 年的蛋白质家族数据库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D412-D419. doi: 10.1093/nar/gkaa913.
2
UniProt: a worldwide hub of protein knowledge.UniProt:蛋白质知识的全球枢纽。
Nucleic Acids Res. 2019 Jan 8;47(D1):D506-D515. doi: 10.1093/nar/gky1049.
3
New computational approaches to understanding molecular protein function.理解分子蛋白质功能的新计算方法。
PLoS Comput Biol. 2018 Apr 5;14(4):e1005756. doi: 10.1371/journal.pcbi.1005756. eCollection 2018 Apr.
4
HIPPI: highly accurate protein family classification with ensembles of HMMs.HIPPI:利用隐马尔可夫模型集合进行高精度蛋白质家族分类
BMC Genomics. 2016 Nov 11;17(Suppl 10):765. doi: 10.1186/s12864-016-3097-0.
5
A comprehensive review and comparison of different computational methods for protein remote homology detection.蛋白质远程同源检测不同计算方法的综合回顾与比较。
Brief Bioinform. 2018 Mar 1;19(2):231-244. doi: 10.1093/bib/bbw108.
6
Benchmarking the next generation of homology inference tools.对下一代同源性推断工具进行基准测试。
Bioinformatics. 2016 Sep 1;32(17):2636-41. doi: 10.1093/bioinformatics/btw305. Epub 2016 Jun 1.
7
Finding Protein and Nucleotide Similarities with FASTA.使用FASTA查找蛋白质和核苷酸的相似性。
Curr Protoc Bioinformatics. 2016 Mar 24;53:3.9.1-3.9.25. doi: 10.1002/0471250953.bi0309s53.
8
Protein sequence design and its applications.蛋白质序列设计及其应用。
Curr Opin Struct Biol. 2016 Apr;37:71-80. doi: 10.1016/j.sbi.2015.12.004. Epub 2016 Jan 8.
9
The TIM Barrel Architecture Facilitated the Early Evolution of Protein-Mediated Metabolism.TIM桶状结构促进了蛋白质介导的代谢的早期进化。
J Mol Evol. 2016 Jan;82(1):17-26. doi: 10.1007/s00239-015-9722-8. Epub 2016 Jan 5.
10
Evolutionary relationship of two ancient protein superfolds.两种古老蛋白质超折叠的进化关系。
Nat Chem Biol. 2014 Sep;10(9):710-5. doi: 10.1038/nchembio.1579. Epub 2014 Jul 13.