• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

远程同源性检测方法可以结合起来,使“午夜区”的覆盖率提高10%。

Methods of remote homology detection can be combined to increase coverage by 10% in the midnight zone.

作者信息

Reid Adam James, Yeats Corin, Orengo Christine Anne

机构信息

Department of Biochemistry and Molecular Biology, University College London, Gower Street, London WC1E 6BT, UK.

出版信息

Bioinformatics. 2007 Sep 15;23(18):2353-60. doi: 10.1093/bioinformatics/btm355. Epub 2007 Aug 20.

DOI:10.1093/bioinformatics/btm355
PMID:17709341
Abstract

MOTIVATION

A recent development in sequence-based remote homologue detection is the introduction of profile-profile comparison methods. These are more powerful than previous technologies and can detect potentially homologous relationships missed by structural classifications such as CATH and SCOP. As structural classifications traditionally act as the gold standard of homology this poses a challenge in benchmarking them.

RESULTS

We present a novel approach which allows an accurate benchmark of these methods against the CATH structural classification. We then apply this approach to assess the accuracy of a range of publicly available methods for remote homology detection including several profile-profile methods (COMPASS, HHSearch, PRC) from two perspectives. First, in distinguishing homologous domains from non-homologues and second, in annotating proteomes with structural domain families. PRC is shown to be the best method for distinguishing homologues. We show that SAM is the best practical method for annotating genomes, whilst using COMPASS for the most remote homologues would increase coverage. Finally, we introduce a simple approach to increase the sensitivity of remote homologue detection by up to 10%. This is achieved by combining multiple methods with a jury vote.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

基于序列的远程同源物检测的最新进展是引入了profile-profile比较方法。这些方法比以前的技术更强大,能够检测出诸如CATH和SCOP等结构分类所遗漏的潜在同源关系。由于结构分类传统上是同源性的金标准,这给对它们进行基准测试带来了挑战。

结果

我们提出了一种新颖的方法,该方法可以针对CATH结构分类对这些方法进行准确的基准测试。然后,我们从两个角度应用此方法来评估一系列用于远程同源性检测的公开可用方法的准确性,包括几种profile-profile方法(COMPASS、HHSearch、PRC)。首先,区分同源结构域和非同源结构域;其次,用结构域家族注释蛋白质组。结果表明,PRC是区分同源物的最佳方法。我们表明,SAM是注释基因组的最佳实用方法,而使用COMPASS检测最远距离的同源物会增加覆盖率。最后,我们引入了一种简单的方法,可将远程同源物检测的灵敏度提高多达10%。这是通过将多种方法与多数投票相结合来实现的。

补充信息

补充数据可在《生物信息学》在线版获取。

相似文献

1
Methods of remote homology detection can be combined to increase coverage by 10% in the midnight zone.远程同源性检测方法可以结合起来,使“午夜区”的覆盖率提高10%。
Bioinformatics. 2007 Sep 15;23(18):2353-60. doi: 10.1093/bioinformatics/btm355. Epub 2007 Aug 20.
2
The global trace graph, a novel paradigm for searching protein sequence databases.全局追踪图,一种搜索蛋白质序列数据库的新范式。
Bioinformatics. 2007 Sep 15;23(18):2361-7. doi: 10.1093/bioinformatics/btm358. Epub 2007 Sep 6.
3
Optimizing the size of the sequence profiles to increase the accuracy of protein sequence alignments generated by profile-profile algorithms.优化序列轮廓的大小,以提高由轮廓-轮廓算法生成的蛋白质序列比对的准确性。
Bioinformatics. 2008 May 1;24(9):1145-53. doi: 10.1093/bioinformatics/btn097. Epub 2008 Mar 12.
4
SVM-HUSTLE--an iterative semi-supervised machine learning approach for pairwise protein remote homology detection.SVM-HUSTLE——一种用于成对蛋白质远程同源性检测的迭代半监督机器学习方法。
Bioinformatics. 2008 Mar 15;24(6):783-90. doi: 10.1093/bioinformatics/btn028. Epub 2008 Feb 1.
5
Benchmarking PSI-BLAST in genome annotation.在基因组注释中对PSI-BLAST进行基准测试。
J Mol Biol. 1999 Nov 12;293(5):1257-71. doi: 10.1006/jmbi.1999.3233.
6
A comparison of scoring functions for protein sequence profile alignment.蛋白质序列谱比对评分函数的比较
Bioinformatics. 2004 May 22;20(8):1301-8. doi: 10.1093/bioinformatics/bth090. Epub 2004 Feb 12.
7
Remote homology detection of integral membrane proteins using conserved sequence features.利用保守序列特征进行整合膜蛋白的远程同源性检测。
Proteins. 2008 May 15;71(3):1387-99. doi: 10.1002/prot.21825.
8
The WWWH of remote homolog detection: the state of the art.远程同源物检测的“WWWH”:当前技术水平。
Brief Bioinform. 2007 Mar;8(2):78-87. doi: 10.1093/bib/bbl032. Epub 2006 Sep 26.
9
Remote homology detection based on oligomer distances.基于寡聚体距离的远程同源性检测。
Bioinformatics. 2006 Sep 15;22(18):2224-31. doi: 10.1093/bioinformatics/btl376. Epub 2006 Jul 12.
10
On single and multiple models of protein families for the detection of remote sequence relationships.用于检测远缘序列关系的蛋白质家族单模型和多模型研究
BMC Bioinformatics. 2006 Jan 31;7:48. doi: 10.1186/1471-2105-7-48.

引用本文的文献

1
On the reliability and the limits of inference of amino acid sequence alignments.关于氨基酸序列比对的可靠性和推断限制。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i255-i263. doi: 10.1093/bioinformatics/btac247.
2
Structural dynamics in the evolution of a bilobed protein scaffold.二叶瓣蛋白支架结构动力学在进化中的作用。
Proc Natl Acad Sci U S A. 2021 Dec 7;118(49). doi: 10.1073/pnas.2026165118.
3
Structure-based functional annotation of putative conserved proteins having lyase activity from Haemophilus influenzae.基于结构的来自流感嗜血杆菌的具有裂解酶活性的假定保守蛋白的功能注释
3 Biotech. 2015 Jun;5(3):317-336. doi: 10.1007/s13205-014-0231-z. Epub 2014 Jun 17.
4
Scrutinizing the immune defence inventory of Camponotus floridanus applying total transcriptome sequencing.运用全转录组测序技术审视佛罗里达弓背蚁的免疫防御体系。
BMC Genomics. 2015 Jul 22;16(1):540. doi: 10.1186/s12864-015-1748-1.
5
The natural history of biocatalytic mechanisms.生物催化机制的自然史。
PLoS Comput Biol. 2014 May 29;10(5):e1003642. doi: 10.1371/journal.pcbi.1003642. eCollection 2014 May.
6
Genome3D: a UK collaborative project to annotate genomic sequences with predicted 3D structures based on SCOP and CATH domains.Genome3D:一个英国合作项目,基于 SCOP 和 CATH 结构域,对基因组序列进行注释和预测三维结构。
Nucleic Acids Res. 2013 Jan;41(Database issue):D499-507. doi: 10.1093/nar/gks1266. Epub 2012 Nov 30.
7
Proteases in malaria parasites - a phylogenomic perspective.疟原虫中的蛋白酶——系统发生基因组学的视角。
Curr Genomics. 2011 Sep;12(6):417-27. doi: 10.2174/138920211797248565.
8
HHsvm: fast and accurate classification of profile-profile matches identified by HHsearch.HHsvm:快速准确地对 HHsearch 识别的 Profile-Profile 比对进行分类。
Bioinformatics. 2009 Dec 1;25(23):3071-6. doi: 10.1093/bioinformatics/btp555. Epub 2009 Sep 22.
9
The CATH hierarchy revisited-structural divergence in domain superfamilies and the continuity of fold space.重新审视 CATH 层次结构——结构域超家族中的差异以及折叠空间的连续性。
Structure. 2009 Aug 12;17(8):1051-62. doi: 10.1016/j.str.2009.06.015.
10
COMPASS server for homology detection: improved statistical accuracy, speed and functionality.用于同源性检测的COMPASS服务器:提高统计准确性、速度和功能。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W90-4. doi: 10.1093/nar/gkp360. Epub 2009 May 12.