• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过蛋白质序列数据库搜索方法的组合获得更高的覆盖率。

Increased coverage obtained by combination of methods for protein sequence database searching.

作者信息

Webber Caleb, Barton Geoffrey J

机构信息

EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, England, UK.

出版信息

Bioinformatics. 2003 Jul 22;19(11):1397-403. doi: 10.1093/bioinformatics/btg156.

DOI:10.1093/bioinformatics/btg156
PMID:12874052
Abstract

MOTIVATION

Sequence alignment methods that compare two sequences (pairwise methods) are important tools for the detection of biological sequence relationships. In genome annotation, multiple methods are often run and agreement between methods taken as confirmation. In this paper, we assess the advantages of combining search methods by comparing seven pairwise alignment methods, including three local dynamic programming algorithms (PRSS, SSEARCH and SCANPS), two global dynamic programming algorithms (GSRCH and AMPS) and two heuristic approximations (BLAST and FASTA), individually and by pairwise intersection and union of their result lists at equal p-value cut-offs.

RESULTS

When applied singly, the dynamic programming methods SCANPS and SSEARCH gave significantly better coverage (p=0.01) compared to AMPS, GSRCH, PRSS, BLAST and FASTA. Results ranked by BLAST p-values gave significantly better coverage compared to ranking by BLAST e-values. Of 56 combinations of eight methods considered, 19 gave significant increases in coverage at low error compared to the parent methods at an equal p-value cutoff. The union of results by BLAST (p-value) and FASTA at an equal p-value cutoff gave significantly better coverage than either method individually. The best overall performance was obtained from the intersection of the results from SSEARCH and the GSRCH62 global alignment method. At an error level of five false positives, this combination found 444 true positives, a significant 12.4% increase over SSEARCH applied alone.

摘要

动机

比较两个序列的序列比对方法(成对方法)是检测生物序列关系的重要工具。在基因组注释中,通常会运行多种方法,并将方法之间的一致性作为确认依据。在本文中,我们通过比较七种成对比对方法来评估组合搜索方法的优势,这七种方法包括三种局部动态规划算法(PRSS、SSEARCH和SCANPS)、两种全局动态规划算法(GSRCH和AMPS)以及两种启发式近似算法(BLAST和FASTA),分别单独应用,并在相同的p值截止值下通过其结果列表的成对交集和并集来进行评估。

结果

单独应用时,与AMPS、GSRCH、PRSS、BLAST和FASTA相比,动态规划方法SCANPS和SSEARCH具有显著更好的覆盖率(p = 0.01)。与按BLAST e值排名相比,按BLAST p值排名的结果具有显著更好的覆盖率。在考虑的八种方法的56种组合中,与在相同p值截止值下的父方法相比,有19种在低错误率下覆盖率显著提高。在相同p值截止值下,BLAST(p值)和FASTA结果的并集比单独使用任何一种方法都具有显著更好的覆盖率。最佳总体性能来自SSEARCH结果与GSRCH62全局比对方法的交集。在五个假阳性的错误水平下,这种组合发现了444个真阳性,比单独应用SSEARCH显著增加了12.4%。

相似文献

1
Increased coverage obtained by combination of methods for protein sequence database searching.通过蛋白质序列数据库搜索方法的组合获得更高的覆盖率。
Bioinformatics. 2003 Jul 22;19(11):1397-403. doi: 10.1093/bioinformatics/btg156.
2
Fast and accurate database homology search using upper bounds of local alignment scores.使用局部比对得分的上限进行快速准确的数据库同源性搜索。
Bioinformatics. 2005 Apr 1;21(7):912-21. doi: 10.1093/bioinformatics/bti076. Epub 2004 Oct 27.
3
Assessing sequence comparison methods with the average precision criterion.
Bioinformatics. 2003 Dec 12;19(18):2456-60. doi: 10.1093/bioinformatics/btg349.
4
Serial BLAST searching.序列比对搜索
Bioinformatics. 2003 Aug 12;19(12):1492-6. doi: 10.1093/bioinformatics/btg199.
5
PISCES: a protein sequence culling server.双鱼座:一个蛋白质序列筛选服务器。
Bioinformatics. 2003 Aug 12;19(12):1589-91. doi: 10.1093/bioinformatics/btg224.
6
STORM towards protein function: systematic tailored ORF-data retrieval and management.面向蛋白质功能的STORM:系统定制的开放阅读框数据检索与管理
Appl Bioinformatics. 2003;2(3):177-9.
7
Soap-HT-BLAST: high throughput BLAST based on Web services.Soap-HT-BLAST:基于网络服务的高通量BLAST
Bioinformatics. 2003 Sep 22;19(14):1863-4. doi: 10.1093/bioinformatics/btg244.
8
Blast sampling for structural and functional analyses.用于结构和功能分析的胚细胞采样。
BMC Bioinformatics. 2007 Feb 23;8:62. doi: 10.1186/1471-2105-8-62.
9
Parallel BLAST on split databases.对拆分数据库进行并行BLAST。
Bioinformatics. 2003 Sep 22;19(14):1865-6. doi: 10.1093/bioinformatics/btg250.
10
Local structure-based sequence profile database for local and global protein structure predictions.用于局部和全局蛋白质结构预测的基于局部结构的序列谱数据库。
Bioinformatics. 2002 Dec;18(12):1650-7. doi: 10.1093/bioinformatics/18.12.1650.

引用本文的文献

1
Comparative evaluation of methods for the prediction of protein-ligand binding sites.蛋白质-配体结合位点预测方法的比较评估
J Cheminform. 2024 Nov 11;16(1):126. doi: 10.1186/s13321-024-00923-z.
2
Genomic analyses reveal a conserved glutathione homeostasis pathway in the invertebrate chordate Ciona intestinalis.基因组分析揭示了无脊椎脊索动物文昌鱼中保守的谷胱甘肽稳态途径。
Physiol Genomics. 2009 Nov 6;39(3):183-94. doi: 10.1152/physiolgenomics.00025.2009. Epub 2009 May 26.
3
GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes.
GOtcha:一种通过七个基因组注释评估蛋白质功能预测的新方法。
BMC Bioinformatics. 2004 Nov 18;5:178. doi: 10.1186/1471-2105-5-178.