• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes.ProteinWorldDB:查询来自完整基因组的蛋白质组之间的激进两两比对。
Bioinformatics. 2010 Mar 1;26(5):705-7. doi: 10.1093/bioinformatics/btq011. Epub 2010 Jan 19.
2
MODBASE, a database of annotated comparative protein structure models, and associated resources.MODBASE,一个带注释的比较蛋白质结构模型数据库及相关资源。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D217-22. doi: 10.1093/nar/gkh095.
3
Iterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases.用于蛋白质同源物的迭代序列/二级结构搜索:与氨基酸序列比对的比较及在基因组数据库中折叠识别的应用
Bioinformatics. 2000 Nov;16(11):988-1002. doi: 10.1093/bioinformatics/16.11.988.
4
SIMAP: the similarity matrix of proteins.SIMAP:蛋白质相似性矩阵。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D252-6. doi: 10.1093/nar/gkj106.
5
RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits.RibAlign:一种基于串联核糖体蛋白亚基的真细菌系统发育分析的软件工具和数据库。
BMC Bioinformatics. 2006 Feb 13;7:66. doi: 10.1186/1471-2105-7-66.
6
GATA: a graphic alignment tool for comparative sequence analysis.GATA:一种用于比较序列分析的图形比对工具。
BMC Bioinformatics. 2005 Jan 17;6:9. doi: 10.1186/1471-2105-6-9.
7
GenomeVISTA--an integrated software package for whole-genome alignment and visualization.GenomeVISTA——一个用于全基因组比对和可视化的集成软件包。
Bioinformatics. 2014 Sep 15;30(18):2654-5. doi: 10.1093/bioinformatics/btu355. Epub 2014 May 23.
8
Screening synteny blocks in pairwise genome comparisons through integer programming.通过整数规划在成对基因组比较中筛选同线性块。
BMC Bioinformatics. 2011 Apr 18;12:102. doi: 10.1186/1471-2105-12-102.
9
Multiple whole-genome alignments without a reference organism.无参考生物体的多个全基因组比对
Genome Res. 2009 Apr;19(4):682-9. doi: 10.1101/gr.081778.108. Epub 2009 Jan 28.
10
PASS2: an automated database of protein alignments organised as structural superfamilies.PASS2:一个以结构超家族形式组织的蛋白质比对自动化数据库。
BMC Bioinformatics. 2004 Apr 2;5:35. doi: 10.1186/1471-2105-5-35.

引用本文的文献

1
Bio-Strings: A Relational Database Data-Type for Dealing with Large Biosequences.生物字符串:一种用于处理大型生物序列的关系型数据库数据类型。
BioTech (Basel). 2022 Jul 30;11(3):31. doi: 10.3390/biotech11030031.

本文引用的文献

1
AnEnPi: identification and annotation of analogous enzymes.AnEnPi:类似酶的鉴定与注释
BMC Bioinformatics. 2008 Dec 17;9:544. doi: 10.1186/1471-2105-9-544.
2
SIMAP--structuring the network of protein similarities.SIMAP——构建蛋白质相似性网络
Nucleic Acids Res. 2008 Jan;36(Database issue):D289-92. doi: 10.1093/nar/gkm963. Epub 2007 Nov 23.
3
Identification of homologs in insignificant blast hits by exploiting extrinsic gene properties.通过利用外在基因特性鉴定无显著比对结果的同源物。
BMC Bioinformatics. 2007 Sep 21;8:356. doi: 10.1186/1471-2105-8-356.
4
MBGD: a platform for microbial comparative genomics based on the automated construction of orthologous groups.MBGD:一个基于直系同源群自动构建的微生物比较基因组学平台。
Nucleic Acids Res. 2007 Jan;35(Database issue):D343-6. doi: 10.1093/nar/gkl978. Epub 2006 Nov 29.
5
From genomics to chemical genomics: new developments in KEGG.从基因组学到化学基因组学:KEGG的新进展
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D354-7. doi: 10.1093/nar/gkj102.
6
The limits of protein sequence comparison?蛋白质序列比较的局限性?
Curr Opin Struct Biol. 2005 Jun;15(3):254-60. doi: 10.1016/j.sbi.2005.05.005.
7
The many faces of sequence alignment.序列比对的多种形式。
Brief Bioinform. 2005 Mar;6(1):6-22. doi: 10.1093/bib/6.1.6.
8
How well is enzyme function conserved as a function of pairwise sequence identity?酶功能作为成对序列同一性的函数,其保守程度如何?
J Mol Biol. 2003 Oct 31;333(4):863-82. doi: 10.1016/j.jmb.2003.08.057.
9
Enzyme function less conserved than anticipated.酶功能的保守性低于预期。
J Mol Biol. 2002 Apr 26;318(2):595-608. doi: 10.1016/S0022-2836(02)00016-5.
10
Empirical statistical estimates for sequence similarity searches.序列相似性搜索的经验性统计估计。
J Mol Biol. 1998 Feb 13;276(1):71-84. doi: 10.1006/jmbi.1997.1525.

ProteinWorldDB:查询来自完整基因组的蛋白质组之间的激进两两比对。

ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes.

机构信息

Laboratório de Genômica Funcional e Bioinformática, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil.

出版信息

Bioinformatics. 2010 Mar 1;26(5):705-7. doi: 10.1093/bioinformatics/btq011. Epub 2010 Jan 19.

DOI:10.1093/bioinformatics/btq011
PMID:20089515
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2828119/
Abstract

MOTIVATION

Many analyses in modern biological research are based on comparisons between biological sequences, resulting in functional, evolutionary and structural inferences. When large numbers of sequences are compared, heuristics are often used resulting in a certain lack of accuracy. In order to improve and validate results of such comparisons, we have performed radical all-against-all comparisons of 4 million protein sequences belonging to the RefSeq database, using an implementation of the Smith-Waterman algorithm. This extremely intensive computational approach was made possible with the help of World Community Grid, through the Genome Comparison Project. The resulting database, ProteinWorldDB, which contains coordinates of pairwise protein alignments and their respective scores, is now made available. Users can download, compare and analyze the results, filtered by genomes, protein functions or clusters. ProteinWorldDB is integrated with annotations derived from Swiss-Prot, Pfam, KEGG, NCBI Taxonomy database and gene ontology. The database is a unique and valuable asset, representing a major effort to create a reliable and consistent dataset of cross-comparisons of the whole protein content encoded in hundreds of completely sequenced genomes using a rigorous dynamic programming approach.

AVAILABILITY

The database can be accessed through http://proteinworlddb.org

摘要

动机

现代生物研究中的许多分析都是基于生物序列之间的比较,从而得出功能、进化和结构推断。当比较大量的序列时,通常会使用启发式方法,这导致了一定程度的准确性不足。为了改进和验证此类比较的结果,我们使用 Smith-Waterman 算法的实现,对属于 RefSeq 数据库的 400 万个蛋白质序列进行了激进的全对全比较。这种极其密集的计算方法是在世界社区网格(通过基因组比较项目)的帮助下实现的。由此产生的数据库 ProteinWorldDB 现在提供了包含两两蛋白质比对的坐标及其各自得分的信息。用户可以下载、比较和分析结果,根据基因组、蛋白质功能或簇进行过滤。ProteinWorldDB 与来自 Swiss-Prot、Pfam、KEGG、NCBI Taxonomy 数据库和基因本体论的注释集成在一起。该数据库是一项独特而有价值的资产,它代表了使用严格的动态编程方法对数百个完全测序的基因组中编码的整个蛋白质内容进行交叉比较的可靠且一致数据集的重要努力。

可用性

该数据库可通过 http://proteinworlddb.org 访问。