

UFO: a web server for ultra-fast functional profiling of whole genome protein sequences.

Author information

Peter Meinicke

Affiliation

Department of Bioinformatics, Institute of Microbiology and Genetics, Georg-August-University Göttingen, Germany.

Publication information

BMC Genomics. 2009 Sep 2;10:409. doi: 10.1186/1471-2164-10-409.

DOI: 10.1186/1471-2164-10-409
PMID: 19725959
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2744726/
Abstract

BACKGROUND

Functional profiling is a key technique to characterize and compare the functional potential of entire genomes. The estimation of profiles according to an assignment of sequences to functional categories is a computationally expensive task because it requires the comparison of all protein sequences from a genome with a usually large database of annotated sequences or sequence families.

DESCRIPTION

Based on machine learning techniques for Pfam domain detection, the UFO web server for ultra-fast functional profiling allows researchers to process large protein sequence collections instantaneously. Besides the frequencies of Pfam and GO categories, the user also obtains the sequence specific assignments to Pfam domain families. In addition, a comparison with existing genomes provides dissimilarity scores with respect to 821 reference proteomes. Considering the underlying UFO domain detection, the results on 206 test genomes indicate a high sensitivity of the approach. In comparison with current state-of-the-art HMMs, the runtime measurements show a considerable speed up in the range of four orders of magnitude. For an average size prokaryotic genome, the computation of a functional profile together with its comparison typically requires about 10 seconds of processing time.

CONCLUSION

For the first time the UFO web server makes it possible to get a quick overview on the functional inventory of newly sequenced organisms. The genome scale comparison with a large number of precomputed profiles allows a first guess about functionally related organisms. The service is freely available and does not require user registration or specification of a valid email address.
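The abstract describes two outputs: frequencies of Pfam categories, and dissimilarity scores against precomputed reference profiles. The sketch below illustrates that idea under stated assumptions and is not the UFO implementation. The abstract does not name the dissimilarity measure, so half the L1 distance between relative-frequency profiles is used here as a stand-in. The function names, the reference names, and the toy assignments are all hypothetical, and the Pfam accessions are placeholders for illustration only.

```python
from collections import Counter


def functional_profile(assignments):
    """Relative frequency of each Pfam family over all detected domains.

    `assignments` maps a sequence ID to the list of Pfam accessions
    detected in that sequence (hypothetical input format).
    """
    counts = Counter(fam for fams in assignments.values() for fam in fams)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {fam: n / total for fam, n in counts.items()}


def dissimilarity(p, q):
    """Half the L1 distance between two profiles.

    Assumed measure, not taken from the paper:
    0.0 means identical profiles, 1.0 means no shared families.
    """
    families = set(p) | set(q)
    return 0.5 * sum(abs(p.get(f, 0.0) - q.get(f, 0.0)) for f in families)


def rank_references(query, references):
    """Sort reference profiles from most to least similar to the query."""
    scored = [(name, dissimilarity(query, prof)) for name, prof in references.items()]
    return sorted(scored, key=lambda item: item[1])


# Toy data: per-sequence domain assignments for a query proteome.
query_assignments = {
    "seq1": ["PF00005", "PF00072"],
    "seq2": ["PF00005"],
    "seq3": [],  # no domain detected
}

# Toy precomputed reference profiles (hypothetical names and values).
reference_profiles = {
    "ref_A": {"PF00005": 0.5, "PF00072": 0.5},
    "ref_B": {"PF00001": 1.0},
}

query_profile = functional_profile(query_assignments)
ranking = rank_references(query_profile, reference_profiles)
for name, score in ranking:
    print(f"{name}\t{score:.3f}")
```

Precomputing the reference profiles once means each new query costs only one pass over its own assignments plus one distance computation per reference, which is consistent with the abstract's point that the comparison against 821 reference proteomes adds little to the runtime.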




