Seq2Ref: a web server to facilitate functional interpretation.

Author information

Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX 75390-9050, USA.

Publication information

BMC Bioinformatics. 2013 Jan 28;14:30. doi: 10.1186/1471-2105-14-30.

DOI: 10.1186/1471-2105-14-30

PMID: 23356573

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3573977/

Abstract

BACKGROUND

The size of the protein sequence database has been exponentially increasing due to advances in genome sequencing. However, experimentally characterized proteins only constitute a small portion of the database, such that the majority of sequences have been annotated by computational approaches. Current automatic annotation pipelines inevitably introduce errors, making the annotations unreliable. Instead of such error-prone automatic annotations, functional interpretation should rely on annotations of 'reference proteins' that have been experimentally characterized or manually curated.

RESULTS

The Seq2Ref server uses BLAST to detect proteins homologous to a query sequence and identifies the reference proteins among them. Seq2Ref then reports publications with experimental characterizations of the identified reference proteins that might be relevant to the query. Furthermore, a plurality-based rating system is developed to evaluate the homologous relationships and rank the reference proteins by their relevance to the query.
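
The first two steps described above (detect homologs with BLAST, then keep only the reference proteins among them) can be illustrated with a minimal Python sketch. It is not the Seq2Ref implementation. It assumes the BLAST hits are already available in the standard 12-column tabular format (`-outfmt 6`) and that a set of reference protein identifiers is supplied by the caller. The names `Hit`, `parse_blast_tabular`, `reference_hits`, the example identifiers, and the E-value cutoff are all hypothetical. Sorting by bit score is only a simple stand-in for ranking; the plurality-based rating system is not specified in the abstract and is not reproduced here.

```python
from typing import Iterable, NamedTuple


class Hit(NamedTuple):
    """One BLAST hit, reduced to the fields this sketch needs."""
    subject_id: str
    evalue: float
    bitscore: float


def parse_blast_tabular(lines: Iterable[str]) -> list[Hit]:
    """Parse BLAST tabular output (-outfmt 6, default 12 columns).

    Default column order: qseqid sseqid pident length mismatch gapopen
    qstart qend sstart send evalue bitscore.
    """
    hits = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        hits.append(Hit(fields[1], float(fields[10]), float(fields[11])))
    return hits


def reference_hits(hits, reference_ids, max_evalue=1e-3):
    """Keep significant hits that are reference proteins, best bit score first.

    Assumption: `reference_ids` is a caller-supplied set of identifiers for
    experimentally characterized or manually curated proteins. The cutoff
    `max_evalue` is an illustrative default, not a value from the paper.
    """
    kept = [
        h for h in hits
        if h.evalue <= max_evalue and h.subject_id in reference_ids
    ]
    return sorted(kept, key=lambda h: h.bitscore, reverse=True)


# Made-up example data: three hits for one query, two of them "reference" proteins.
example = [
    "q1\tP001\t85.0\t200\t30\t0\t1\t200\t1\t200\t1e-80\t300",
    "q1\tP002\t40.0\t180\t100\t8\t5\t185\t10\t190\t2e-20\t95.5",
    "q1\tP003\t25.0\t90\t60\t7\t20\t110\t30\t120\t0.5\t30.1",
]
reference_ids = {"P002", "P003"}

ranked = reference_hits(parse_blast_tabular(example), reference_ids)
for hit in ranked:
    print(hit.subject_id, hit.evalue, hit.bitscore)
```

In this example, `P001` is dropped because it is not in the reference set and `P003` is dropped because its E-value exceeds the cutoff, so only `P002` remains. A real pipeline would then look up the publications linked to each retained reference protein.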

CONCLUSIONS

The reference proteins detected by our server will lend insight into proteins of unknown function and provide extensive information to develop in-depth understanding of uncharacterized proteins. Seq2Ref is available at: http://prodata.swmed.edu/seq2ref.

Similar articles

ORCAN-a web-based meta-server for real-time detection and functional annotation of orthologs.

Bioinformatics. 2017 Apr 15;33(8):1224-1226. doi: 10.1093/bioinformatics/btw825.

Pclust: protein network visualization highlighting experimental data.

Bioinformatics. 2013 Oct 15;29(20):2647-8. doi: 10.1093/bioinformatics/btt451. Epub 2013 Aug 5.

MESSA: MEta-Server for protein Sequence Analysis.

BMC Biol. 2012 Oct 2;10:82. doi: 10.1186/1741-7007-10-82.

fastSCOP: a fast web server for recognizing protein structural domains and SCOP superfamilies.

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W438-43. doi: 10.1093/nar/gkm288. Epub 2007 May 7.

HorA web server to infer homology between proteins using sequence and structural similarity.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W532-8. doi: 10.1093/nar/gkp328. Epub 2009 May 5.

PROMALS web server for accurate multiple protein sequence alignments.

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W649-52. doi: 10.1093/nar/gkm227. Epub 2007 Apr 22.

COMPASS server for remote homology inference.

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W653-8. doi: 10.1093/nar/gkm293. Epub 2007 May 21.

PPISearch: a web server for searching homologous protein-protein interactions across multiple species.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W369-75. doi: 10.1093/nar/gkp309. Epub 2009 May 5.

PubServer: literature searches by homology.

Nucleic Acids Res. 2014 Jul;42(Web Server issue):W430-5. doi: 10.1093/nar/gku450. Epub 2014 Jun 23.

Cited by

Beyond blast: enabling microbiologists to better extract literature, taxonomic distributions and gene neighbourhood information for protein families.

Microb Genom. 2024 Feb;10(2). doi: 10.1099/mgen.0.001183.

Beyond Blast: Enabling Microbiologists to Better Extract Literature, Taxonomic Distributions and Gene Neighborhood Information for Protein Families.

bioRxiv. 2024 Jan 2:2023.05.03.539116. doi: 10.1101/2023.05.03.539116.

Evolution of the Epigenetic Landscape in Childhood B Acute Lymphoblastic Leukemia and Its Role in Drug Resistance.

Cancer Res. 2020 Dec 1;80(23):5189-5202. doi: 10.1158/0008-5472.CAN-20-1145. Epub 2020 Oct 16.

Comparative Genomics Analysis Provides New Insight Into Molecular Basis of Stomatal Movement in .

Front Plant Sci. 2019 Mar 13;10:292. doi: 10.3389/fpls.2019.00292. eCollection 2019.

PaperBLAST: Text Mining Papers for Information about Homologs.

mSystems. 2017 Aug 15;2(4). doi: 10.1128/mSystems.00039-17. eCollection 2017 Jul-Aug.

A Moraxella catarrhalis two-component signal transduction system necessary for growth in liquid media affects production of two lysozyme inhibitors.

Infect Immun. 2015 Jan;83(1):146-60. doi: 10.1128/IAI.02486-14. Epub 2014 Oct 13.

Pclust: protein network visualization highlighting experimental data.

Bioinformatics. 2013 Oct 15;29(20):2647-8. doi: 10.1093/bioinformatics/btt451. Epub 2013 Aug 5.
