Percolation of annotation errors through hierarchically structured protein sequence databases.

Authors

Gilks Walter R, Audit Benjamin, de Angelis Daniela, Tsoka Sophia, Ouzounis Christos A

Affiliation

Medical Research Council Biostatistics Unit, Institute of Public Health, University Forvie Site, Robinson Way, Cambridge CB2 2SR, UK.

Publication information

Math Biosci. 2005 Feb;193(2):223-34. doi: 10.1016/j.mbs.2004.08.001.

DOI:10.1016/j.mbs.2004.08.001
PMID:15748731
Abstract

Databases of protein sequences have grown rapidly in recent years as a result of genome sequencing projects. Annotating protein sequences with descriptions of their biological function ideally requires careful experimentation, but this work lags far behind. Instead, biological function is often imputed by copying annotations from similar protein sequences. This gives rise to annotation errors and, more seriously, to chains of misannotation. An earlier paper, "Percolation of annotation errors in a database of protein sequences" (2002), developed a probabilistic framework for exploring the consequences of this percolation of errors through protein databases, and applied the theory to a simple database model. Here we apply the theory to hierarchically structured protein sequence databases, and draw conclusions about database quality at different levels of the hierarchy.
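
The mechanism the abstract describes, where copied annotations carry earlier mistakes forward, can be illustrated with a toy simulation. This is a minimal sketch and not the authors' probabilistic framework: the growth rule, the function name `simulate_error_fraction`, and the parameters `p_experimental` and `p_copy_error` are all assumptions made for illustration.

```python
import random


def simulate_error_fraction(n_entries, p_experimental, p_copy_error, seed=0):
    """Grow a toy database one entry at a time and return the fraction
    of entries whose annotation is wrong.

    Assumed rules (hypothetical, not taken from the paper):
      - with probability p_experimental a new entry is annotated by
        experiment and is therefore correct;
      - otherwise it copies the annotation of a uniformly chosen
        existing entry, inheriting any error that entry carries, and
        the copy itself introduces a fresh error with probability
        p_copy_error.
    """
    rng = random.Random(seed)
    is_wrong = [False]  # start from a single correct entry
    for _ in range(n_entries - 1):
        if rng.random() < p_experimental:
            is_wrong.append(False)
        else:
            source_wrong = rng.choice(is_wrong)        # inherited error
            new_error = rng.random() < p_copy_error    # fresh copy error
            is_wrong.append(source_wrong or new_error)
    return sum(is_wrong) / len(is_wrong)


if __name__ == "__main__":
    # Compare how the error fraction responds to the per-copy error rate.
    for eps in (0.0, 0.01, 0.05):
        print(eps, simulate_error_fraction(10_000, 0.1, eps))
```

In this sketch the overall error fraction can exceed the per-copy error rate, because an entry copied from an already wrong source is wrong even when the copy step itself is faithful. That is the "chain of misannotation" effect in miniature.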

Similar articles

1
Percolation of annotation errors through hierarchically structured protein sequence databases.
Math Biosci. 2005 Feb;193(2):223-34. doi: 10.1016/j.mbs.2004.08.001.
2
Modeling the percolation of annotation errors in a database of protein sequences.
Bioinformatics. 2002 Dec;18(12):1641-9. doi: 10.1093/bioinformatics/18.12.1641.
3
Evaluation of annotation strategies using an entire genome sequence.
Bioinformatics. 2003 Apr 12;19(6):717-26. doi: 10.1093/bioinformatics/btg077.
4
Annotation error in public databases: misannotation of molecular function in enzyme superfamilies.
PLoS Comput Biol. 2009 Dec;5(12):e1000605. doi: 10.1371/journal.pcbi.1000605. Epub 2009 Dec 11.
5
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
6
Protein functional annotation by homology.
Methods Mol Biol. 2008;484:465-90. doi: 10.1007/978-1-59745-398-1_28.
7
ProFAT: a web-based tool for the functional annotation of protein sequences.
BMC Bioinformatics. 2006 Oct 23;7:466. doi: 10.1186/1471-2105-7-466.
8
Protein family classification and functional annotation.
Comput Biol Chem. 2003 Feb;27(1):37-47. doi: 10.1016/s1476-9271(02)00098-1.
9
Experimental and computational investigation of enzyme functional annotations uncovers misannotation in the EC 1.1.3.15 enzyme class.
PLoS Comput Biol. 2021 Sep 23;17(9):e1009446. doi: 10.1371/journal.pcbi.1009446. eCollection 2021 Sep.
10
Genome sequencing and annotation: an overview.
Methods Mol Biol. 2004;266:29-45. doi: 10.1385/1-59259-763-7:029.

Cited by

1
FunTaxIS-lite: a simple and light solution to investigate protein functions in all living organisms.
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad549.
2
Metallo-Beta-Lactamase-like Encoding Genes in Candidate Phyla Radiation: Widespread and Highly Divergent Proteins with Potential Multifunctionality.
Microorganisms. 2023 Jul 28;11(8):1933. doi: 10.3390/microorganisms11081933.
3
PASS: Protein Annotation Surveillance Site for Protein Annotation Using Homologous Clusters, NLP, and Sequence Similarity Networks.
Front Bioinform. 2021 Sep 29;1:749008. doi: 10.3389/fbinf.2021.749008. eCollection 2021.
4
Experimental and computational investigation of enzyme functional annotations uncovers misannotation in the EC 1.1.3.15 enzyme class.
PLoS Comput Biol. 2021 Sep 23;17(9):e1009446. doi: 10.1371/journal.pcbi.1009446. eCollection 2021 Sep.
5
GP4: an integrated Gram-Positive Protein Prediction Pipeline for subcellular localization mimicking bacterial sorting.
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa302.
6
Vertical and Horizontal Transmission of ESBL Plasmid from O104:H4.
Genes (Basel). 2020 Oct 16;11(10):1207. doi: 10.3390/genes11101207.
7
The science commons in life science research: structure, function, and value of access to genetic diversity.
Int Soc Sci J. 2006 Jun;58(188):299-317. doi: 10.1111/j.1468-2451.2006.00620.x. Epub 2007 Mar 5.
8
Effusion: prediction of protein function from sequence similarity networks.
Bioinformatics. 2019 Feb 1;35(3):442-451. doi: 10.1093/bioinformatics/bty672.
9
No wisdom in the crowd: genome annotation in the era of big data - current status and future prospects.
Microb Biotechnol. 2018 Jul;11(4):588-605. doi: 10.1111/1751-7915.13284. Epub 2018 May 28.
10
Characterisation of novel biomass degradation enzymes from the genome of Cellulomonas fimi.
Enzyme Microb Technol. 2018 Jun;113:9-17. doi: 10.1016/j.enzmictec.2018.02.004. Epub 2018 Feb 15.