利用基因组相关性自动检测生物化学注释。

Automatic policing of biochemical annotations using genomic correlations.

机构信息

Center for Computational Biology and Bioinformatics and Department of Biomedical Informatics, Columbia University, Irving Cancer Research Center, New York, New York, USA.

出版信息

Nat Chem Biol. 2010 Jan;6(1):34-40. doi: 10.1038/nchembio.266. Epub 2009 Nov 22.

DOI:10.1038/nchembio.266

PMID:19935659

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2935526/

Abstract

With the increasing role of computational tools in the analysis of sequenced genomes, there is an urgent need to maintain high accuracy of functional annotations. Misannotations can be easily generated and propagated through databases by functional transfer based on sequence homology. We developed and optimized an automatic policing method to detect biochemical misannotations using context genomic correlations. The method works by finding genes with unusually weak genomic correlations in their assigned network positions. We demonstrate the accuracy of the method using a cross-validated approach. In addition, we show that the method identifies a significant number of potential misannotations in Bacillus subtilis, including metabolic assignments already shown to be incorrect experimentally. The experimental analysis of the mispredicted genes forming the leucine degradation pathway in B. subtilis demonstrates that computational policing tools can generate important biological hypotheses.

摘要

随着计算工具在分析测序基因组中的作用不断增加，迫切需要保持功能注释的高度准确性。基于序列同源性的功能转移，误注释很容易在数据库中生成和传播。我们开发并优化了一种自动检测方法，利用上下文基因组相关性来检测生化误注释。该方法通过找到在分配的网络位置中具有异常弱基因组相关性的基因来工作。我们使用交叉验证方法证明了该方法的准确性。此外，我们还表明，该方法可以识别枯草芽孢杆菌中大量潜在的误注释，包括已经通过实验证明不正确的代谢分配。枯草芽孢杆菌中亮氨酸降解途径的误预测基因的实验分析表明，计算检测工具可以生成重要的生物学假设。

相似文献

Automatic policing of biochemical annotations using genomic correlations.利用基因组相关性自动检测生物化学注释。

Nat Chem Biol. 2010 Jan;6(1):34-40. doi: 10.1038/nchembio.266. Epub 2009 Nov 22.

Toward Algorithms for Automation of Postgenomic Data Analyses: Promoter Prediction with Artificial Neural Network.迈向基因组后数据分析自动化算法的研究：基于人工神经网络的启动子预测。

OMICS. 2020 May;24(5):300-309. doi: 10.1089/omi.2019.0041. Epub 2019 Oct 1.

Genetic algorithm for assigning weights to gene expressions using functional annotations.使用功能注释为基因表达分配权重的遗传算法。

Comput Biol Med. 2019 Jan;104:149-162. doi: 10.1016/j.compbiomed.2018.11.011. Epub 2018 Nov 17.

Filtering high-throughput protein-protein interaction data using a combination of genomic features.使用基因组特征组合过滤高通量蛋白质-蛋白质相互作用数据。

BMC Bioinformatics. 2005 Apr 18;6:100. doi: 10.1186/1471-2105-6-100.

iBsu1103: a new genome-scale metabolic model of Bacillus subtilis based on SEED annotations.iBsu1103：基于SEED注释的枯草芽孢杆菌新的全基因组规模代谢模型。

Genome Biol. 2009;10(6):R69. doi: 10.1186/gb-2009-10-6-r69. Epub 2009 Jun 25.

Automatic clustering of orthologs and in-paralogs from pairwise species comparisons.通过成对物种比较对直系同源基因和旁系同源基因进行自动聚类。

J Mol Biol. 2001 Dec 14;314(5):1041-52. doi: 10.1006/jmbi.2000.5197.

High-throughput comparison, functional annotation, and metabolic modeling of plant genomes using the PlantSEED resource.利用 PlantSEED 资源进行高通量比较、功能注释和植物基因组代谢建模。

Proc Natl Acad Sci U S A. 2014 Jul 1;111(26):9645-50. doi: 10.1073/pnas.1401329111. Epub 2014 Jun 9.

Bacillus subtilis, the model Gram-positive bacterium: 20 years of annotation refinement.枯草芽孢杆菌，革兰氏阳性模式菌：20 年的注释完善历程。

Microb Biotechnol. 2018 Jan;11(1):3-17. doi: 10.1111/1751-7915.13043.

Automatic detection of subsystem/pathway variants in genome analysis.基因组分析中自动检测子系统/通路变异

Bioinformatics. 2005 Jun;21 Suppl 1:i478-86. doi: 10.1093/bioinformatics/bti1052.

引用本文的文献

Transcriptomic Analysis Reveals the Role of tmRNA on Biofilm Formation in .转录组分析揭示了tmRNA在……生物膜形成中的作用。

Microorganisms. 2022 Jul 1;10(7):1338. doi: 10.3390/microorganisms10071338.

Parallel evolution of non-homologous isofunctional enzymes in methionine biosynthesis.甲硫氨酸生物合成中非同源同工酶的平行进化。

Nat Chem Biol. 2017 Aug;13(8):858-866. doi: 10.1038/nchembio.2397. Epub 2017 Jun 5.

Assignment of function to a domain of unknown function: DUF1537 is a new kinase family in catabolic pathways for acid sugars.未知功能结构域的功能指派：DUF1537是酸性糖分解代谢途径中的一个新激酶家族。

Proc Natl Acad Sci U S A. 2016 Jul 19;113(29):E4161-9. doi: 10.1073/pnas.1605546113. Epub 2016 Jul 11.

Successful conversion of the Bacillus subtilis BirA Group II biotin protein ligase into a Group I ligase.枯草芽孢杆菌BirA第二组生物素蛋白连接酶成功转化为第一组连接酶。

PLoS One. 2014 May 9;9(5):e96757. doi: 10.1371/journal.pone.0096757. eCollection 2014.

Simple topological properties predict functional misannotations in a metabolic network.简单的拓扑性质可预测代谢网络中的功能误注释。

Bioinformatics. 2013 Jul 1;29(13):i154-61. doi: 10.1093/bioinformatics/btt236.

Explaining microbial phenotypes on a genomic scale: GWAS for microbes.从基因组层面阐释微生物表型：针对微生物的 GWAS 分析。

Brief Funct Genomics. 2013 Jul;12(4):366-80. doi: 10.1093/bfgp/elt008. Epub 2013 Apr 26.

MIRAGE: a functional genomics-based approach for metabolic network model reconstruction and its application to cyanobacteria networks.MIRAGE：一种基于功能基因组学的代谢网络模型重建方法及其在蓝细菌网络中的应用

Genome Biol. 2012 Nov 29;13(11):R111. doi: 10.1186/gb-2012-13-11-r111.

Global probabilistic annotation of metabolic networks enables enzyme discovery.全球代谢网络概率注释可实现酶的发现。

Nat Chem Biol. 2012 Oct;8(10):848-54. doi: 10.1038/nchembio.1063.

Recent advances in mapping environmental microbial metabolisms through 13C isotopic fingerprints.通过 13C 同位素指纹图谱绘制环境微生物代谢途径的最新进展。

J R Soc Interface. 2012 Nov 7;9(76):2767-80. doi: 10.1098/rsif.2012.0396. Epub 2012 Aug 15.

A road map for the development of community systems (CoSy) biology.社区系统生物学（CoSy）的发展路线图。

Nat Rev Microbiol. 2012 Mar 27;10(5):366-72. doi: 10.1038/nrmicro2763.

本文引用的文献

Optimization by simulated annealing.模拟退火优化。

Science. 1983 May 13;220(4598):671-80. doi: 10.1126/science.220.4598.671.

Identification of a soluble diacylglycerol kinase required for lipoteichoic acid production in Bacillus subtilis.枯草芽孢杆菌中脂磷壁酸产生所需的一种可溶性二酰基甘油激酶的鉴定。

J Biol Chem. 2007 Jul 27;282(30):21738-45. doi: 10.1074/jbc.M703536200. Epub 2007 May 28.

Gene loss rate: a probabilistic measure for the conservation of eukaryotic genes.基因丢失率：一种衡量真核基因保守性的概率指标。

Nucleic Acids Res. 2007;35(1):e7. doi: 10.1093/nar/gkl792. Epub 2006 Dec 7.

Expression dynamics of a cellular metabolic network.细胞代谢网络的表达动力学

Mol Syst Biol. 2005;1:2005.0016. doi: 10.1038/msb4100023. Epub 2005 Aug 2.

Identifying metabolic enzymes with multiple types of association evidence.利用多种关联证据识别代谢酶。

BMC Bioinformatics. 2006 Mar 29;7:177. doi: 10.1186/1471-2105-7-177.

Predicting genes for orphan metabolic activities using phylogenetic profiles.利用系统发育谱预测孤儿代谢活动的基因。

Genome Biol. 2006;7(2):R17. doi: 10.1186/gb-2006-7-2-r17. Epub 2006 Feb 15.

A three-protein signaling pathway governing immunity to a bacterial cannibalism toxin.一条控制对细菌同类相食毒素免疫的三蛋白信号通路。

Cell. 2006 Feb 10;124(3):549-59. doi: 10.1016/j.cell.2005.11.041.

MetaCyc: a multiorganism database of metabolic pathways and enzymes.MetaCyc：一个多生物体代谢途径和酶的数据库。

Nucleic Acids Res. 2006 Jan 1;34(Database issue):D511-6. doi: 10.1093/nar/gkj128.

From genomics to chemical genomics: new developments in KEGG.从基因组学到化学基因组学：KEGG的新进展

Nucleic Acids Res. 2006 Jan 1;34(Database issue):D354-7. doi: 10.1093/nar/gkj102.

The subsystems approach to genome annotation and its use in the project to annotate 1000 genomes.基因组注释的子系统方法及其在千人基因组注释计划中的应用。

Nucleic Acids Res. 2005 Oct 7;33(17):5691-702. doi: 10.1093/nar/gki866. Print 2005.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用基因组相关性自动检测生物化学注释。

Automatic policing of biochemical annotations using genomic correlations.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献