

The y-ome defines the 35% of Escherichia coli genes that lack experimental evidence of function.

Author information

Bioengineering Department, University of California, San Diego, La Jolla, CA 92093, USA.

Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA.

Publication information

Nucleic Acids Res. 2019 Mar 18;47(5):2446-2454. doi: 10.1093/nar/gkz030.

Abstract

Experimental studies of Escherichia coli K-12 MG1655 often implicate poorly annotated genes in cellular phenotypes. However, we lack a systematic understanding of these genes. How many are there? What information is available for them? And what features do they share that could explain the gap in our understanding? Efforts to build predictive, whole-cell models of E. coli inevitably face this knowledge gap. We approached these questions systematically by assembling annotations from the knowledge bases EcoCyc, EcoGene, UniProt and RegulonDB. We identified the genes that lack experimental evidence of function (the 'y-ome') which include 1600 of 4623 unique genes (34.6%), of which 111 have absolutely no evidence of function. An additional 220 genes (4.7%) are pseudogenes or phantom genes. y-ome genes tend to have lower expression levels and are enriched in the termination region of the E. coli chromosome. Where evidence is available for y-ome genes, it most often points to them being membrane proteins and transporters. We resolve the misconception that a gene in E. coli whose primary name starts with 'y' is unannotated, and we discuss the value of the y-ome for systematic improvement of E. coli knowledge bases and its extension to other organisms.
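The classification described in the abstract can be illustrated with a small sketch. This is not the authors' pipeline: the record schema (`experimental_evidence`, `pseudogene_or_phantom`), the toy gene names, and the rule that evidence in any one knowledge base is enough are all assumptions made for illustration. The last line only re-checks the reported y-ome fraction from the counts given in the abstract.

```python
from collections import Counter


def classify_gene(annotations):
    """Label one gene from its per-knowledge-base records.

    `annotations` is a list of dicts, one per source (hypothetical schema).
    Assumed rule: a pseudogene/phantom flag in any source excludes the gene;
    otherwise experimental evidence in any source marks it well-annotated;
    everything else falls into the y-ome.
    """
    if any(a.get("pseudogene_or_phantom", False) for a in annotations):
        return "excluded"
    if any(a.get("experimental_evidence", False) for a in annotations):
        return "well-annotated"
    return "y-ome"


def summarize(genes):
    """Return {label: (count, percent of all genes)} for a gene -> records map."""
    counts = Counter(classify_gene(records) for records in genes.values())
    total = len(genes)
    return {label: (n, round(100 * n / total, 1)) for label, n in counts.items()}


# Toy input with made-up gene names; real records would come from
# EcoCyc, EcoGene, UniProt and RegulonDB exports.
toy_genes = {
    "geneA": [{"source": "EcoCyc", "experimental_evidence": True}],
    "geneB": [
        {"source": "UniProt", "experimental_evidence": False},
        {"source": "EcoGene", "experimental_evidence": False},
    ],
    "geneC": [{"source": "EcoCyc", "pseudogene_or_phantom": True}],
    "geneD": [],  # no annotation in any source
}
summary = summarize(toy_genes)

# Re-check of the fraction reported in the abstract: 1600 of 4623 genes.
yome_percent = round(100 * 1600 / 4623, 1)
```

A stricter rule, such as requiring agreement between several knowledge bases, would only change `classify_gene`; the summary step stays the same.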


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce1/6412132/6a13e15c8197/gkz030fig1.jpg
