Suppr超能文献

核糖体测序、免疫肽组学和蛋白质组学能告诉我们非典型蛋白质组的什么信息?

What Can Ribo-Seq, Immunopeptidomics, and Proteomics Tell Us About the Noncanonical Proteome?

机构信息

Division of Pediatric Hematology/Oncology, Department of Pediatrics, University of Michigan Medical School, Ann Arbor, Michigan, USA; Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, Michigan, USA.

Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.

出版信息

Mol Cell Proteomics. 2023 Sep;22(9):100631. doi: 10.1016/j.mcpro.2023.100631. Epub 2023 Aug 11.

Abstract

Ribosome profiling (Ribo-Seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of noncanonical sites of ribosome translation outside the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7000 noncanonical ORFs are translated, which, at first glance, has the potential to expand the number of human protein CDSs by 30%, from ∼19,500 annotated CDSs to over 26,000 annotated CDSs. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of noncanonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome but searching for guidance on how to proceed. Here, we discuss the current state of noncanonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein coding."

摘要

核糖体图谱(Ribo-Seq)通过阐明数千个核糖体翻译的非规范编码序列(CDS)外的非规范翻译位点,证明了对人类基因组和蛋白质组的理解具有变革性。保守估计表明,至少有 7000 个非规范的 ORF 被翻译,这乍一看有可能将人类蛋白质 CDS 的数量增加 30%,从约 19500 个注释的 CDS 增加到超过 26000 个注释的 CDS。然而,对这些 ORF 的进一步研究提出了许多问题,即它们中有多少真正产生蛋白质产物,以及根据对该术语的传统理解,其中有多少可以被理解为蛋白质。更复杂的是,非规范 ORF 的已发表估计值差异很大,范围从几千到几十万。这项研究的综合结果使基因组学和蛋白质组学社区既对人类基因组中可能存在新的编码区域感到兴奋,又在寻找如何着手的指导。在这里,我们讨论了非规范 ORF 研究、数据库和解释的现状,重点讨论了如何评估给定的 ORF 是否可以被称为“蛋白质编码”。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f5e/10506109/a2b747130cb7/ga1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验