uncovers hundreds of novel human (and other) exons through comparative analysis of proteins.

Author information

Nord Alexander J, Wheeler Travis J

Affiliations

Division of Biological Sciences, University of Montana, Missoula, Montana, United States of America.

R. Ken Coit College of Pharmacy, University of Arizona, Tucson, Arizona, United States of America.

Publication information

bioRxiv. 2024 May 6:2024.05.05.592595. doi: 10.1101/2024.05.05.592595.

Abstract

BACKGROUND

Eukaryotic genes are often composed of multiple exons that are stitched together by removing the intervening introns. These exons may be conditionally joined in different combinations to produce a collection of related, but distinct, mRNA transcripts. For protein-coding genes, these products of alternative splicing lead to the production of related protein variants (isoforms) of a gene. Complete labeling of the protein-coding content of a eukaryotic genome requires discovering mRNA that encodes every isoform, but it is impractical to enumerate all possible combinations of tissue, developmental stage, and environmental context; as a result, many true exons go unlabeled in genome annotations.

RESULTS

One way to address the combinatorial challenge of finding all isoforms in a single organism is to leverage sequencing efforts for other organisms: each time a new organism is sequenced, it may be sequenced under a new combination of conditions, so a previously unobserved isoform may be captured. We present a software tool that identifies previously undocumented exons in organisms by comparing isoforms across species. We demonstrate the tool's utility by locating hundreds of novel exons in the genomes of human, mouse, and rat, as well as in the ferret genome. Further, we provide analyses supporting the notion that most of the new exons reported by the tool are likely to be part of a true (but unobserved) isoform of the containing species.


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e278/11092782/735aa238bdc3/nihpp-2024.05.05.592595v2-f0001.jpg
