• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在 29 种哺乳动物基因组中定位具有额外重叠功能的选择下的蛋白质编码序列。

Locating protein-coding sequences under selection for additional, overlapping functions in 29 mammalian genomes.

机构信息

Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.

出版信息

Genome Res. 2011 Nov;21(11):1916-28. doi: 10.1101/gr.108753.110. Epub 2011 Oct 12.

DOI:10.1101/gr.108753.110
PMID:21994248
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3205576/
Abstract

The degeneracy of the genetic code allows protein-coding DNA and RNA sequences to simultaneously encode additional, overlapping functional elements. A sequence in which both protein-coding and additional overlapping functions have evolved under purifying selection should show increased evolutionary conservation compared to typical protein-coding genes--especially at synonymous sites. In this study, we use genome alignments of 29 placental mammals to systematically locate short regions within human ORFs that show conspicuously low estimated rates of synonymous substitution across these species. The 29-species alignment provides statistical power to locate more than 10,000 such regions with resolution down to nine-codon windows, which are found within more than a quarter of all human protein-coding genes and contain ∼2% of their synonymous sites. We collect numerous lines of evidence that the observed synonymous constraint in these regions reflects selection on overlapping functional elements including splicing regulatory elements, dual-coding genes, RNA secondary structures, microRNA target sites, and developmental enhancers. Our results show that overlapping functional elements are common in mammalian genes, despite the vast genomic landscape.

摘要

遗传密码的简并性允许蛋白质编码的 DNA 和 RNA 序列同时编码额外的、重叠的功能元件。在纯化选择下进化出蛋白质编码和额外重叠功能的序列,与典型的蛋白质编码基因相比,应该表现出更高的进化保守性——尤其是在同义位点。在这项研究中,我们使用了 29 种胎盘哺乳动物的基因组比对,系统地定位了人类 ORF 内的短区域,这些区域在这些物种中表现出明显较低的同义替代估计率。29 种物种的比对提供了统计能力,能够以分辨率为 9 个密码子窗口定位超过 10000 个这样的区域,这些区域存在于超过四分之一的人类蛋白质编码基因中,包含它们约 2%的同义位点。我们收集了大量证据表明,这些区域观察到的同义约束反映了对重叠功能元件的选择,包括剪接调控元件、双编码基因、RNA 二级结构、microRNA 靶位点和发育增强子。我们的结果表明,尽管基因组景观广阔,但重叠的功能元件在哺乳动物基因中很常见。

相似文献

1
Locating protein-coding sequences under selection for additional, overlapping functions in 29 mammalian genomes.在 29 种哺乳动物基因组中定位具有额外重叠功能的选择下的蛋白质编码序列。
Genome Res. 2011 Nov;21(11):1916-28. doi: 10.1101/gr.108753.110. Epub 2011 Oct 12.
2
FRESCo: finding regions of excess synonymous constraint in diverse viruses.FRESCo:在多种病毒中寻找同义密码子过度限制区域
Genome Biol. 2015 Feb 17;16(1):38. doi: 10.1186/s13059-015-0603-7.
3
A high-resolution map of human evolutionary constraint using 29 mammals.利用 29 种哺乳动物绘制人类进化约束的高分辨率图谱。
Nature. 2011 Oct 12;478(7370):476-82. doi: 10.1038/nature10530.
4
Detecting overlapping coding sequences in virus genomes.检测病毒基因组中的重叠编码序列。
BMC Bioinformatics. 2006 Feb 16;7:75. doi: 10.1186/1471-2105-7-75.
5
Ultraconserved coding regions outside the homeobox of mammalian Hox genes.哺乳动物Hox基因同源异型框之外的超保守编码区域。
BMC Evol Biol. 2008 Sep 24;8:260. doi: 10.1186/1471-2148-8-260.
6
Extensive purifying selection acting on synonymous sites in HIV-1 Group M sequences.广泛的纯化选择作用于HIV-1 M组序列中的同义位点。
Virol J. 2008 Dec 23;5:160. doi: 10.1186/1743-422X-5-160.
7
Evidence for a novel overlapping coding sequence in POLG initiated at a CUG start codon.证据表明,POLG 在 CUG 起始密码子处起始的新型重叠编码序列。
BMC Genet. 2020 Mar 6;21(1):25. doi: 10.1186/s12863-020-0828-7.
8
New tools to analyze overlapping coding regions.用于分析重叠编码区域的新工具。
BMC Bioinformatics. 2016 Dec 13;17(1):530. doi: 10.1186/s12859-016-1389-7.
9
Exploration for functional nucleotide sequence candidates within coding regions of mammalian genes.哺乳动物基因编码区内功能核苷酸序列候选区的探索。
DNA Res. 2011 Jun;18(3):177-87. doi: 10.1093/dnares/dsr010. Epub 2011 May 17.
10
Statistical evidence for conserved, local secondary structure in the coding regions of eukaryotic mRNAs and pre-mRNAs.真核生物mRNA和前体mRNA编码区域中保守的局部二级结构的统计证据。
Nucleic Acids Res. 2005 Nov 7;33(19):6338-48. doi: 10.1093/nar/gki923. Print 2005.

引用本文的文献

1
Comparative RNA Genomics.比较 RNA 基因组学。
Methods Mol Biol. 2024;2802:347-393. doi: 10.1007/978-1-0716-3838-5_12.
2
Detection of evolutionary conserved and accelerated genomic regions related to adaptation to thermal niches in lizards.检测与蜥蜴适应热生态位相关的进化保守和加速的基因组区域。
Ecol Evol. 2024 Mar 7;14(3):e11117. doi: 10.1002/ece3.11117. eCollection 2024 Mar.
3
Selection on synonymous sites: the unwanted transcript hypothesis.同义位点选择:不需要的转录本假说。
Nat Rev Genet. 2024 Jun;25(6):431-448. doi: 10.1038/s41576-023-00686-7. Epub 2024 Jan 31.
4
rare codon UUA: from features associated with 2 related locations to candidate phage regulatory translational bypassing.稀有密码子 UUA:从与 2 个相关位置相关的特征到候选噬菌体调节性翻译绕过。
RNA Biol. 2023 Jan;20(1):926-942. doi: 10.1080/15476286.2023.2270812. Epub 2023 Nov 15.
5
Transcriptional Regulation and Implications for Controlling Gene Expression.转录调控及其对基因表达控制的影响
J Dev Biol. 2022 Jan 10;10(1):4. doi: 10.3390/jdb10010004.
6
Common Features in lncRNA Annotation and Classification: A Survey.长链非编码RNA注释与分类的共同特征:一项综述。
Noncoding RNA. 2021 Dec 13;7(4):77. doi: 10.3390/ncrna7040077.
7
SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes.比较 44 种 Sarbecovirus 基因组分析 SARS-CoV-2 的基因组成和 COVID-19 的突变影响。
Nat Commun. 2021 May 11;12(1):2642. doi: 10.1038/s41467-021-22905-7.
8
Impact of Synonymous Genome Recoding on the HIV Life Cycle.同义基因组重编码对HIV生命周期的影响。
Front Microbiol. 2021 Mar 16;12:606087. doi: 10.3389/fmicb.2021.606087. eCollection 2021.
9
Evidence for secondary-variant genetic burden and non-random distribution across biological modules in a recessive ciliopathy.隐性纤毛病中二级变异遗传负担和生物模块中非随机分布的证据。
Nat Genet. 2020 Nov;52(11):1145-1150. doi: 10.1038/s41588-020-0707-1. Epub 2020 Oct 12.
10
SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes.通过比较44种Sarbecovirus基因组分析严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的基因内容及2019冠状病毒病(COVID-19)突变的影响
Res Sq. 2020 Oct 1:rs.3.rs-80345. doi: 10.21203/rs.3.rs-80345/v1.

本文引用的文献

1
New families of human regulatory RNA structures identified by comparative analysis of vertebrate genomes.通过比较分析脊椎动物基因组鉴定出的人类调控 RNA 结构的新家族。
Genome Res. 2011 Nov;21(11):1929-43. doi: 10.1101/gr.112516.110. Epub 2011 Oct 12.
2
Translation efficiency is determined by both codon bias and folding energy.翻译效率由密码子偏爱性和折叠能共同决定。
Proc Natl Acad Sci U S A. 2010 Feb 23;107(8):3645-50. doi: 10.1073/pnas.0909910107. Epub 2010 Feb 2.
3
Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content.黑猩猩和人类的 Y 染色体在结构和基因组成上有显著差异。
Nature. 2010 Jan 28;463(7280):536-9. doi: 10.1038/nature08700. Epub 2010 Jan 13.
4
Exonic remnants of whole-genome duplication reveal cis-regulatory function of coding exons.外显子残余的全基因组复制揭示了编码外显子的顺式调控功能。
Nucleic Acids Res. 2010 Mar;38(4):1071-85. doi: 10.1093/nar/gkp1124. Epub 2009 Dec 6.
5
COMIT: identification of noncoding motifs under selection in coding sequences.COMIT:鉴定编码序列中受选择影响的非编码基序。
Genome Biol. 2009;10(11):R133. doi: 10.1186/gb-2009-10-11-r133. Epub 2009 Nov 20.
6
RNAz 2.0: improved noncoding RNA detection.RNAz 2.0:改进的非编码RNA检测
Pac Symp Biocomput. 2010:69-79.
7
Detection of nonneutral substitution rates on mammalian phylogenies.检测哺乳动物系统发育上的非中性替代率。
Genome Res. 2010 Jan;20(1):110-21. doi: 10.1101/gr.097857.109. Epub 2009 Oct 26.
8
Mechanisms of alternative splicing regulation: insights from molecular and genomics approaches.可变剪接调控机制:来自分子和基因组学方法的见解
Nat Rev Mol Cell Biol. 2009 Nov;10(11):741-54. doi: 10.1038/nrm2777. Epub 2009 Sep 23.
9
The coexistence of the nucleosome positioning code with the genetic code on eukaryotic genomes.核小体定位密码与真核生物基因组上遗传密码的共存。
Nucleic Acids Res. 2009 Oct;37(19):6466-76. doi: 10.1093/nar/gkp689. Epub 2009 Aug 21.
10
Chromatin organization marks exon-intron structure.染色质组织标记外显子-内含子结构。
Nat Struct Mol Biol. 2009 Sep;16(9):990-5. doi: 10.1038/nsmb.1659.