• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类蛋白编码基因 5' 端的 GC 含量正在发生突变性衰减。

The GC-content at the 5' ends of human protein-coding genes is undergoing mutational decay.

机构信息

Department of Biochemistry, University of Toronto, Toronto, Ontario, M5G1M1, Canada.

Department of Electrical Engineering and Computer Science, York University, Toronto, Ontario, M3J1P3, Canada.

出版信息

Genome Biol. 2024 Aug 13;25(1):219. doi: 10.1186/s13059-024-03364-x.

DOI:10.1186/s13059-024-03364-x
PMID:39138526
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11323403/
Abstract

BACKGROUND

In vertebrates, most protein-coding genes have a peak of GC-content near their 5' transcriptional start site (TSS). This feature promotes both the efficient nuclear export and translation of mRNAs. Despite the importance of GC-content for RNA metabolism, its general features, origin, and maintenance remain mysterious. We investigate the evolutionary forces shaping GC-content at the transcriptional start site (TSS) of genes through both comparative genomic analysis of nucleotide substitution rates between different species and by examining human de novo mutations.

RESULTS

Our data suggests that GC-peaks at TSSs were present in the last common ancestor of amniotes, and likely that of vertebrates. We observe that in apes and rodents, where recombination is directed away from TSSs by PRDM9, GC-content at the 5' end of protein-coding gene is currently undergoing mutational decay. In canids, which lack PRDM9 and perform recombination at TSSs, GC-content at the 5' end of protein-coding is increasing. We show that these patterns extend into the 5' end of the open reading frame, thus impacting synonymous codon position choices.

CONCLUSIONS

Our results indicate that the dynamics of this GC-peak in amniotes is largely shaped by historic patterns of recombination. Since decay of GC-content towards the mutation rate equilibrium is the default state for non-functional DNA, the observed decrease in GC-content at TSSs in apes and rodents indicates that the GC-peak is not being maintained by selection on most protein-coding genes in those species.

摘要

背景

在脊椎动物中,大多数蛋白质编码基因在其 5'转录起始位点(TSS)附近有一个 GC 含量峰值。这个特征促进了 mRNA 的高效核输出和翻译。尽管 GC 含量对 RNA 代谢很重要,但它的一般特征、起源和维持仍然是神秘的。我们通过比较不同物种之间核苷酸取代率的基因组分析以及检查人类新生突变,研究了塑造基因转录起始位点(TSS)处 GC 含量的进化力量。

结果

我们的数据表明,在羊膜动物和脊椎动物的最后共同祖先中,TSS 处存在 GC 峰。我们观察到,在灵长类动物和啮齿类动物中,由于 PRDM9 将重组引导远离 TSS,蛋白质编码基因 5'端的 GC 含量目前正在经历突变衰减。在缺乏 PRDM9 并在 TSS 处进行重组的犬科动物中,蛋白质编码基因 5'端的 GC 含量正在增加。我们表明,这些模式扩展到开放阅读框的 5'端,从而影响同义密码子位置的选择。

结论

我们的结果表明,在羊膜动物中,这个 GC 峰的动态主要是由历史上的重组模式塑造的。由于向突变率平衡的 GC 含量衰减是无功能 DNA 的默认状态,在灵长类动物和啮齿类动物中 TSS 处 GC 含量的减少表明,在这些物种中的大多数蛋白质编码基因中,GC 峰不是通过选择来维持的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/e25656625be4/13059_2024_3364_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/21f59e319e81/13059_2024_3364_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/5a3b1b0ab2ad/13059_2024_3364_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/69ecb47b0112/13059_2024_3364_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/2b3a3d2a7fa1/13059_2024_3364_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/ee55ed63a3c4/13059_2024_3364_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/38f14167a209/13059_2024_3364_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/e25656625be4/13059_2024_3364_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/21f59e319e81/13059_2024_3364_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/5a3b1b0ab2ad/13059_2024_3364_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/69ecb47b0112/13059_2024_3364_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/2b3a3d2a7fa1/13059_2024_3364_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/ee55ed63a3c4/13059_2024_3364_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/38f14167a209/13059_2024_3364_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f970/11323403/e25656625be4/13059_2024_3364_Fig7_HTML.jpg

相似文献

1
The GC-content at the 5' ends of human protein-coding genes is undergoing mutational decay.人类蛋白编码基因 5' 端的 GC 含量正在发生突变性衰减。
Genome Biol. 2024 Aug 13;25(1):219. doi: 10.1186/s13059-024-03364-x.
2
GC-biased gene conversion and selection affect GC content in the Oryza genus (rice).GC 偏向性基因转换和选择会影响稻属(水稻)中的 GC 含量。
Mol Biol Evol. 2011 Sep;28(9):2695-706. doi: 10.1093/molbev/msr104. Epub 2011 Apr 18.
3
Vertebrate codon bias indicates a highly GC-rich ancestral genome.脊椎动物密码子偏向性表明其祖先基因组富含 GC。
Gene. 2013 Apr 25;519(1):113-9. doi: 10.1016/j.gene.2013.01.033. Epub 2013 Jan 31.
4
Biased gene conversion and GC-content evolution in the coding sequences of reptiles and vertebrates.爬行动物和脊椎动物编码序列中的偏向基因转换与GC含量进化。
Genome Biol Evol. 2014 Dec 19;7(1):240-50. doi: 10.1093/gbe/evu277.
5
Runaway GC Evolution in Gerbil Genomes.沙鼠基因组中的 GC 剧烈演变
Mol Biol Evol. 2020 Aug 1;37(8):2197-2210. doi: 10.1093/molbev/msaa072.
6
The evolution of transcription-associated biases of mutations across vertebrates.转录相关突变偏倚在脊椎动物中的演化。
BMC Evol Biol. 2010 Jun 18;10:187. doi: 10.1186/1471-2148-10-187.
7
Selective forces and mutational biases drive stop codon usage in the human genome: a comparison with sense codon usage.选择力和突变偏好驱动人类基因组中终止密码子的使用:与有义密码子使用的比较。
BMC Genomics. 2016 May 17;17:366. doi: 10.1186/s12864-016-2692-4.
8
GC content evolution in coding regions of angiosperm genomes: a unifying hypothesis.被子植物基因组编码区 GC 含量的进化:一个统一的假说。
Trends Genet. 2014 Jul;30(7):263-70. doi: 10.1016/j.tig.2014.05.002. Epub 2014 Jun 7.
9
GC-compositional strand bias around transcription start sites in plants and fungi.植物和真菌中转录起始位点周围的GC组成链偏向性。
BMC Genomics. 2005 Feb 28;6:26. doi: 10.1186/1471-2164-6-26.
10
Transcription-related mutations and GC content drive variation in nucleotide substitution rates across the genomes of Arabidopsis thaliana and Arabidopsis lyrata.转录相关突变和GC含量驱动拟南芥和琴叶拟南芥基因组中核苷酸替换率的变异。
BMC Evol Biol. 2007 Apr 23;7:66. doi: 10.1186/1471-2148-7-66.

引用本文的文献

1
High-resolution profiling reveals coupled transcriptional and translational regulation of transgenes.高分辨率分析揭示了转基因的转录和翻译调控的耦合。
Nucleic Acids Res. 2025 Jun 6;53(11). doi: 10.1093/nar/gkaf528.
2
Coupling mechanisms coordinating mRNA translation with stages of the mRNA lifecycle.将mRNA翻译与mRNA生命周期各阶段相协调的偶联机制。
RNA Biol. 2025 Dec;22(1):1-12. doi: 10.1080/15476286.2025.2483001. Epub 2025 Mar 24.
3
Structural and Functional Insights into Targeting GCCG Sites in the EGFR Promoter by Two DNA Intercalators to Inhibit Breast Cancer Metastasis.

本文引用的文献

1
High prevalence of PRDM9-independent recombination hotspots in placental mammals.胎盘哺乳动物中 PRDM9 非依赖性重组热点的高发生率。
Proc Natl Acad Sci U S A. 2024 Jun 4;121(23):e2401973121. doi: 10.1073/pnas.2401973121. Epub 2024 May 29.
2
Patterns of recombination in snakes reveal a tug-of-war between PRDM9 and promoter-like features.蛇类重组模式揭示了 PRDM9 与启动子样特征之间的拉锯战。
Science. 2024 Feb 23;383(6685):eadj7026. doi: 10.1126/science.adj7026.
3
mRNA nuclear export: how mRNA identity features distinguish functional RNAs from junk transcripts.
两种DNA嵌入剂靶向表皮生长因子受体(EGFR)启动子中的GCCG位点以抑制乳腺癌转移的结构与功能研究
J Med Chem. 2025 Mar 27;68(6):6601-6615. doi: 10.1021/acs.jmedchem.4c03203. Epub 2025 Mar 3.
4
Genome biology of long non-coding RNAs in humans: A virtual karyotype.人类长链非编码RNA的基因组生物学:一种虚拟核型。
Comput Struct Biotechnol J. 2025 Jan 31;27:575-584. doi: 10.1016/j.csbj.2025.01.026. eCollection 2025.
mRNA 核输出:mRNA 身份特征如何区分功能 RNA 与垃圾转录本。
RNA Biol. 2024 Jan;21(1):1-12. doi: 10.1080/15476286.2023.2293339. Epub 2023 Dec 13.
4
Down the Penrose stairs, or how selection for fewer recombination hotspots maintains their existence.沿着彭罗斯阶梯走下去,或者说,选择减少重组热点如何维持它们的存在。
Elife. 2023 Oct 13;12:e83769. doi: 10.7554/eLife.83769.
5
Structural basis for high-order complex of SARNP and DDX39B to facilitate mRNP assembly.SARNP 和 DDX39B 高序复合物促进 mRNP 组装的结构基础。
Cell Rep. 2023 Aug 29;42(8):112988. doi: 10.1016/j.celrep.2023.112988. Epub 2023 Aug 14.
6
Promoter evolution of mammalian gene duplicates.哺乳动物基因副本的启动子进化。
BMC Biol. 2023 Apr 13;21(1):80. doi: 10.1186/s12915-023-01590-6.
7
RBM33 directs the nuclear export of transcripts containing GC-rich elements.RBM33 指导含有 GC 丰富元件的转录本的核输出。
Genes Dev. 2022 May 1;36(9-10):550-565. doi: 10.1101/gad.349456.122. Epub 2022 May 19.
8
Non-Darwinian Molecular Biology.非达尔文分子生物学
Front Genet. 2022 Feb 16;13:831068. doi: 10.3389/fgene.2022.831068. eCollection 2022.
9
GC-content biases in protein-coding genes act as an "mRNA identity" feature for nuclear export.GC 含量偏倚在编码蛋白基因中充当核输出的“mRNA 身份”特征。
Bioessays. 2021 Feb;43(2):e2000197. doi: 10.1002/bies.202000197. Epub 2020 Nov 9.
10
Gene Architecture and Sequence Composition Underpin Selective Dependency of Nuclear Export of Long RNAs on NXF1 and the TREX Complex.基因结构和序列组成是长 RNA 核输出依赖于 NXF1 和 TREX 复合物的选择性基础。
Mol Cell. 2020 Jul 16;79(2):251-267.e6. doi: 10.1016/j.molcel.2020.05.013. Epub 2020 Jun 5.