
GC/AT-content spikes as genomic punctuation marks.

Authors

Zhang Lingang, Kasif Simon, Cantor Charles R, Broude Natalia E

Affiliation

Center for Advanced Biotechnology, Boston University, Boston, MA 02215, USA.

Publication information

Proc Natl Acad Sci U S A. 2004 Nov 30;101(48):16855-60. doi: 10.1073/pnas.0407821101. Epub 2004 Nov 17.

DOI: 10.1073/pnas.0407821101
PMID: 15548610
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC534751/

Abstract

Large-scale analysis of the GC-content distribution at the gene level reveals both common features and basic differences in genomes of different groups of species. Sharp changes in GC content are detected at the transcription boundaries for all species analyzed, including human, mouse, rat, chicken, fruit fly, and worm. However, two substantially distinct groups of GC-content profiles can be recognized: warm-blooded vertebrates including human, mouse, rat, and chicken, and invertebrates including fruit fly and worm. In vertebrates, sharp positive and negative spikes of GC content are observed at the transcription start and stop sites, respectively, and there is also a progressive decrease in GC content from the 5' untranslated region to the 3' untranslated region along the gene. In invertebrates, the positive and negative GC-content spikes at the transcription start and stop sites are preceded by spikes of opposite value, and the highest GC content is found in the coding regions of the genes. Cross-correlation analysis indicates high frequencies of GC-content spikes at transcription start and stop sites. The strong conservation of this genomic feature seen in comparisons of the human/mouse and human/rat orthologs, and the clustering of genes with GC-content spikes on chromosomes imply a biological function. The GC-content spikes at transcription boundaries may reflect a general principle of genomic punctuation. Our analysis also provides means for identifying these GC-content spikes in individual genomic sequences.
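
The abstract does not describe the detection procedure itself, so the following is only a minimal sketch of how a sharp GC-content change could be located in a single sequence. It assumes a simple sliding-window GC fraction and a difference threshold between adjacent windows. The function names `gc_profile` and `find_spikes`, the window and step sizes, and the threshold are all hypothetical choices for illustration, not the authors' method or parameters.

```python
def gc_profile(seq, window=50, step=10):
    """Return (window_start, gc_fraction) pairs for a sliding window.

    `window` and `step` are illustrative defaults, not values from the paper.
    """
    seq = seq.upper()
    profile = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        gc_count = sum(base in "GC" for base in chunk)
        profile.append((start, gc_count / window))
    return profile


def find_spikes(profile, threshold=0.2):
    """Flag windows whose GC fraction differs from the previous window
    by at least `threshold` (positive = rise, negative = drop)."""
    spikes = []
    for (_, previous), (position, current) in zip(profile, profile[1:]):
        delta = current - previous
        if abs(delta) >= threshold:
            spikes.append((position, round(delta, 3)))
    return spikes


# Toy sequence (made up): AT-only flanks around a GC-only core,
# standing in for a transcribed region with sharp boundaries.
toy = "AT" * 50 + "GC" * 50 + "AT" * 50

# Non-overlapping 50 bp windows keep the arithmetic easy to follow.
spikes = find_spikes(gc_profile(toy, window=50, step=50), threshold=0.5)
print(spikes)  # [(100, 1.0), (200, -1.0)]
```

On this toy input the sketch reports a positive jump where the GC-only core begins and a negative jump where it ends, mirroring the positive and negative spikes the abstract describes at transcription start and stop sites in vertebrates. Real genomic data would need annotated transcription boundaries and a noise-tolerant threshold.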

