基因组中等程度的不均匀性与大量的RNA二级结构相关。

Genomic mid-range inhomogeneity correlates with an abundance of RNA secondary structures.

作者信息

Bechtel Jason M, Wittenschlaeger Thomas, Dwyer Trisha, Song Jun, Arunachalam Sasi, Ramakrishnan Sadeesh K, Shepard Samuel, Fedorov Alexei

机构信息

Program in Bioinformatics and Proteomics/Genomics, University of Toledo Health Science Campus, Toledo, OH 43614, USA.

出版信息

BMC Genomics. 2008 Jun 12;9:284. doi: 10.1186/1471-2164-9-284.

DOI:10.1186/1471-2164-9-284

PMID:18549495

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2442090/

Abstract

BACKGROUND

Genomes possess different levels of non-randomness, in particular, an inhomogeneity in their nucleotide composition. Inhomogeneity is manifest from the short-range where neighboring nucleotides influence the choice of base at a site, to the long-range, commonly known as isochores, where a particular base composition can span millions of nucleotides. A separate genomic issue that has yet to be thoroughly elucidated is the role that RNA secondary structure (SS) plays in gene expression.

RESULTS

We present novel data and approaches that show that a mid-range inhomogeneity (~30 to 1000 nt) not only exists in mammalian genomes but is also significantly associated with strong RNA SS. A whole-genome bioinformatics investigation of local SS in a set of 11,315 non-redundant human pre-mRNA sequences has been carried out. Four distinct components of these molecules (5'-UTRs, exons, introns and 3'-UTRs) were considered separately, since they differ in overall nucleotide composition, sequence motifs and periodicities. For each pre-mRNA component, the abundance of strong local SS (< -25 kcal/mol) was a factor of two to ten greater than a random expectation model. The randomization process preserves the short-range inhomogeneity of the corresponding natural sequences, thus, eliminating short-range signals as possible contributors to any observed phenomena.

CONCLUSION

We demonstrate that the excess of strong local SS in pre-mRNAs is linked to the little explored phenomenon of genomic mid-range inhomogeneity (MRI). MRI is an interdependence between nucleotide choice and base composition over a distance of 20-1000 nt. Additionally, we have created a public computational resource to support further study of genomic MRI.

摘要

背景

基因组具有不同程度的非随机性，特别是其核苷酸组成存在不均匀性。这种不均匀性从短程范围（相邻核苷酸影响某一位置碱基的选择）到长程范围（通常称为等密度区，特定碱基组成可跨越数百万个核苷酸）都有体现。另一个尚未得到充分阐明的基因组问题是RNA二级结构（SS）在基因表达中所起的作用。

结果

我们展示了新的数据和方法，表明中等范围的不均匀性（约30至1000个核苷酸）不仅存在于哺乳动物基因组中，而且与强大的RNA二级结构显著相关。我们对一组11315条非冗余人类前体mRNA序列中的局部二级结构进行了全基因组生物信息学研究。由于这些分子的四个不同组成部分（5'非翻译区、外显子、内含子和3'非翻译区）在整体核苷酸组成、序列基序和周期性方面存在差异，因此对它们分别进行了考虑。对于每个前体mRNA组成部分，强大局部二级结构（<-25千卡/摩尔）的丰度比随机期望模型高出两到十倍。随机化过程保留了相应自然序列的短程不均匀性，从而消除了短程信号作为任何观察到的现象的可能贡献因素。

结论

我们证明，前体mRNA中强大局部二级结构的过量与尚未充分探索的基因组中等范围不均匀性（MRI）现象有关。MRI是指在20至1000个核苷酸的距离上核苷酸选择与碱基组成之间的相互依存关系。此外，我们创建了一个公共计算资源，以支持对基因组MRI的进一步研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d80/2442090/a545a087f981/1471-2164-9-284-1.jpg

相似文献

Genomic mid-range inhomogeneity correlates with an abundance of RNA secondary structures.基因组中等程度的不均匀性与大量的RNA二级结构相关。

BMC Genomics. 2008 Jun 12;9:284. doi: 10.1186/1471-2164-9-284.

Structural RNA has lower folding energy than random RNA of the same dinucleotide frequency.结构RNA比具有相同二核苷酸频率的随机RNA具有更低的折叠能。

RNA. 2005 May;11(5):578-91. doi: 10.1261/rna.7220505.

5'- and 3'-noncoding regions in flavivirus RNA.黄病毒RNA中的5'和3'非编码区。

Adv Virus Res. 2003;59:177-228. doi: 10.1016/s0065-3527(03)59006-6.

RNAMotif, an RNA secondary structure definition and search algorithm.RNA基序，一种RNA二级结构定义及搜索算法。

Nucleic Acids Res. 2001 Nov 15;29(22):4724-35. doi: 10.1093/nar/29.22.4724.

[Structure and function of the non-coding regions of hepatitis C viral RNA].[丙型肝炎病毒RNA非编码区的结构与功能]

Postepy Biochem. 2006;52(1):62-71.

The influence of the nucleotide sequences of random Shine-Dalgarno and spacer region on bovine growth hormone gene expression.随机Shine-Dalgarno序列和间隔区的核苷酸序列对牛生长激素基因表达的影响。

J Microbiol. 2006 Feb;44(1):64-71.

Structural and functional features of eukaryotic mRNA untranslated regions.真核生物mRNA非翻译区的结构和功能特征

Gene. 2001 Oct 3;276(1-2):73-81. doi: 10.1016/s0378-1119(01)00674-6.

In vitro RNA synthesis from exogenous dengue viral RNA templates requires long range interactions between 5'- and 3'-terminal regions that influence RNA structure.从外源性登革病毒RNA模板进行的体外RNA合成需要5'和3'末端区域之间的长程相互作用，这种相互作用会影响RNA结构。

J Biol Chem. 2001 May 11;276(19):15581-91. doi: 10.1074/jbc.M010923200. Epub 2001 Feb 5.

Elements located upstream and downstream of the major splice donor site influence the ability of HIV-2 leader RNA to dimerize in vitro.位于主要剪接供体位点上游和下游的元件会影响HIV-2前导RNA在体外二聚化的能力。

Biochemistry. 2003 Mar 11;42(9):2634-42. doi: 10.1021/bi0271190.

Folding free energies of 5'-UTRs impact post-transcriptional regulation on a genomic scale in yeast.5'非翻译区的折叠自由能在基因组水平上影响酵母的转录后调控。

PLoS Comput Biol. 2005 Dec;1(7):e72. doi: 10.1371/journal.pcbi.0010072. Epub 2005 Dec 9.

引用本文的文献

Profound Non-Randomness in Dinucleotide Arrangements within Ultra-Conserved Non-Coding Elements and the Human Genome.超保守非编码元件及人类基因组中双核苷酸排列的深度非随机性

Biology (Basel). 2023 Aug 12;12(8):1125. doi: 10.3390/biology12081125.

Nucleotide Composition of Ultra-Conserved Elements Shows Excess of GpC and Depletion of GG and CC Dinucleotides.超保守元件的核苷酸组成显示 GpC 的过剩和 GG 和 CC 二核苷酸的耗竭。

Genes (Basel). 2022 Nov 7;13(11):2053. doi: 10.3390/genes13112053.

Adapting Biased Gene Conversion theory to account for intensive GC-content deterioration in the human genome by novel mutations.将有偏基因转换理论改编为新的突变，以解释人类基因组中密集 GC 含量的恶化。

PLoS One. 2020 Apr 30;15(4):e0232167. doi: 10.1371/journal.pone.0232167. eCollection 2020.

The common origin of symmetry and structure in genetic sequences.遗传序列中对称和结构的共同起源。

Sci Rep. 2018 Oct 25;8(1):15817. doi: 10.1038/s41598-018-34136-w.

1000 human genomes carry widespread signatures of GC biased gene conversion.1000 个人类基因组携带广泛的 GC 偏向基因转换的特征。

BMC Genomics. 2018 Apr 16;19(1):256. doi: 10.1186/s12864-018-4593-1.

Genome evolution by matrix algorithms: cellular automata approach to population genetics.基于矩阵算法的基因组进化：细胞自动机在群体遗传学中的应用方法

Genome Biol Evol. 2014 Apr;6(4):988-99. doi: 10.1093/gbe/evu075.

Genomic MRI--a public resource for studying sequence patterns within genomic DNA.

J Vis Exp. 2011 May 9(51):2663. doi: 10.3791/2663.

Insights into metazoan evolution from Alvinella pompejana cDNAs.从 Alvinella pompejana cDNA 中洞察后生动物的进化。

BMC Genomics. 2010 Nov 16;11:634. doi: 10.1186/1471-2164-11-634.

Critical association of ncRNA with introns.非编码 RNA 与内含子的关键关联。

Nucleic Acids Res. 2011 Mar;39(6):2357-66. doi: 10.1093/nar/gkq1080. Epub 2010 Nov 10.

The peculiarities of large intron splicing in animals.动物中大内含子剪接的特点。

PLoS One. 2009 Nov 16;4(11):e7853. doi: 10.1371/journal.pone.0007853.

本文引用的文献

The neoselectionist theory of genome evolution.基因组进化的新选择主义理论

Proc Natl Acad Sci U S A. 2007 May 15;104(20):8385-90. doi: 10.1073/pnas.0701652104. Epub 2007 May 9.

Genome-wide transcription and the implications for genomic organization.全基因组转录及其对基因组组织的影响。

Nat Rev Genet. 2007 Jun;8(6):413-23. doi: 10.1038/nrg2083. Epub 2007 May 8.

Human catechol-O-methyltransferase haplotypes modulate protein expression by altering mRNA secondary structure.人类儿茶酚-O-甲基转移酶单倍型通过改变mRNA二级结构来调节蛋白质表达。

Science. 2006 Dec 22;314(5807):1930-3. doi: 10.1126/science.1131262.

CpGcluster: a distance-based algorithm for CpG-island detection.CpG簇：一种基于距离的CpG岛检测算法。

BMC Bioinformatics. 2006 Oct 12;7:446. doi: 10.1186/1471-2105-7-446.

A machine learning strategy to identify candidate binding sites in human protein-coding sequence.一种用于识别人类蛋白质编码序列中候选结合位点的机器学习策略。

BMC Bioinformatics. 2006 Sep 26;7:419. doi: 10.1186/1471-2105-7-419.

A new perspective on isochore evolution.等容线进化的新视角。

Gene. 2006 Dec 30;385:71-4. doi: 10.1016/j.gene.2006.04.030. Epub 2006 Aug 5.

A systematic analysis of disease-associated variants in the 3' regulatory regions of human protein-coding genes II: the importance of mRNA secondary structure in assessing the functionality of 3' UTR variants.人类蛋白质编码基因3'调控区疾病相关变异的系统分析II：mRNA二级结构在评估3'非翻译区变异功能中的重要性

Hum Genet. 2006 Oct;120(3):301-33. doi: 10.1007/s00439-006-0218-x. Epub 2006 Jun 29.

Advances in the Exon-Intron Database (EID).外显子-内含子数据库（EID）的进展。

Brief Bioinform. 2006 Jun;7(2):178-85. doi: 10.1093/bib/bbl003. Epub 2006 Mar 9.

Impact of RNA structure on the prediction of donor and acceptor splice sites.RNA结构对供体和受体剪接位点预测的影响

BMC Bioinformatics. 2006 Jun 13;7:297. doi: 10.1186/1471-2105-7-297.

A periodic pattern of mRNA secondary structure created by the genetic code.由遗传密码产生的mRNA二级结构的周期性模式。

Nucleic Acids Res. 2006 May 8;34(8):2428-37. doi: 10.1093/nar/gkl287. Print 2006.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基因组中等程度的不均匀性与大量的RNA二级结构相关。

Genomic mid-range inhomogeneity correlates with an abundance of RNA secondary structures.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献