• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

真核生物基因组显示出 -mer 组成的强烈进化保守性以及内含子和基因间区之间的相关性贡献。

Eukaryotic Genomes Show Strong Evolutionary Conservation of -mer Composition and Correlation Contributions between Introns and Intergenic Regions.

机构信息

Kirchhoff-Institute for Physics, Heidelberg University, INF 227, 69117 Heidelberg, Germany.

出版信息

Genes (Basel). 2021 Oct 1;12(10):1571. doi: 10.3390/genes12101571.

DOI:10.3390/genes12101571
PMID:34680967
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8536142/
Abstract

Several strongly conserved DNA sequence patterns in and between introns and intergenic regions (IIRs) consisting of short tandem repeats (STRs) with repeat lengths <3 bp have already been described in the kingdom of . In this work, we expanded the search and analysis of conserved DNA sequence patterns to a wider range of genomes. Our aims were to confirm the conservation of these patterns, to support the hypothesis on their functional constraints and/or the identification of unknown patterns. We pairwise compared genomic DNA sequences of genes, exons, CDS, introns and intergenic regions of 34 (land plants), 30 and 29 using established -mer-based (alignment-free) comparison methods. Additionally, the results were compared with values derived for in former studies. We confirmed strong correlations between the sequence structures of IIRs spanning over the entire domain of . We found that the high correlations within introns, intergenic regions and between the two are a result of conserved abundancies of STRs with repeat units ≤2 bp (e.g., (AT)n). For some sequence patterns and their inverse complementary sequences, we found a violation of equal distribution on complementary DNA strands in a subset of genomes. Looking at mismatches within the identified STR patterns, we found specific preferences for certain nucleotides stable over all four phylogenetic kingdoms. We conclude that all of these conserved patterns between IIRs indicate a shared function of these sequence structures related to STRs.

摘要

已经在域中描述了内含子和基因间区(IIRs)内和之间的几个由短串联重复(STR)组成的高度保守的 DNA 序列模式,这些 STR 的重复长度<3bp。在这项工作中,我们将保守的 DNA 序列模式的搜索和分析扩展到更广泛的范围。我们的目的是确认这些模式的保守性,支持它们的功能约束假说和/或鉴定未知的模式。我们使用已建立的基于-mer 的(无比对)比较方法,对 34 个(陆地植物)、30 个和 29 个的基因、外显子、CDS、内含子和基因间区的基因组 DNA 序列进行了两两比较。此外,将结果与以前研究中为得出的值进行了比较。我们证实了跨越整个域的 IIRs 的序列结构之间存在很强的相关性。我们发现,内含子、基因间区和两者之间的高度相关性是由于重复单元≤2bp 的 STR (例如(AT)n)的丰富度保持不变所致。对于一些序列模式及其互补序列,我们在一部分基因组中发现互补 DNA 链上存在分配不均的情况。观察鉴定出的 STR 模式内的错配,我们发现特定核苷酸在所有四个系统发育域中都具有稳定的偏好。我们得出的结论是,IIRs 之间的所有这些保守模式都表明这些与 STR 相关的序列结构具有共同的功能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/1ee37d2b1d24/genes-12-01571-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/8d26f21e7cab/genes-12-01571-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/6b7e313a9e88/genes-12-01571-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/501c6db46d78/genes-12-01571-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/a89ab0352be6/genes-12-01571-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/fd690eae2752/genes-12-01571-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/1ee37d2b1d24/genes-12-01571-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/8d26f21e7cab/genes-12-01571-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/6b7e313a9e88/genes-12-01571-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/501c6db46d78/genes-12-01571-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/a89ab0352be6/genes-12-01571-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/fd690eae2752/genes-12-01571-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/989e/8536142/1ee37d2b1d24/genes-12-01571-g006.jpg

相似文献

1
Eukaryotic Genomes Show Strong Evolutionary Conservation of -mer Composition and Correlation Contributions between Introns and Intergenic Regions.真核生物基因组显示出 -mer 组成的强烈进化保守性以及内含子和基因间区之间的相关性贡献。
Genes (Basel). 2021 Oct 1;12(10):1571. doi: 10.3390/genes12101571.
2
Conservation of k-mer Composition and Correlation Contribution between Introns and Intergenic Regions of Animalia Genomes.动物基因组内含子与基因间区域之间k-mer组成的保守性及相关性贡献
Genes (Basel). 2018 Oct 4;9(10):482. doi: 10.3390/genes9100482.
3
Evolutionary implications of intron-exon distribution and the properties and sequences of the RPL10A gene in eukaryotes.内含子-外显子分布的进化意义以及真核生物中 RPL10A 基因的性质和序列。
Mol Phylogenet Evol. 2013 Mar;66(3):857-67. doi: 10.1016/j.ympev.2012.11.013. Epub 2012 Nov 29.
4
Similar Ratios of Introns to Intergenic Sequence across Animal Genomes.动物基因组中内含子与基因间序列的相似比例。
Genome Biol Evol. 2017 Jun 1;9(6):1582-1598. doi: 10.1093/gbe/evx103.
5
Evolutionary convergence on highly-conserved 3' intron structures in intron-poor eukaryotes and insights into the ancestral eukaryotic genome.内含子较少的真核生物中高度保守的3'内含子结构的进化趋同以及对原始真核生物基因组的见解。
PLoS Genet. 2008 Aug 8;4(8):e1000148. doi: 10.1371/journal.pgen.1000148.
6
Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.对果蝇中保守非编码DNA的分析揭示了基因间序列和内含子序列中类似的限制。
Genome Res. 2001 Aug;11(8):1335-45. doi: 10.1101/gr.178701.
7
Intron evolution: testing hypotheses of intron evolution using the phylogenomics of tetraspanins.内含子进化:利用四跨膜蛋白的系统发育基因组学检验内含子进化假说。
PLoS One. 2009;4(3):e4680. doi: 10.1371/journal.pone.0004680. Epub 2009 Mar 5.
8
Contrasting patterns of sequence divergence and base composition between Drosophila introns and intergenic regions.果蝇内含子与基因间区域之间序列分歧和碱基组成的对比模式。
Biol Lett. 2006 Dec 22;2(4):604-7. doi: 10.1098/rsbl.2006.0521.
9
Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva.泰勒虫中非编码DNA的特性及潜在顺式调控元件的鉴定
BMC Genomics. 2008 Dec 3;9:582. doi: 10.1186/1471-2164-9-582.
10
Patterns of evolutionary constraints in intronic and intergenic DNA of Drosophila.果蝇内含子和基因间DNA的进化限制模式
Genome Res. 2004 Feb;14(2):273-9. doi: 10.1101/gr.1329204.

引用本文的文献

1
Short tandem repeats delineate gene bodies across eukaryotes.短串联重复序列描绘了真核生物中的基因体。
Nat Commun. 2024 Dec 30;15(1):10902. doi: 10.1038/s41467-024-55276-w.
2
Peculiar -mer Spectra Are Correlated with 3D Contact Frequencies and Breakpoint Regions in the Human Genome.奇异 -mer 谱与人类基因组中的 3D 接触频率和断点区域相关。
Genes (Basel). 2024 Sep 25;15(10):1247. doi: 10.3390/genes15101247.
3
Condensed Matter Systems Exposed to Radiation: Multiscale Theory, Simulations, and Experiment.受辐射的凝聚态物质系统:多尺度理论、模拟与实验

本文引用的文献

1
Predatory protists.捕食性原生生物。
Curr Biol. 2020 May 18;30(10):R510-R516. doi: 10.1016/j.cub.2020.03.052.
2
Eukaryotic transcription factors can track and control their target genes using DNA antennas.真核转录因子可以使用 DNA 天线来跟踪和控制其靶基因。
Nat Commun. 2020 Jan 28;11(1):540. doi: 10.1038/s41467-019-14217-8.
3
Conservation of k-mer Composition and Correlation Contribution between Introns and Intergenic Regions of Animalia Genomes.动物基因组内含子与基因间区域之间k-mer组成的保守性及相关性贡献
Chem Rev. 2024 Jul 10;124(13):8014-8129. doi: 10.1021/acs.chemrev.3c00902. Epub 2024 Jun 6.
4
A survey of k-mer methods and applications in bioinformatics.生物信息学中k-mer方法及其应用综述。
Comput Struct Biotechnol J. 2024 May 21;23:2289-2303. doi: 10.1016/j.csbj.2024.05.025. eCollection 2024 Dec.
5
Specific Patterns in Correlations of Super-Short Tandem Repeats (SSTRs) with G+C Content, Genic and Intergenic Regions, and Retrotransposons on All Human Chromosomes.所有人类染色体上超短串联重复序列(SSTRs)与 G+C 含量、基因和基因间区域以及反转录转座子相关性的特定模式。
Genes (Basel). 2023 Dec 25;15(1):33. doi: 10.3390/genes15010033.
6
Differences in alternative splicing and their potential underlying factors between animals and plants.动植物之间可变剪接的差异及其潜在的影响因素。
J Adv Res. 2024 Oct;64:83-98. doi: 10.1016/j.jare.2023.11.017. Epub 2023 Nov 20.
7
Moderation of Structural DNA Properties by Coupled Dinucleotide Contents in Eukaryotes.真核生物中双核苷酸含量对结构 DNA 性质的调节。
Genes (Basel). 2023 Mar 20;14(3):755. doi: 10.3390/genes14030755.
8
Spatial-Temporal Genome Regulation in Stress-Response and Cell-Fate Change.应激反应和细胞命运转变中的时空基因组调控。
Int J Mol Sci. 2023 Jan 31;24(3):2658. doi: 10.3390/ijms24032658.
9
Imaging Intron Evolution.成像内含子进化
Methods Protoc. 2022 Jun 24;5(4):53. doi: 10.3390/mps5040053.
Genes (Basel). 2018 Oct 4;9(10):482. doi: 10.3390/genes9100482.
4
The human noncoding genome defined by genetic diversity.遗传多样性定义的人类非编码基因组。
Nat Genet. 2018 Mar;50(3):333-337. doi: 10.1038/s41588-018-0062-7. Epub 2018 Feb 26.
5
Similar Ratios of Introns to Intergenic Sequence across Animal Genomes.动物基因组中内含子与基因间序列的相似比例。
Genome Biol Evol. 2017 Jun 1;9(6):1582-1598. doi: 10.1093/gbe/evx103.
6
K-mer Content, Correlation, and Position Analysis of Genome DNA Sequences for the Identification of Function and Evolutionary Features.用于功能和进化特征识别的基因组DNA序列的K-mer含量、相关性及位置分析
Genes (Basel). 2017 Apr 19;8(4):122. doi: 10.3390/genes8040122.
7
An expanded sequence context model broadly explains variability in polymorphism levels across the human genome.一个扩展的序列上下文模型广泛解释了人类基因组中多态性水平的变异性。
Nat Genet. 2016 Apr;48(4):349-55. doi: 10.1038/ng.3511. Epub 2016 Feb 15.
8
The 3D folding of metazoan genomes correlates with the association of similar repetitive elements.后生动物基因组的三维折叠与相似重复元件的关联相关。
Nucleic Acids Res. 2016 Jan 8;44(1):245-55. doi: 10.1093/nar/gkv1292. Epub 2015 Nov 24.
9
Repbase Update, a database of repetitive elements in eukaryotic genomes.Repbase Update,一个真核生物基因组中重复元件的数据库。
Mob DNA. 2015 Jun 2;6:11. doi: 10.1186/s13100-015-0041-9. eCollection 2015.
10
A LINE-1-encoded reverse transcriptase-dependent regulatory mechanism is active in embryogenesis and tumorigenesis.一种由LINE-1编码的依赖逆转录酶的调控机制在胚胎发育和肿瘤发生过程中具有活性。
Ann N Y Acad Sci. 2015 Apr;1341:164-71. doi: 10.1111/nyas.12637. Epub 2015 Jan 13.