Suppr超能文献

长链非编码RNA识别相关CpG岛检测与甲基化分析

LncRNA Recognition-Associated CpG Island Detection and Methylation Analysis.

作者信息

You Yujie, Xiao Ming, Zhang Le

机构信息

College of computer science, Sichuan University, Chengdu, China.

出版信息

Methods Mol Biol. 2025;2883:281-297. doi: 10.1007/978-1-0716-4290-0_12.

Abstract

CpG islands (CGIs) are rare, interspersed DNA sequences, which possess a significant deviation from background genomic distribution by exhibiting patterns of GC-rich and CpG-rich sequence, the density of which provides a good classification feature for long noncoding RNA (lncRNA) promoters. By reviewing previous CpG-related studies, we consider that the transcription regulation of about half of the human genes, mostly housekeeping (HK) genes, involves CGIs, their methylation states, CpG spacing, and other chromosomal parameters. However, the precise CGI definition and positioning of CGIs within gene structures, as well as specific CGI-associated regulatory mechanisms, all remain to be elucidated at individual gene and gene family levels, together with consideration of species and lineage specificity. Although previous studies have already classified CGIs into high-CpG (HCGI), intermediate-CpG (ICGI), and low-CpG (LCGI) densities based on CpG density variation, the correlation between CGI density and gene expression regulation, such as co-regulation of CGIs and TATA-box on HK genes, is not clear. Here, we introduce such a problem-solving protocol for human genome annotation, which is based on a combination of GTEx, JBLA, and GO analysis. Next, we discuss why CGI-associated genes are most likely regulated by HCGI and tend to be HK genes; The HCGI/TATA± and LCGI/TATA± combinations show different GO enrichment, whereas the ICGI/TATA± combination is less characteristic than LCGI/TATA± based on GO enrichment analysis.

摘要

CpG岛(CGIs)是罕见的、散布的DNA序列,其通过呈现富含GC和富含CpG的序列模式,与背景基因组分布存在显著偏差,其密度为长链非编码RNA(lncRNA)启动子提供了良好的分类特征。通过回顾以往与CpG相关的研究,我们认为大约一半的人类基因,主要是管家(HK)基因的转录调控涉及CpG岛、它们的甲基化状态、CpG间距和其他染色体参数。然而,CpG岛的精确定义、其在基因结构中的定位以及特定的与CpG岛相关的调控机制,在个体基因和基因家族水平上仍有待阐明,同时需要考虑物种和谱系特异性。尽管以往的研究已经根据CpG密度变化将CpG岛分为高CpG(HCGI)、中等CpG(ICGI)和低CpG(LCGI)密度,但CpG岛密度与基因表达调控之间的相关性,如CpG岛与HK基因上TATA盒的共同调控,尚不清楚。在这里,我们介绍了一种基于GTEx、JBLA和GO分析相结合的人类基因组注释问题解决方案。接下来,我们讨论为什么与CpG岛相关的基因最有可能受HCGI调控且倾向于是HK基因;基于GO富集分析,HCGI/TATA±和LCGI/TATA±组合显示出不同的GO富集,而ICGI/TATA±组合的特征性不如LCGI/TATA±。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验