Compositional searching of CpG islands in the human genome.

Authors

Luque-Escamilla Pedro Luis, Martínez-Aroza José, Oliver José L, Gómez-Lopera Juan Francisco, Román-Roldán Ramón

Affiliation

Department of Engineering and Mining Mechanics, University of Jaén, Escuela Politécnica Superior, Campus Las Lagunillas s/n, 23071 Jaén, Spain.

Publication

Phys Rev E Stat Nonlin Soft Matter Phys. 2005 Jun;71(6 Pt 1):061925. doi: 10.1103/PhysRevE.71.061925. Epub 2005 Jun 29.

DOI:10.1103/PhysRevE.71.061925
PMID:16089783
Abstract

We report on an entropic edge detector based on the local calculation of the Jensen-Shannon divergence, with application to the search for CpG islands. CpG islands are regions of the genome related to gene expression and cell differentiation, and thus to cancer formation. Searching for CpG islands is a major task in genetics and bioinformatics. Some algorithms proposed in the literature rely on moving statistics in a sliding window, but the window size may greatly influence the results. The local use of the Jensen-Shannon divergence is a completely different strategy: the nucleotide composition inside the islands differs from that of their surroundings, so a statistical distance between the compositions of two adjacent windows, here the Jensen-Shannon divergence, may be used as a measure of their dissimilarity. Sliding this double window over the entire sequence allows us to segment it compositionally. These segments must then be merged into larger ones that satisfy certain identification criteria in order to obtain the definitive results. We find that, compared with other algorithms in the literature, the local use of the Jensen-Shannon divergence is well suited to processing DNA sequences in search of compositionally distinct structures such as CpG islands.
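The double-window idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the `half_window` parameter, and the length-weighted form of the Jensen-Shannon divergence are assumptions chosen for illustration, and the input is assumed to contain only the letters A, C, G and T. The later merging of segments and the CpG-island identification criteria are not shown, since the abstract does not specify them.

```python
import math
from collections import Counter

ALPHABET = "ACGT"  # assumption: the sequence contains only these four letters


def composition(segment):
    """Relative nucleotide frequencies of a segment, in ALPHABET order."""
    counts = Counter(segment)
    total = sum(counts[base] for base in ALPHABET)
    if total == 0:
        return [0.0] * len(ALPHABET)
    return [counts[base] / total for base in ALPHABET]


def shannon_entropy(probs):
    """Shannon entropy in bits; zero-probability terms contribute nothing."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def js_divergence(left, right):
    """Length-weighted Jensen-Shannon divergence (bits) between two segments."""
    w_left = len(left) / (len(left) + len(right))
    w_right = 1.0 - w_left
    p, q = composition(left), composition(right)
    mixture = [w_left * a + w_right * b for a, b in zip(p, q)]
    return (shannon_entropy(mixture)
            - w_left * shannon_entropy(p)
            - w_right * shannon_entropy(q))


def divergence_profile(seq, half_window):
    """Slide two adjacent windows over seq; return (position, divergence) pairs.

    The position is the index of the boundary between the two windows.
    """
    profile = []
    for i in range(half_window, len(seq) - half_window + 1):
        left = seq[i - half_window:i]
        right = seq[i:i + half_window]
        profile.append((i, js_divergence(left, right)))
    return profile
```

On a toy sequence such as `"AT" * 10 + "CG" * 10` with `half_window=10`, the profile peaks at position 20, the boundary between the AT-rich and CG-rich halves. Peaks in such a profile are candidate compositional edges; how they are thresholded and merged into islands would depend on criteria given in the full paper.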
