Suppr超能文献

IsoFinder:基因组序列中同线区的计算预测

IsoFinder: computational prediction of isochores in genome sequences.

作者信息

Oliver José L, Carpena Pedro, Hackenberg Michael, Bernaola-Galván Pedro

机构信息

Departamento de Genética, Instituto de Biotecnología, Facultad de Ciencias, Universidad de Granada, Spain.

出版信息

Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W287-92. doi: 10.1093/nar/gkh399.

Abstract

Isochores are long genome segments homogeneous in G+C. Here, we describe an algorithm (IsoFinder) running on the web (http://bioinfo2.ugr.es/IsoF/isofinder.html) able to predict isochores at the sequence level. We move a sliding pointer from left to right along the DNA sequence. At each position of the pointer, we compute the mean G+C values to the left and to the right of the pointer. We then determine the position of the pointer for which the difference between left and right mean values (as measured by the t-statistic) reaches its maximum. Next, we determine the statistical significance of this potential cutting point, after filtering out short-scale heterogeneities below 3 kb by applying a coarse-graining technique. Finally, the program checks whether this significance exceeds a probability threshold. If so, the sequence is cut at this point into two subsequences; otherwise, the sequence remains undivided. The procedure continues recursively for each of the two resulting subsequences created by each cut. This leads to the decomposition of a chromosome sequence into long homogeneous genome regions (LHGRs) with well-defined mean G+C contents, each significantly different from the G+C contents of the adjacent LHGRs. Most LHGRs can be identified with Bernardi's isochores, given their correlation with biological features such as gene density, SINE and LINE (short, long interspersed repetitive elements) densities, recombination rate or single nucleotide polymorphism variability. The resulting isochore maps are available at our web site (http://bioinfo2.ugr.es/isochores/), and also at the UCSC Genome Browser (http://genome.cse.ucsc.edu/).

摘要

等密度区是基因组中G+C含量均匀的长片段。在此,我们描述了一种在网页上运行的算法(IsoFinder,网址为http://bioinfo2.ugr.es/IsoF/isofinder.html),它能够在序列水平上预测等密度区。我们沿着DNA序列从左到右移动一个滑动指针。在指针的每个位置,我们计算指针左侧和右侧的平均G+C值。然后,我们确定指针的位置,此时左右平均值之间的差异(通过t统计量测量)达到最大值。接下来,在通过应用粗粒化技术滤除3 kb以下的短尺度异质性后,我们确定这个潜在切割点的统计显著性。最后,程序检查这个显著性是否超过概率阈值。如果是,则在这一点将序列切割成两个子序列;否则,序列保持未分割状态。对于每次切割产生的两个子序列中的每一个,该过程都递归继续。这导致染色体序列分解为具有明确平均G+C含量的长均质基因组区域(LHGRs),每个区域与相邻LHGRs的G+C含量有显著差异。考虑到大多数LHGRs与基因密度、SINE和LINE(短、长散布重复元件)密度、重组率或单核苷酸多态性变异性等生物学特征的相关性,它们可以被识别为伯纳迪的等密度区。生成的等密度区图谱可在我们的网站(http://bioinfo2.ugr.es/isochores/)以及UCSC基因组浏览器(http://genome.cse.ucsc.edu/)上获取。

相似文献

1
IsoFinder: computational prediction of isochores in genome sequences.IsoFinder:基因组序列中同线区的计算预测
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W287-92. doi: 10.1093/nar/gkh399.
2
Isochore chromosome maps of the human genome.人类基因组的等臂染色体图谱。
Gene. 2002 Oct 30;300(1-2):117-27. doi: 10.1016/s0378-1119(02)01034-x.
3
Isochore chromosome maps of eukaryotic genomes.真核生物基因组的等臂染色体图谱。
Gene. 2001 Oct 3;276(1-2):47-56. doi: 10.1016/s0378-1119(01)00641-2.
4
Discovering isochores by least-squares optimal segmentation.通过最小二乘最优分割发现等容线。
Gene. 2007 Jun 1;394(1-2):53-60. doi: 10.1016/j.gene.2007.01.028. Epub 2007 Feb 16.
5
Isochore structures in the chicken genome.鸡基因组中的等容线结构。
FEBS J. 2006 Apr;273(8):1637-48. doi: 10.1111/j.1742-4658.2006.05178.x.
6
Isochore structures in the mouse genome.小鼠基因组中的等容线结构。
Genomics. 2004 Mar;83(3):384-94. doi: 10.1016/j.ygeno.2003.09.011.
10
Prediction of replication time zones at single nucleotide resolution in the human genome.
FEBS Lett. 2008 Jul 9;582(16):2441-4. doi: 10.1016/j.febslet.2008.06.008. Epub 2008 Jun 12.

引用本文的文献

4
Compositional Structure of the Genome: A Review.基因组的组成结构:综述
Biology (Basel). 2023 Jun 13;12(6):849. doi: 10.3390/biology12060849.

本文引用的文献

1
Isochore structures in the mouse genome.小鼠基因组中的等容线结构。
Genomics. 2004 Mar;83(3):384-94. doi: 10.1016/j.ygeno.2003.09.011.
3
Isochores merit the prefix 'iso'.等密度区值得加上前缀“等”。
Comput Biol Chem. 2003 Feb;27(1):5-10. doi: 10.1016/s1476-9271(02)00090-7.
4
Isochore chromosome maps of the human genome.人类基因组的等臂染色体图谱。
Gene. 2002 Oct 30;300(1-2):117-27. doi: 10.1016/s0378-1119(02)01034-x.
5
A simple and species-independent coding measure.
Gene. 2002 Oct 30;300(1-2):97-104. doi: 10.1016/s0378-1119(02)01041-7.
6
Segmentation of genomic DNA through entropic divergence: power laws and scaling.基于熵散度的基因组DNA分割:幂律与标度
Phys Rev E Stat Nonlin Soft Matter Phys. 2002 May;65(5 Pt 1):051909. doi: 10.1103/PhysRevE.65.051909. Epub 2002 May 8.
7
Isochores: dream or reality?
Trends Biotechnol. 2002 Jun;20(6):237. doi: 10.1016/s0167-7799(02)01951-0.
8
Analysis of symbolic sequences using the Jensen-Shannon divergence.使用詹森 - 香农散度分析符号序列。
Phys Rev E Stat Nonlin Soft Matter Phys. 2002 Apr;65(4 Pt 1):041905. doi: 10.1103/PhysRevE.65.041905. Epub 2002 Mar 25.
10
Scale invariance in the nonstationarity of human heart rate.人类心率非平稳性中的尺度不变性。
Phys Rev Lett. 2001 Oct 15;87(16):168105. doi: 10.1103/PhysRevLett.87.168105. Epub 2001 Oct 2.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验