Messer Philipp W, Arndt Peter F
Max Planck Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin, Germany.
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W692-5. doi: 10.1093/nar/gkl234.
CorGen is a web server that measures long-range correlations in the base composition of DNA and generates random sequences with the same correlation parameters. Long-range correlations are characterized by a power-law decay of the auto correlation function of the GC-content. The widespread presence of such correlations in eukaryotic genomes calls for their incorporation into accurate null models of eukaryotic DNA in computational biology. For example, the score statistics of sequence alignment and the performance of motif finding algorithms are significantly affected by the presence of genomic long-range correlations. We use an expansion-randomization dynamics to efficiently generate the correlated random sequences. The server is available at http://corgen.molgen.mpg.de.
CorGen是一个网络服务器,用于测量DNA碱基组成中的长程相关性,并生成具有相同相关参数的随机序列。长程相关性的特征是GC含量的自相关函数呈幂律衰减。真核生物基因组中广泛存在这种相关性,这就要求在计算生物学中将其纳入真核生物DNA的精确零模型。例如,序列比对的得分统计和基序查找算法的性能会受到基因组长程相关性的显著影响。我们使用一种扩展随机化动力学来高效生成相关随机序列。该服务器可在http://corgen.molgen.mpg.de获取。