Paces Jan, Zíka Radek, Paces Václav, Pavlícek Adam, Clay Oliver, Bernardi Giorgio
Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Flemingovo 2, Prague CZ-16637, Czech Republic.
Gene. 2004 May 26;333:135-41. doi: 10.1016/j.gene.2004.02.041.
Genome sequencing now permits direct visual representation, at any scale, of GC heterogeneity along the chromosomes of several higher eukaryotes. Plots can be easily obtained from the chromosomal sequences, yet sequence releases of mammalian or plant chromosomes still tend to use small scales or window sizes that obscure important large-scale compositional features. To faithfully reveal, at one glance, the compositional variation at a given scale, we have devised a simple scheme that combines line plots with color-coded shading of the regions underneath the plots. The scheme can be applied to different eukaryotic genomes to facilitate their comparison, as illustrated here for a sample of chromosomes chosen from seven selected species. As a complement to a previously published compact view of isochores in the human genome sequence, we include here an analogous map for the recently sequenced mouse genome, and discuss the contribution of repetitive DNA to the GC variation along the plots. Supplementary information, including a database of color-coded GC profiles for all recently sequenced eukaryotes and the program draw_chromosomes_gc.pl used to obtain them, are available at.
现在,基因组测序能够以任何比例直接直观呈现几种高等真核生物染色体上的GC异质性。从染色体序列中可以轻松获得图谱,但哺乳动物或植物染色体的序列发布仍倾向于使用较小的比例或窗口大小,这会掩盖重要的大规模组成特征。为了在一瞥之间忠实地揭示给定比例下的组成变化,我们设计了一种简单的方案,该方案将线图与图下方区域的颜色编码阴影相结合。该方案可应用于不同的真核生物基因组,以方便它们之间的比较,这里以从七个选定物种中选取的一组染色体为例进行说明。作为对之前发表的人类基因组序列中同线区紧凑视图的补充,我们在此提供了最近测序的小鼠基因组的类似图谱,并讨论了重复DNA对沿图谱的GC变化的贡献。补充信息,包括所有最近测序的真核生物的颜色编码GC图谱数据库以及用于获取这些图谱的程序draw_chromosomes_gc.pl,可在[具体网址]获取。