Lee Jennifer M, Sonnhammer Erik L L
Center for Genomics and Bioinformatics, Karolinska Institutet, S171 77 Stockholm, Sweden.
Genome Res. 2003 May;13(5):875-82. doi: 10.1101/gr.737703. Epub 2003 Apr 14.
Genomic clustering of genes in a pathway is commonly found in prokaryotes because of transcriptional operons, but operons are not present in most eukaryotes. Pathway members might nevertheless cluster to a lesser extent in eukaryotic genomes, which would assist coregulation of a set of functionally cooperating genes. We analyzed five sequenced eukaryotic genomes for clustering of genes assigned to the same pathway in the KEGG database. Depending on the genome, between 30% and 98% of the analyzed pathways exhibited significantly higher clustering levels than expected by chance. In descending order of clustering level, the genomes studied were Saccharomyces cerevisiae, Homo sapiens, Caenorhabditis elegans, Arabidopsis thaliana, and Drosophila melanogaster. Surprisingly, the genomes show little agreement on which pathways are most clustered: only seven of the 69 pathways found in all species were significantly clustered in all five of them. This species-specific pattern of pathway clustering may reflect adaptations or evolutionary events unique to a particular lineage. We note that although operons are common in C. elegans, only 58% of its pathways showed significant clustering, which is less than in human. Virtually all pathways in S. cerevisiae showed significant clustering.
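The abstract does not spell out how "higher clustering than expected by chance" was tested. One common way to frame such a question is a permutation test, sketched below in Python. This is only an illustration under stated assumptions and not the authors' procedure: the clustering score (pairs of pathway genes within a fixed gene-order window on the same chromosome), the window size, the toy genome, and the function names `clustering_score` and `permutation_p_value` are all hypothetical.

```python
import random
from itertools import combinations


def clustering_score(positions, window):
    """Count gene pairs on the same chromosome within `window` gene-order steps.

    `positions` is a list of (chromosome, gene_order_index) tuples.
    This score is an assumption made for illustration only.
    """
    return sum(
        1
        for (chrom_a, idx_a), (chrom_b, idx_b) in combinations(positions, 2)
        if chrom_a == chrom_b and abs(idx_a - idx_b) <= window
    )


def permutation_p_value(genome, pathway_genes, window=10, n_perm=1000, seed=0):
    """Compare a pathway's score with scores of random gene sets of equal size.

    `genome` maps gene name -> (chromosome, gene_order_index).
    Returns (observed_score, empirical_p_value).
    """
    rng = random.Random(seed)
    all_genes = list(genome)
    observed = clustering_score([genome[g] for g in pathway_genes], window)
    hits = 0
    for _ in range(n_perm):
        sample = rng.sample(all_genes, len(pathway_genes))
        if clustering_score([genome[g] for g in sample], window) >= observed:
            hits += 1
    # Add-one correction so the empirical p-value is never exactly zero.
    return observed, (hits + 1) / (n_perm + 1)


# Toy genome (hypothetical): two chromosomes with 200 genes each.
genome = {
    f"{chrom}_g{i}": (chrom, i) for chrom in ("chr1", "chr2") for i in range(200)
}
# Hypothetical pathway whose six genes sit next to each other on chr1.
pathway = [f"chr1_g{i}" for i in range(50, 56)]

observed, p_value = permutation_p_value(genome, pathway)
print(observed, p_value)
```

In a sketch like this, a pathway would be called significantly clustered when its empirical p-value falls below a chosen threshold. A real analysis would also need to correct for testing many pathways and account for tandem duplicates, neither of which is handled here.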