Ezawa Kiyoshi, OOta Satoshi, Saitou Naruya
Division of Population Genetics, National Institute of Genetics, Mishima, Japan.
Mol Biol Evol. 2006 May;23(5):927-40. doi: 10.1093/molbev/msj093. Epub 2006 Jan 11.
Gene conversion is considered to play important roles in the formation of genomic makeup such as homogenization of multigene families and diversification of alleles. We devised two statistical tests on quartets for detecting gene conversion events. Each "quartet" consists of two pairs of orthologous sequences supposed to have been generated by a duplication event and a subsequent speciation of two closely related species. As example data, EnsEMBL mouse and rat cDNA sequences were used to obtain a genome-wide picture of gene conversion events. We extensively sampled 2,641 quartets that appear to have resulted from duplications after the divergence of primates and rodents and before mouse-rat speciation. Combination of our new tests with Sawyer's and Takahata's tests enhanced the detection sensitivity while keeping false positives as few as possible. About 18% (488 quartets) were shown to be highly positive for gene conversion using this combined test. Out of them, 340 (13% of the total) showed signs of gene conversion in mouse sequence pairs. Those gene conversion-positive gene pairs are mostly linked in the same chromosomes, with the proportion of positive pairs in the linked and unlinked categories being 15% and 1%, respectively. Statistical analyses showed that (1) the susceptibility to gene conversion correlates negatively with the physical distance, especially the frequency of 29% was observed for gene pairs whose distances are smaller than 55 kb; (2) the occurrence of gene conversions does not depend on the transcriptional direction; (3) small gene families consisting of between three and six contiguous genes are highly prone to gene conversion; and (4) frequency of gene conversions greatly varies depending on functional categories, and cadherins favor gene conversion, while vomeronasal receptors type 1 and immunoglobulin V-type proteins disfavor it. These findings will be useful to deepen the understanding of the roles of gene conversion.
基因转换被认为在基因组构成的形成过程中发挥着重要作用,比如多基因家族的同质化以及等位基因的多样化。我们设计了两种针对四重奏的统计测试来检测基因转换事件。每个“四重奏”由两对直系同源序列组成,假定它们是由一次复制事件以及随后两个亲缘关系密切的物种的物种形成所产生的。作为示例数据,使用了EnsEMBL小鼠和大鼠的cDNA序列来获取基因转换事件的全基因组图谱。我们广泛采样了2641个四重奏,这些四重奏似乎是在灵长类动物和啮齿动物分化之后、小鼠 - 大鼠物种形成之前由复制产生的。我们的新测试与索耶(Sawyer)和高畑(Takahata)的测试相结合,在尽可能减少假阳性的同时提高了检测灵敏度。使用这种联合测试,约18%(488个四重奏)显示出基因转换高度阳性。其中,340个(占总数的13%)在小鼠序列对中显示出基因转换的迹象。那些基因转换阳性的基因对大多位于同一条染色体上,在连锁和非连锁类别中阳性对的比例分别为15%和1%。统计分析表明:(1)对基因转换的敏感性与物理距离呈负相关,尤其是对于距离小于55 kb的基因对,观察到29%的频率;(2)基因转换的发生不依赖于转录方向;(3)由三到六个连续基因组成的小基因家族极易发生基因转换;(4)基因转换的频率因功能类别而异,钙黏着蛋白有利于基因转换,而1型犁鼻器受体和免疫球蛋白V型蛋白则不利于基因转换。这些发现将有助于加深对基因转换作用的理解。