Department of Horticulture, Michigan State University, East Lansing, Michigan 48824.
Plant Physiol. 2013 Nov;163(3):1419-32. doi: 10.1104/pp.113.223271. Epub 2013 Sep 12.
The process of gene duplication followed by sequence and functional divergence is important for the generation of new genes. Pack-MULEs, nonautonomous Mutator-like elements (MULEs) that carry genic sequence(s), are potentially involved in generating new open reading frames and regulating parental gene expression. These elements are identified in many plant genomes and are most abundant in rice (Oryza sativa). Despite the abundance of Pack-MULEs, the mechanism by which parental genes are captured by Pack-MULEs remains largely unknown. In this study, we identified all MULEs in rice and examined factors likely important for sequence acquisition. Terminal inverted repeat MULEs are the predominant MULE type and account for the majority of the Pack-MULEs. In addition to genic sequences, rice MULEs capture guanine-cytosine (GC)-rich intergenic sequences, albeit at a much lower frequency. MULEs carrying nontransposon sequences have longer terminal inverted repeats and higher GC content in terminal and subterminal regions. An overrepresentation of genes with known functions and genes with orthologs among parental genes of Pack-MULEs is observed in rice, maize (Zea mays), and Arabidopsis (Arabidopsis thaliana), suggesting preferential acquisition for bona fide genes by these elements. Pack-MULEs selectively acquire/retain parental sequences through a combined effect of GC content and breadth of expression, with GC content playing a stronger role. Increased GC content and number of tissues with detectable expression result in higher chances of a gene being acquired by Pack-MULEs. Such selective acquisition/retention provides these elements greater chances of carrying functional sequences that may provide new genetic resources for the evolution of new genes or the modification of existing genes.
基因复制,随后序列和功能分化的过程,对新基因的产生很重要。Pack-MULE 是一种非自主的 Mutator-like 元件 (MULE),可携带基因序列,可能参与产生新的开放阅读框和调节亲本基因表达。这些元件在许多植物基因组中都有发现,在水稻 (Oryza sativa) 中最为丰富。尽管 Pack-MULE 数量众多,但亲本基因被 Pack-MULE 捕获的机制在很大程度上仍不清楚。在这项研究中,我们鉴定了水稻中的所有 MULE,并研究了可能对序列获取很重要的因素。末端反向重复 MULE 是主要的 MULE 类型,占 Pack-MULE 的大多数。除了基因序列,水稻 MULE 还捕获富含鸟嘌呤-胞嘧啶 (GC) 的基因间序列,尽管频率要低得多。携带非转座子序列的 MULE 具有更长的末端反向重复和更高的末端和次末端区域 GC 含量。在水稻、玉米 (Zea mays) 和拟南芥 (Arabidopsis thaliana) 中观察到,携带已知功能基因和 Pack-MULE 亲本基因的直系同源基因的基因数量增加,这表明这些元件优先获取真正的基因。Pack-MULE 通过 GC 含量和表达广度的综合效应选择性地获取/保留亲本序列,GC 含量起着更强的作用。GC 含量增加和可检测表达的组织数量增加,使基因被 Pack-MULE 获得的几率更高。这种选择性的获取/保留使这些元件更有可能携带可能为新基因的进化或现有基因的修饰提供新遗传资源的功能序列。