Department of Crop, Soil and Environmental Science Auburn University, Auburn, AL, USA.
Department of Biological Sciences, Auburn University, Auburn, AL, USA.
BMC Genomics. 2021 Mar 12;22(1):175. doi: 10.1186/s12864-021-07447-y.
Finger millet (Eleusine coracana 2n = 4x = 36) is a hardy, nutraceutical, climate change tolerant, orphan crop that is consumed throughout eastern Africa and India. Its genome has been sequenced multiple times, but A and B subgenomes could not be separated because no published genome for E. indica existed. The classification of A and B subgenomes is important for understanding the evolution of this crop and provide a means to improve current and future breeding programs.
We produced subgenome calls for 704 syntenic blocks and inferred A or B subgenomic identity for 59,377 genes 81% of the annotated genes. Phylogenetic analysis of a super matrix containing 455 genes shows high support for A and B divergence within the Eleusine genus. Synonymous substitution rates between A and B genes support A and B calls. The repetitive content on highly supported B contigs is higher than that on similar A contigs. Analysis of syntenic singletons showed evidence of biased fractionation showed a pattern of A genome dominance, with 61% A, 37% B and 1% unassigned, and was further supported by the pattern of loss observed among cyto-nuclear interacting genes.
The evidence of individual gene calls within each syntenic block, provides a powerful tool for inference for subgenome classification. Our results show the utility of a draft genome in resolving A and B subgenomes calls, primarily it allows for the proper polarization of A and B syntenic blocks. There have been multiple calls for the use of phylogenetic inference in subgenome classification, our use of synteny is a practical application in a system that has only one parental genome available.
手指粟(Eleusine coracana 2n = 4x = 36)是一种坚韧、具有营养保健功能、能耐受气候变化的 orphan 作物,在东非和印度各地都有食用。其基因组已多次测序,但由于没有发表的印度手指粟基因组,因此无法分离 A 和 B 亚基因组。A 和 B 亚基因组的分类对于理解该作物的进化很重要,并为改进当前和未来的育种计划提供了一种手段。
我们对 704 个同源块进行了亚基因组调用,并对 59377 个基因中的 81%推断出 A 或 B 亚基因组身份,这些基因是注释基因的 81%。包含 455 个基因的超级矩阵的系统发育分析为 Eleusine 属内 A 和 B 的分化提供了高度支持。A 和 B 基因之间的同义替换率支持 A 和 B 的调用。高度支持的 B 连续基因上的重复含量高于类似的 A 连续基因上的重复含量。对同线性单体的分析表明,偏分离的证据显示出 A 基因组优势的模式,A 占 61%,B 占 37%,未分配的占 1%,并且核质相互作用基因的缺失模式进一步支持了这一模式。
每个同源块内的个体基因调用的证据,为亚基因组分类的推断提供了有力的工具。我们的结果表明,在解决 A 和 B 亚基因组调用方面,草案基因组是有用的,它主要允许正确极化 A 和 B 同线性块。已经有多次呼吁在亚基因组分类中使用系统发育推断,我们对同线性的使用是在只有一个亲本基因组可用的系统中的实际应用。