Lloyd Evans Dyfed, Hlongwane Thandekile Thandiwe, Joshi Shailesh V, Riaño Pachón Diego M
Plant Breeding, South African Sugarcane Research Institute, Durban, KwaZulu-Natal, South Africa.
Cambridge Sequence Services (CSS), Waterbeach, Cambridgeshire, UK.
PeerJ. 2019 Sep 24;7:e7558. doi: 10.7717/peerj.7558. eCollection 2019.
Chloroplast genomes provide insufficient phylogenetic information to distinguish between closely related sugarcane cultivars, due to the recent origin of many cultivars and the conserved sequence of the chloroplast. In comparison, the mitochondrial genome of plants is much larger and more plastic and could contain increased phylogenetic signals. We assembled a consensus reference mitochondrion with Illumina TruSeq synthetic long reads and Oxford Nanopore Technologies MinION long reads. Based on this assembly we also analyzed the mitochondrial transcriptomes of sugarcane and sorghum and improved the annotation of the sugarcane mitochondrion as compared with other species.
Mitochondrial genomes were assembled from genomic read pools using a bait and assemble methodology. The mitogenome was exhaustively annotated using BLAST and transcript datasets were mapped with HISAT2 prior to analysis with the Integrated Genome Viewer.
The sugarcane mitochondrion is comprised of two independent chromosomes, for which there is no evidence of recombination. Based on the reference assembly from the sugarcane cultivar SP80-3280 the mitogenomes of four additional cultivars (R570, LCP85-384, RB72343 and SP70-1143) were assembled (with the SP70-1143 assembly utilizing both genomic and transcriptomic data). We demonstrate that the sugarcane plastome is completely transcribed and we assembled the chloroplast genome of SP80-3280 using transcriptomic data only. Phylogenomic analysis using mitogenomes allow closely related sugarcane cultivars to be distinguished and supports the discrimination between and as modern sugarcane's female parent. From whole chloroplast comparisons, we demonstrate that modern sugarcane arose from a limited number of female founders. Transcriptomic and spliceosomal analyses reveal that the two chromosomes of the sugarcane mitochondrion are combined at the transcript level and that splice sites occur more frequently within gene coding regions than without. We reveal one confirmed and one potential cytoplasmic male sterility (CMS) factor in the sugarcane mitochondrion, both of which are transcribed.
Transcript processing in the sugarcane mitochondrion is highly complex with diverse splice events, the majority of which span the two chromosomes. PolyA baited transcripts are consistent with the use of polyadenylation for transcript degradation. For the first time we annotate two CMS factors within the sugarcane mitochondrion and demonstrate that sugarcane possesses all the molecular machinery required for CMS and rescue. A mechanism of cross-chromosomal splicing based on guide RNAs is proposed. We also demonstrate that mitogenomes can be used to perform phylogenomic studies on sugarcane cultivars.
由于许多甘蔗品种起源较近且叶绿体序列保守,叶绿体基因组提供的系统发育信息不足以区分亲缘关系较近的甘蔗品种。相比之下,植物的线粒体基因组要大得多且可塑性更强,可能包含更多的系统发育信号。我们利用Illumina TruSeq合成长读段和牛津纳米孔技术MinION长读段组装了一个共识参考线粒体。基于此组装结果,我们还分析了甘蔗和高粱的线粒体转录组,并与其他物种相比改进了甘蔗线粒体的注释。
使用诱饵和组装方法从基因组读段库中组装线粒体基因组。使用BLAST对有丝分裂基因组进行详尽注释,并在使用综合基因组浏览器进行分析之前,用HISAT2将转录组数据集进行比对。
甘蔗线粒体由两条独立的染色体组成,没有重组的证据。基于甘蔗品种SP80 - 3280的参考组装结果,组装了另外四个品种(R570、LCP85 - 384、RB72343和SP70 - 1143)的有丝分裂基因组(SP70 - 1143的组装同时利用了基因组和转录组数据)。我们证明甘蔗质体基因组完全转录,并且仅使用转录组数据组装了SP80 - 3280的叶绿体基因组。使用有丝分裂基因组进行系统发育分析能够区分亲缘关系较近的甘蔗品种,并支持将 和 区分为现代甘蔗的母本。通过全叶绿体比较,我们证明现代甘蔗起源于有限数量的 雌性亲本。转录组和剪接体分析表明,甘蔗线粒体的两条染色体在转录水平上结合,并且剪接位点在基因编码区内比在基因编码区外更频繁出现。我们在甘蔗线粒体中发现了一个已确认的和一个潜在的细胞质雄性不育(CMS)因子,两者均被转录。
甘蔗线粒体中的转录加工高度复杂,存在多种剪接事件,其中大多数跨越两条染色体。聚腺苷酸化诱饵转录本与利用多聚腺苷酸化进行转录降解一致。我们首次在甘蔗线粒体中注释了两个CMS因子,并证明甘蔗拥有CMS和拯救所需的所有分子机制。提出了一种基于引导RNA的跨染色体剪接机制。我们还证明有丝分裂基因组可用于对甘蔗品种进行系统发育研究。