Muiruri Kariuki S, Britt Anne, Amugune Nelson O, Nguu Edward K, Chan Simon, Tripathi Leena
International Institute of Tropical AgricultureNairobi, Kenya.
School of Biological Sciences, University of NairobiNairobi, Kenya.
Front Plant Sci. 2017 Jun 29;8:1034. doi: 10.3389/fpls.2017.01034. eCollection 2017.
Centromeres are specified by a centromere specific histone 3 (CENH3) protein, which exists in a complex environment, interacting with conserved proteins and rapidly evolving satellite DNA sequences. The interactions may become more challenging if multiple CENH3 versions are introduced into the zygote as this can affect post-zygotic mitosis and ultimately sexual reproduction. Here, we characterize variant transcripts expressed in cultivated triploid and wild diploid progenitor bananas. We describe both splice- and allelic-[Single Nucleotide Polymorphisms (SNP)] variants and their effects on the predicted secondary structures of protein. Expressed transcripts from six banana genotypes were characterized and clustered into three groups (-1A, -1B, and -2) based on similarity. The groups differed with SNPs as well as presence of indels resulting from retained and/or skipped exons. The transcripts from different banana genotypes were spliced in either 7/6, 5/4 or 6/5 exons/introns. The 7/6 and the 5/4 exon/intron structures were found in both diploids and triploids, however, 7/6 was most predominant. The 6/5 exon/introns structure was a result of failure of the 7/6 to splice correctly. The various transcripts obtained were predicted to encode highly variable N-terminal tails and a relatively conserved C-terminal histone fold domain (HFD). The SNPs were predicted in some cases to affect the secondary structure of protein by lengthening or shorting the affected domains. Sequencing of banana transcripts predicts SNP variations that affect amino acid sequences and alternatively spliced transcripts. Most of these changes affect the N-terminal tail of CENH3.
着丝粒由着丝粒特异性组蛋白3(CENH3)蛋白指定,该蛋白存在于复杂环境中,与保守蛋白和快速进化的卫星DNA序列相互作用。如果将多个CENH3版本引入合子中,这种相互作用可能会变得更具挑战性,因为这会影响合子后的有丝分裂并最终影响有性繁殖。在这里,我们对栽培三倍体和野生二倍体祖先香蕉中表达的变异转录本进行了表征。我们描述了剪接变体和等位基因单核苷酸多态性(SNP)变体及其对预测的蛋白质二级结构的影响。对六种香蕉基因型的表达转录本进行了表征,并根据相似性将其聚类为三组(-1A、-1B和-2)。这些组在SNP以及由保留和/或跳过外显子导致的插入缺失的存在方面存在差异。不同香蕉基因型的转录本以7/6、5/4或6/5外显子/内含子进行剪接。7/6和5/4外显子/内含子结构在二倍体和三倍体中均有发现,然而,7/6最为常见。6/5外显子/内含子结构是7/6未能正确剪接的结果。预测获得的各种转录本编码高度可变的N末端尾巴和相对保守的C末端组蛋白折叠结构域(HFD)。在某些情况下,预测SNP会通过延长或缩短受影响的结构域来影响蛋白质的二级结构。香蕉转录本的测序预测了影响氨基酸序列和可变剪接转录本的SNP变异。这些变化大多影响CENH3的N末端尾巴。