Department of Bioinformatics and Genomics, University of North Carolina, Kannapolis, North Carolina 28081, USA.
Plant Physiol. 2013 Jun;162(2):1092-109. doi: 10.1104/pp.112.211441. Epub 2013 Apr 16.
Pollen grains of Arabidopsis (Arabidopsis thaliana) contain two haploid sperm cells enclosed in a haploid vegetative cell. Upon germination, the vegetative cell extrudes a pollen tube that carries the sperm to an ovule for fertilization. Knowing the identity, relative abundance, and splicing patterns of pollen transcripts will improve our understanding of pollen and allow investigation of tissue-specific splicing in plants. Most Arabidopsis pollen transcriptome studies have used the ATH1 microarray, which does not assay splice variants and lacks specific probe sets for many genes. To investigate the pollen transcriptome, we performed high-throughput sequencing (RNA-Seq) of Arabidopsis pollen and seedlings for comparison. Gene expression was more diverse in seedling, and genes involved in cell wall biogenesis were highly expressed in pollen. RNA-Seq detected at least 4,172 protein-coding genes expressed in pollen, including 289 assayed only by nonspecific probe sets. Additional exons and previously unannotated 5' and 3' untranslated regions for pollen-expressed genes were revealed. We detected regions in the genome not previously annotated as expressed; 14 were tested and 12 were confirmed by polymerase chain reaction. Gapped read alignments revealed 1,908 high-confidence new splicing events supported by 10 or more spliced read alignments. Alternative splicing patterns in pollen and seedling were highly correlated. For most alternatively spliced genes, the ratio of variants in pollen and seedling was similar, except for some encoding proteins involved in RNA splicing. This study highlights the robustness of splicing patterns in plants and the importance of ongoing annotation and visualization of RNA-Seq data using interactive tools such as Integrated Genome Browser.
拟南芥(Arabidopsis thaliana)的花粉粒包含两个被单倍体营养细胞包裹的单倍体精子细胞。花粉粒萌发后,营养细胞伸出花粉管,将精子输送到胚珠进行受精。了解花粉的基因身份、相对丰度和剪接模式,将有助于我们深入了解花粉,并能对植物组织中的特异性剪接进行研究。大多数拟南芥花粉转录组研究都使用了 ATH1 微阵列,它不能检测剪接变体,并且缺乏许多基因的特定探针集。为了研究花粉转录组,我们对拟南芥花粉和幼苗进行了高通量测序(RNA-Seq)比较。在幼苗中基因表达更为多样化,而细胞壁生物发生相关基因在花粉中高度表达。RNA-Seq 检测到至少 4172 个在花粉中表达的蛋白质编码基因,其中 289 个仅由非特异性探针集检测到。在花粉表达基因中发现了额外的外显子和以前未注释的 5'和 3'非翻译区。我们检测到了基因组中以前未注释为表达的区域;其中 14 个经过了测试,12 个通过聚合酶链反应得到了验证。缺口读取比对揭示了 1908 个高可信度的新剪接事件,这些事件得到了 10 个或更多剪接读取比对的支持。花粉和幼苗中的可变剪接模式高度相关。对于大多数可变剪接基因,花粉和幼苗中变体的比例相似,但有些编码参与 RNA 剪接的蛋白质的基因除外。这项研究强调了植物中剪接模式的稳健性,以及使用交互式工具(如 Integrated Genome Browser)对 RNA-Seq 数据进行持续注释和可视化的重要性。