Huang Yan, Xiong Yuanyan, Lin Zhuoheng, Feng Xuyang, Jiang Xue, Songyang Zhou, Huang Junjiu
Key Laboratory of Reproductive Medicine of Guangdong Province, the First Affiliated Hospital and Key Laboratory of Gene Engineering of the Ministry of Education, School of Life Sciences, Sun Yat-sen University, Guangzhou, 51000, China.
SYSU-CMU Shunde International Joint Research Institute, Shunde, China.
PLoS One. 2015 Dec 29;10(12):e0145417. doi: 10.1371/journal.pone.0145417. eCollection 2015.
A recently developed strategy of sequencing alternative polyadenylation (APA) sites (SAPAS) with second-generation sequencing technology can be used to explore complete genome-wide patterns of tandem APA sites and global gene expression profiles. spermatogonial stem cells (SSCs) maintain long-term reproductive abilities in male mammals. The detailed mechanisms by which SSCs self-renew and generate mature spermatozoa are not clear. To understand the specific alternative polyadenylation pattern and global gene expression profile of male germline stem cells (GSCs, mainly referred to SSCs here), we isolated and purified mouse Thy1+ cells from testis by magnetic-activated cell sorting (MACS) and then used the SAPAS method for analysis, using pluripotent embryonic stem cells (ESCs) and differentiated mouse embryonic fibroblast cells (MEFs) as controls. As a result, we obtained 99,944 poly(A) sites, approximately 40% of which were newly detected in our experiments. These poly(A) sites originated from three mouse cell types and covered 17,499 genes, including 831 long non-coding RNA (lncRNA) genes. We observed that GSCs tend to have shorter 3'UTR lengths while MEFs tend towards longer 3'UTR lengths. We also identified 1337 genes that were highly expressed in GSCs, and these genes were highly consistent with the functional characteristics of GSCs. Our detailed bioinformatics analysis identified APA site-switching events at 3'UTRs and many new specifically expressed genes in GSCs, which we experimentally confirmed. Furthermore, qRT-PCR was performed to validate several events of the 334 genes with distal-to-proximal poly(A) switch in GSCs. Consistently APA reporter assay confirmed the total 3'UTR shortening in GSCs compared to MEFs. We also analyzed the cis elements around the proximal poly(A) site preferentially used in GSCs and found C-rich elements may contribute to this regulation. Overall, our results identified the expression level and polyadenylation site profiles and these data provide new insights into the processes potentially involved in the GSC life cycle and spermatogenesis.
一种最近开发的利用第二代测序技术对可变聚腺苷酸化(APA)位点进行测序的策略(SAPAS),可用于探索全基因组范围内串联APA位点的完整模式以及全局基因表达谱。精原干细胞(SSCs)在雄性哺乳动物中维持长期生殖能力。SSCs自我更新并产生成熟精子的详细机制尚不清楚。为了解雄性生殖系干细胞(GSCs,此处主要指SSCs)的特定可变聚腺苷酸化模式和全局基因表达谱,我们通过磁激活细胞分选(MACS)从睾丸中分离并纯化了小鼠Thy1+细胞,然后以多能胚胎干细胞(ESCs)和分化的小鼠胚胎成纤维细胞(MEFs)作为对照,使用SAPAS方法进行分析。结果,我们获得了99,944个聚(A)位点,其中约40%是在我们的实验中新检测到的。这些聚(A)位点源自三种小鼠细胞类型,覆盖了17,499个基因,包括831个长链非编码RNA(lncRNA)基因。我们观察到GSCs倾向于具有较短的3'UTR长度,而MEFs倾向于具有较长的3'UTR长度。我们还鉴定出1337个在GSCs中高表达的基因,这些基因与GSCs的功能特征高度一致。我们详细的生物信息学分析确定了3'UTR处的APA位点切换事件以及GSCs中许多新的特异性表达基因,我们通过实验对其进行了证实。此外,进行了qRT-PCR以验证GSCs中334个基因从远端到近端聚(A)转换的几个事件。一致地,APA报告基因检测证实与MEFs相比,GSCs中总的3'UTR缩短。我们还分析了GSCs中优先使用的近端聚(A)位点周围的顺式元件,发现富含C的元件可能有助于这种调控。总体而言,我们的结果确定了表达水平和聚腺苷酸化位点谱,这些数据为GSC生命周期和精子发生过程中可能涉及的过程提供了新的见解。