Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA.
Pacific Biosciences, Inc., Menlo Park, CA, 94025, USA.
Nat Commun. 2023 Oct 30;14(1):6902. doi: 10.1038/s41467-023-42558-y.
Human preimplantation development involves extensive remodeling of RNA expression and splicing. However, its transcriptome has been compiled using short-read sequencing data, which fails to capture most full-length mRNAs. Here, we generate an isoform-resolved transcriptome of early human development by performing long- and short-read RNA sequencing on 73 embryos spanning the zygote to blastocyst stages. We identify 110,212 unannotated isoforms transcribed from known genes, including highly conserved protein-coding loci and key developmental regulators. We further identify 17,964 isoforms from 5,239 unannotated genes, which are largely non-coding, primate-specific, and highly associated with transposable elements. These isoforms are widely supported by the integration of published multi-omics datasets, including single-cell 8CLC and blastoid studies. Alternative splicing and gene co-expression network analyses further reveal that embryonic genome activation is associated with splicing disruption and transient upregulation of gene modules. Together, these findings show that the human embryo transcriptome is far more complex than currently known, and will act as a valuable resource to empower future studies exploring development.
人类胚胎着床前的发育涉及到 RNA 表达和剪接的广泛重构。然而,其转录组是使用短读测序数据构建的,这些数据无法捕获大多数全长 mRNAs。在这里,我们通过对 73 个胚胎进行长读长和短读长 RNA 测序,跨越了从受精卵到囊胚阶段,生成了一个具有亚型分辨率的人类早期发育转录组。我们从已知基因中鉴定出 110212 个未注释的亚型,包括高度保守的蛋白质编码基因座和关键的发育调控因子。我们进一步从 5239 个未注释的基因中鉴定出 17964 个亚型,这些亚型主要是非编码的、灵长类特有的,并且与转座元件高度相关。这些亚型通过整合已发表的多组学数据集,包括单细胞 8CLC 和 Blastoid 研究,得到了广泛的支持。可变剪接和基因共表达网络分析进一步揭示了胚胎基因组激活与剪接破坏和基因模块的瞬时上调有关。总之,这些发现表明人类胚胎转录组远比目前已知的复杂,并且将成为一个有价值的资源,为未来探索发育的研究提供支持。