Suppr超能文献

微流控异构体测序显示人类转录组中广泛的剪接协调。

Microfluidic isoform sequencing shows widespread splicing coordination in the human transcriptome.

机构信息

Brain and Mind Research Institute, Weill Cornell Medicine, New York, New York 10021, USA.

Department of Genetics, Stanford University, Stanford, California 94304, USA.

出版信息

Genome Res. 2018 Feb;28(2):231-242. doi: 10.1101/gr.230516.117. Epub 2017 Dec 1.

Abstract

Understanding transcriptome complexity is crucial for understanding human biology and disease. Technologies such as Synthetic long-read RNA sequencing (SLR-RNA-seq) delivered 5 million isoforms and allowed assessing splicing coordination. Pacific Biosciences and Oxford Nanopore increase throughput also but require high input amounts or amplification. Our new droplet-based method, sparse isoform sequencing (spISO-seq), sequences 100k-200k partitions of 10-200 molecules at a time, enabling analysis of 10-100 million RNA molecules. SpISO-seq requires less than 1 ng of input cDNA, limiting or removing the need for prior amplification with its associated biases. Adjusting the number of reads devoted to each molecule reduces sequencing lanes and cost, with little loss in detection power. The increased number of molecules expands our understanding of isoform complexity. In addition to confirming our previously published cases of splicing coordination (e.g., ), the greater depth reveals many new cases, such as Coordination of internal exons is found to be extensive among protein coding genes: 23.5%-59.3% (95% confidence interval) of highly expressed genes with distant alternative exons exhibit coordination, showcasing the need for long-read transcriptomics. However, coordination is less frequent for noncoding sequences, suggesting a larger role of splicing coordination in shaping proteins. Groups of genes with coordination are involved in protein-protein interactions with each other, raising the possibility that coordination facilitates complex formation and/or function. We also find new splicing coordination types, involving initial and terminal exons. Our results provide a more comprehensive understanding of the human transcriptome and a general, cost-effective method to analyze it.

摘要

理解转录组的复杂性对于理解人类生物学和疾病至关重要。Synthetic long-read RNA sequencing (SLR-RNA-seq) 等技术可以提供 500 万个异构体,并评估剪接协调。Pacific Biosciences 和 Oxford Nanopore 也提高了通量,但需要高输入量或扩增。我们的新基于液滴的方法 sparse isoform sequencing (spISO-seq) 一次可以对 10-200 个分子的 100k-200k 分区进行测序,能够分析 10-1000 万个 RNA 分子。spISO-seq 需要的输入 cDNA 少于 1ng,限制或消除了与扩增相关的偏倚的先前扩增的需要。调整用于每个分子的读取次数可以减少测序通道和成本,而不会降低检测能力。分子数量的增加扩展了我们对异构体复杂性的理解。除了确认我们之前发表的剪接协调案例(例如,)外,更大的深度揭示了许多新的案例,例如 Coordination of internal exons is found to be extensive among protein coding genes: 23.5%-59.3% (95% confidence interval) of highly expressed genes with distant alternative exons exhibit coordination, showcasing the need for long-read transcriptomics. However, coordination is less frequent for noncoding sequences, suggesting a larger role of splicing coordination in shaping proteins. Groups of genes with coordination are involved in protein-protein interactions with each other, raising the possibility that coordination facilitates complex formation and/or function. We also find new splicing coordination types, involving initial and terminal exons. Our results provide a more comprehensive understanding of the human transcriptome and a general, cost-effective method to analyze it.

在蛋白质编码基因中,内部外显子的协调被发现是广泛存在的:具有远距离替代外显子的高度表达基因中,有 23.5%-59.3%(95%置信区间)表现出协调,这展示了长读转录组学的必要性。然而,对于非编码序列,协调的频率较低,这表明剪接协调在塑造蛋白质方面起着更大的作用。具有协调作用的基因组相互参与蛋白质-蛋白质相互作用,这增加了协调促进复合物形成和/或功能的可能性。我们还发现了涉及初始和末端外显子的新剪接协调类型。我们的结果提供了对人类转录组的更全面理解,以及一种通用的、具有成本效益的分析方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90d4/5793787/b266a0af0801/231f01.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验