A long-read RNA-seq approach to identify novel transcripts of very large genes.

Affiliations

Center for Genetic Medicine Research, Children's Research Institute, Children's National Health System, Washington, D.C. 20010, USA.

Department of Genomics and Precision Medicine, The George Washington University School of Medicine and Health Sciences, Washington, D.C. 20052, USA.

Publication information

Genome Res. 2020 Jun;30(6):885-897. doi: 10.1101/gr.259903.119. Epub 2020 Jul 6.

DOI:10.1101/gr.259903.119

PMID:32660935

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7370890/

Abstract

RNA-seq is widely used for studying gene expression, but commonly used sequencing platforms produce short reads that only span up to two exon junctions per read. This makes it difficult to accurately determine the composition and phasing of exons within transcripts. Although long-read sequencing improves this issue, it is not amenable to precise quantitation, which limits its utility for differential expression studies. We used long-read isoform sequencing combined with a novel analysis approach to compare alternative splicing of large, repetitive structural genes in muscles. Analysis of muscle structural genes that produce medium (5 kb), large (22 kb), and very large (106 kb) transcripts in cardiac muscle, and fast and slow skeletal muscles identified unannotated exons for each of these ubiquitous muscle genes. This also identified differential exon usage and phasing for these genes between the different muscle types. By mapping the in-phase transcript structures to known annotations, we also identified and quantified previously unannotated transcripts. Results were confirmed by endpoint PCR and Sanger sequencing, which revealed muscle-type-specific differential expression of these novel transcripts. The improved transcript identification and quantification shown by our approach removes previous impediments to studies aimed at quantitative differential expression of ultralong transcripts.
