Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs.

Affiliation

Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.

Publication information

Nat Biotechnol. 2010 May;28(5):503-10. doi: 10.1038/nbt.1633. Epub 2010 May 2.

DOI: 10.1038/nbt.1633

PMID: 20436462

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2868100/

Abstract

Massively parallel cDNA sequencing (RNA-Seq) provides an unbiased way to study a transcriptome, including both coding and noncoding genes. Until now, most RNA-Seq studies have depended crucially on existing annotations and thus focused on expression levels and variation in known transcripts. Here, we present Scripture, a method to reconstruct the transcriptome of a mammalian cell using only RNA-Seq reads and the genome sequence. We applied it to mouse embryonic stem cells, neuronal precursor cells and lung fibroblasts to accurately reconstruct the full-length gene structures for most known expressed genes. We identified substantial variation in protein coding genes, including thousands of novel 5' start sites, 3' ends and internal coding exons. We then determined the gene structures of more than a thousand large intergenic noncoding RNA (lincRNA) and antisense loci. Our results open the way to direct experimental manipulation of thousands of noncoding RNAs and demonstrate the power of ab initio reconstruction to render a comprehensive picture of mammalian transcriptomes.
