• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过最小独特长度分析对下一代测序进行高效且全面的独特性表示。

Efficient and comprehensive representation of uniqueness for next-generation sequencing by minimum unique length analyses.

机构信息

Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden.

出版信息

PLoS One. 2013;8(1):e53822. doi: 10.1371/journal.pone.0053822. Epub 2013 Jan 18.

DOI:10.1371/journal.pone.0053822
PMID:23349747
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3548888/
Abstract

As next generation sequencing technologies are getting more efficient and less expensive, RNA-Seq is becoming a widely used technique for transcriptome studies. Computational analysis of RNA-Seq data often starts with the mapping of millions of short reads back to the genome or transcriptome, a process in which some reads are found to map equally well to multiple genomic locations (multimapping reads). We have developed the Minimum Unique Length Tool (MULTo), a framework for efficient and comprehensive representation of mappability information, through identification of the shortest possible length required for each genomic coordinate to become unique in the genome and transcriptome. Using the minimum unique length information, we have compared different uniqueness compensation approaches for transcript expression level quantification and demonstrate that the best compensation is achieved by discarding multimapping reads and correctly adjusting gene model lengths. We have also explored uniqueness within specific regions of the mouse genome and enhancer mapping experiments. Finally, by making MULTo available to the community we hope to facilitate the use of uniqueness compensation in RNA-Seq analysis and to eliminate the need to make additional mappability files.

摘要

随着下一代测序技术变得越来越高效和廉价,RNA-Seq 正成为转录组研究中广泛使用的技术。RNA-Seq 数据的计算分析通常从将数百万个短读段映射回基因组或转录组开始,在此过程中,一些读段被发现可以同样好地映射到多个基因组位置(多重映射读段)。我们通过确定每个基因组坐标在基因组和转录组中变得唯一所需的最短可能长度,开发了最小唯一长度工具(MULTo),这是一种高效和全面表示可映射性信息的框架。利用最小唯一长度信息,我们比较了不同的唯一性补偿方法来定量转录表达水平,并证明通过丢弃多重映射读段并正确调整基因模型长度可以实现最佳补偿。我们还探索了小鼠基因组特定区域和增强子映射实验中的独特性。最后,通过向社区提供 MULTo,我们希望促进在 RNA-Seq 分析中使用唯一性补偿,并消除制作额外可映射性文件的需求。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41f9/3548888/8de26a88f180/pone.0053822.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41f9/3548888/0e34e17a726a/pone.0053822.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41f9/3548888/501e32203f03/pone.0053822.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41f9/3548888/27bf565dac0f/pone.0053822.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41f9/3548888/a0f2bf88bff6/pone.0053822.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41f9/3548888/8de26a88f180/pone.0053822.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41f9/3548888/0e34e17a726a/pone.0053822.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41f9/3548888/501e32203f03/pone.0053822.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41f9/3548888/27bf565dac0f/pone.0053822.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41f9/3548888/a0f2bf88bff6/pone.0053822.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41f9/3548888/8de26a88f180/pone.0053822.g005.jpg

相似文献

1
Efficient and comprehensive representation of uniqueness for next-generation sequencing by minimum unique length analyses.通过最小独特长度分析对下一代测序进行高效且全面的独特性表示。
PLoS One. 2013;8(1):e53822. doi: 10.1371/journal.pone.0053822. Epub 2013 Jan 18.
2
Next-generation sequencing facilitates quantitative analysis of wild-type and Nrl(-/-) retinal transcriptomes.新一代测序技术有助于对野生型和Nrl基因敲除小鼠视网膜转录组进行定量分析。
Mol Vis. 2011;17:3034-54. Epub 2011 Nov 23.
3
SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.SPARTA:用于基于参考的细菌RNA测序转录组自动分析的简单程序。
BMC Bioinformatics. 2016 Feb 4;17:66. doi: 10.1186/s12859-016-0923-y.
4
Efficient assembly and annotation of the transcriptome of catfish by RNA-Seq analysis of a doubled haploid homozygote.通过对双倍单倍体纯合子的 RNA-Seq 分析,高效组装和注释鲶鱼转录组。
BMC Genomics. 2012 Nov 5;13:595. doi: 10.1186/1471-2164-13-595.
5
Transcript Profiling Using Long-Read Sequencing Technologies.使用长读长测序技术进行转录本分析
Methods Mol Biol. 2018;1783:121-147. doi: 10.1007/978-1-4939-7834-2_6.
6
RNA-Seq: revelation of the messengers.RNA-Seq:信使的揭示。
Trends Plant Sci. 2013 Apr;18(4):175-9. doi: 10.1016/j.tplants.2013.02.001. Epub 2013 Mar 5.
7
Retinal transcriptome profiling by directional next-generation sequencing using 100 ng of total RNA.使用100纳克总RNA通过定向新一代测序进行视网膜转录组分析。
Methods Mol Biol. 2012;884:319-34. doi: 10.1007/978-1-61779-848-1_23.
8
QuickRNASeq lifts large-scale RNA-seq data analyses to the next level of automation and interactive visualization.QuickRNASeq将大规模RNA测序数据分析提升到了一个新的自动化和交互式可视化水平。
BMC Genomics. 2016 Jan 8;17:39. doi: 10.1186/s12864-015-2356-9.
9
Mapping RNA-seq Reads with STAR.使用STAR对RNA测序读数进行比对
Curr Protoc Bioinformatics. 2015 Sep 3;51:11.14.1-11.14.19. doi: 10.1002/0471250953.bi1114s51.
10
RNA Next-Generation Sequencing and a Bioinformatics Pipeline to Identify Expressed LINE-1s at the Locus-Specific Level.RNA 下一代测序及生物信息学流程以在基因座特异性水平鉴定表达的 LINE-1
J Vis Exp. 2019 May 19(147). doi: 10.3791/59771.

引用本文的文献

1
Characterizing crosstalk in epigenetic signaling to understand disease physiology.描述表观遗传信号转导中的串扰,以了解疾病生理学。
Biochem J. 2023 Jan 13;480(1):57-85. doi: 10.1042/BCJ20220550.
2
VSGs Expressed during Natural T. b. gambiense Infection Exhibit Extensive Sequence Divergence and a Subspecies-Specific Bias towards Type B N-Terminal Domains.在自然感染 T.b. 冈比亚锥虫期间表达的 VSGs 表现出广泛的序列差异,并且对 B 型 N 端结构域具有亚种特异性偏向。
mBio. 2022 Dec 20;13(6):e0255322. doi: 10.1128/mbio.02553-22. Epub 2022 Nov 10.
3
m5C-Atlas: a comprehensive database for decoding and annotating the 5-methylcytosine (m5C) epitranscriptome.

本文引用的文献

1
An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。
Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.
2
DNA methylome analysis using short bisulfite sequencing data.使用短重亚硫酸盐测序数据进行 DNA 甲基组分析。
Nat Methods. 2012 Jan 30;9(2):145-51. doi: 10.1038/nmeth.1828.
3
Fast computation and applications of genome mappability.基因组可及性的快速计算与应用。
m5C-Atlas:一个全面的数据库,用于解码和注释 5-甲基胞嘧啶(m5C)转录组内表观遗传学修饰。
Nucleic Acids Res. 2022 Jan 7;50(D1):D196-D203. doi: 10.1093/nar/gkab1075.
4
Targeting barrel field spiny stellate cells using a vesicular monoaminergic transporter 2-Cre mouse line.靶向桶状束星状细胞的囊泡单胺能转运体 2-Cre 小鼠品系。
Sci Rep. 2021 Feb 5;11(1):3239. doi: 10.1038/s41598-021-82649-8.
5
Spatio-molecular domains identified in the mouse subthalamic nucleus and neighboring glutamatergic and GABAergic brain structures.在小鼠苍白球和相邻谷氨酸能和 GABA 能脑结构中鉴定出的空间分子域。
Commun Biol. 2020 Jul 3;3(1):338. doi: 10.1038/s42003-020-1028-8.
6
Developmental competence and antigen switch frequency can be uncoupled in .在 中,发育能力和抗原转换频率可以解耦。
Proc Natl Acad Sci U S A. 2019 Nov 5;116(45):22774-22782. doi: 10.1073/pnas.1912711116. Epub 2019 Oct 21.
7
Paired-end mappability of transposable elements in the human genome.人类基因组中转座元件的双端可映射性。
Mob DNA. 2019 Jul 10;10:29. doi: 10.1186/s13100-019-0172-5. eCollection 2019.
8
Molecular and functional heterogeneity of IL-10-producing CD4 T cells.IL-10 产生的 CD4 T 细胞的分子和功能异质性。
Nat Commun. 2018 Dec 21;9(1):5457. doi: 10.1038/s41467-018-07581-4.
9
Saccharomyces cerevisiae Metabolic Labeling with 4-thiouracil and the Quantification of Newly Synthesized mRNA As a Proxy for RNA Polymerase II Activity.用4-硫尿嘧啶对酿酒酵母进行代谢标记以及对新合成的mRNA进行定量以作为RNA聚合酶II活性的替代指标
J Vis Exp. 2018 Oct 22(140):57982. doi: 10.3791/57982.
10
Single-cell RNA sequencing of mouse brain and lung vascular and vessel-associated cell types.单细胞 RNA 测序分析小鼠脑和肺血管及血管相关细胞类型。
Sci Data. 2018 Aug 21;5:180160. doi: 10.1038/sdata.2018.160.
PLoS One. 2012;7(1):e30377. doi: 10.1371/journal.pone.0030377. Epub 2012 Jan 19.
4
The uniqueome: a mappability resource for short-tag sequencing.独特组学:短标签测序的可作图资源。
Bioinformatics. 2011 Jan 15;27(2):272-4. doi: 10.1093/bioinformatics/btq640. Epub 2010 Nov 12.
5
Accurate quantification of transcriptome from RNA-Seq data by effective length normalization.通过有效长度归一化对 RNA-Seq 数据进行转录组的精确定量。
Nucleic Acids Res. 2011 Jan;39(2):e9. doi: 10.1093/nar/gkq1015. Epub 2010 Nov 8.
6
Analysis and design of RNA sequencing experiments for identifying isoform regulation.RNA 测序实验分析与设计,用于鉴定异构体调控
Nat Methods. 2010 Dec;7(12):1009-15. doi: 10.1038/nmeth.1528. Epub 2010 Nov 7.
7
Alternative expression analysis by RNA sequencing.RNA 测序的替代表达分析。
Nat Methods. 2010 Oct;7(10):843-7. doi: 10.1038/nmeth.1503. Epub 2010 Sep 12.
8
Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.通过 RNA-Seq 进行转录本组装和定量分析揭示了细胞分化过程中未注释的转录本和异构体转换。
Nat Biotechnol. 2010 May;28(5):511-5. doi: 10.1038/nbt.1621. Epub 2010 May 2.
9
An abundance of ubiquitously expressed genes revealed by tissue transcriptome sequence data.组织转录组序列数据揭示了大量普遍表达的基因。
PLoS Comput Biol. 2009 Dec;5(12):e1000598. doi: 10.1371/journal.pcbi.1000598. Epub 2009 Dec 11.
10
Sequencing technologies - the next generation.测序技术——下一代。
Nat Rev Genet. 2010 Jan;11(1):31-46. doi: 10.1038/nrg2626. Epub 2009 Dec 8.