• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DE-kupl:通过 k-mer 分解实现 RNA-seq 数据中生物变异的全面捕获。

DE-kupl: exhaustive capture of biological variation in RNA-seq data through k-mer decomposition.

机构信息

INSERM U1183 IRMB, Université de Montpellier, Hopital St Eloi, 80 avenue Augustin Fliche, Montpellier, 34295, France.

Institut de Biologie Computationnelle, Université Montpellier, Montpellier, France.

出版信息

Genome Biol. 2017 Dec 28;18(1):243. doi: 10.1186/s13059-017-1372-2.

DOI:10.1186/s13059-017-1372-2
PMID:29284518
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5747171/
Abstract

We introduce a k-mer-based computational protocol, DE-kupl, for capturing local RNA variation in a set of RNA-seq libraries, independently of a reference genome or transcriptome. DE-kupl extracts all k-mers with differential abundance directly from the raw data files. This enables the retrieval of virtually all variation present in an RNA-seq data set. This variation is subsequently assigned to biological events or entities such as differential long non-coding RNAs, splice and polyadenylation variants, introns, repeats, editing or mutation events, and exogenous RNA. Applying DE-kupl to human RNA-seq data sets identified multiple types of novel events, reproducibly across independent RNA-seq experiments.

摘要

我们介绍了一种基于 k-mer 的计算方案 DE-kupl,用于在一组 RNA-seq 文库中捕获局部 RNA 变异,而无需参考基因组或转录组。DE-kupl 从原始数据文件中直接提取具有差异丰度的所有 k-mer。这使得几乎可以检索到 RNA-seq 数据集中存在的所有变异。随后,这些变异被分配到生物事件或实体中,例如差异长非编码 RNA、剪接和多聚腺苷酸化变体、内含子、重复、编辑或突变事件以及外源性 RNA。将 DE-kupl 应用于人类 RNA-seq 数据集,在多个独立的 RNA-seq 实验中可重复鉴定多种类型的新型事件。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/660b44804257/13059_2017_1372_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/80202f9d40fb/13059_2017_1372_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/1a8c438fdcc0/13059_2017_1372_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/aec909112bd0/13059_2017_1372_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/fa9b129ff367/13059_2017_1372_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/34653c0946ed/13059_2017_1372_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/bd1bc109ecde/13059_2017_1372_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/abf48555672e/13059_2017_1372_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/660b44804257/13059_2017_1372_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/80202f9d40fb/13059_2017_1372_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/1a8c438fdcc0/13059_2017_1372_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/aec909112bd0/13059_2017_1372_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/fa9b129ff367/13059_2017_1372_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/34653c0946ed/13059_2017_1372_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/bd1bc109ecde/13059_2017_1372_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/abf48555672e/13059_2017_1372_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b652/5747171/660b44804257/13059_2017_1372_Fig8_HTML.jpg

相似文献

1
DE-kupl: exhaustive capture of biological variation in RNA-seq data through k-mer decomposition.DE-kupl:通过 k-mer 分解实现 RNA-seq 数据中生物变异的全面捕获。
Genome Biol. 2017 Dec 28;18(1):243. doi: 10.1186/s13059-017-1372-2.
2
PLEK: a tool for predicting long non-coding RNAs and messenger RNAs based on an improved k-mer scheme.PLEK:一种基于改进的k-mer方案预测长链非编码RNA和信使RNA的工具。
BMC Bioinformatics. 2014 Sep 19;15(1):311. doi: 10.1186/1471-2105-15-311.
3
Characterizing and annotating the genome using RNA-seq data.利用RNA测序数据对基因组进行特征描述和注释。
Sci China Life Sci. 2017 Feb;60(2):116-125. doi: 10.1007/s11427-015-0349-4. Epub 2016 Jun 13.
4
Using RNentropy to Detect Significant Variation in Gene Expression Across Multiple RNA-Seq or Single-Cell RNA-Seq Samples.使用 RNentropy 检测多个 RNA-Seq 或单细胞 RNA-Seq 样本中基因表达的显著变化。
Methods Mol Biol. 2021;2284:77-96. doi: 10.1007/978-1-0716-1307-8_6.
5
SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.SPARTA:用于基于参考的细菌RNA测序转录组自动分析的简单程序。
BMC Bioinformatics. 2016 Feb 4;17:66. doi: 10.1186/s12859-016-0923-y.
6
FLAME: long-read bioinformatics tool for comprehensive spliceome characterization.FLAME:用于全面剪接体表征的长读长生物信息学工具。
RNA. 2021 Oct;27(10):1127-1139. doi: 10.1261/rna.078800.121. Epub 2021 Jul 12.
7
2-kupl: mapping-free variant detection from DNA-seq data of matched samples.2-kupl:从配对样本的 DNA 测序数据中进行无映射变异检测。
BMC Bioinformatics. 2021 Jun 5;22(1):304. doi: 10.1186/s12859-021-04185-6.
8
De Novo Plant Transcriptome Assembly and Annotation Using Illumina RNA-Seq Reads.利用Illumina RNA测序读数进行从头植物转录组组装与注释
Methods Mol Biol. 2019;1933:265-275. doi: 10.1007/978-1-4939-9045-0_16.
9
Key Anti-Fibrosis Associated Long Noncoding RNAs Identified in Human Hepatic Stellate Cell via Transcriptome Sequencing Analysis.通过转录组测序分析鉴定人肝星状细胞中关键的抗纤维化相关长非编码 RNA。
Int J Mol Sci. 2018 Feb 27;19(3):675. doi: 10.3390/ijms19030675.
10
Nuclear RNA Isolation and Sequencing.核 RNA 分离与测序。
Methods Mol Biol. 2021;2372:75-83. doi: 10.1007/978-1-0716-1697-0_8.

引用本文的文献

1
Pseudoassembly of k-mers.k-mer的伪组装
bioRxiv. 2025 May 13:2025.05.11.653354. doi: 10.1101/2025.05.11.653354.
2
Distribution rules of 8-mer spectra and characterization of evolution state in animal genome sequences.8 元组谱的分布规律及动物基因组序列进化状态的特征。
BMC Genomics. 2024 Sep 12;25(1):855. doi: 10.1186/s12864-024-10786-1.
3
Reference-free inferring of transcriptomic events in cancer cells on single-cell data.无参考转录事件推断在单细胞数据中的癌细胞。

本文引用的文献

1
The European Nucleotide Archive in 2017.欧洲核苷酸档案库,2017 年。
Nucleic Acids Res. 2018 Jan 4;46(D1):D36-D40. doi: 10.1093/nar/gkx1125.
2
Evaluation and comparison of computational tools for RNA-seq isoform quantification.RNA测序异构体定量计算工具的评估与比较
BMC Genomics. 2017 Aug 7;18(1):583. doi: 10.1186/s12864-017-4002-1.
3
Gaining comprehensive biological insight into the transcriptome by performing a broad-spectrum RNA-seq analysis.通过进行广谱RNA测序分析,全面深入了解转录组的生物学特性。
BMC Cancer. 2024 May 20;24(1):607. doi: 10.1186/s12885-024-12331-5.
4
KaMRaT: a C++ toolkit for k-mer count matrix dimension reduction.KaMRaT:一种用于 k-mer 计数矩阵降维的 C++工具包。
Bioinformatics. 2024 Mar 4;40(3). doi: 10.1093/bioinformatics/btae090.
5
Comparative genome analysis using sample-specific string detection in accurate long reads.在准确的长读段中使用样本特异性字符串检测进行比较基因组分析。
Bioinform Adv. 2021 May 31;1(1):vbab005. doi: 10.1093/bioadv/vbab005. eCollection 2021.
6
kmtricks: efficient and flexible construction of Bloom filters for large sequencing data collections.kmtricks:用于大型测序数据集的布隆过滤器的高效灵活构建
Bioinform Adv. 2022 Apr 29;2(1):vbac029. doi: 10.1093/bioadv/vbac029. eCollection 2022.
7
SPRISS: approximating frequent k-mers by sampling reads, and applications.SPRISS:通过读取采样来近似频繁的 k-mers 及其应用。
Bioinformatics. 2022 Jun 27;38(13):3343-3350. doi: 10.1093/bioinformatics/btac180.
8
The contribution of uncharted RNA sequences to tumor identity in lung adenocarcinoma.未知RNA序列对肺腺癌肿瘤特征的贡献。
NAR Cancer. 2022 Feb 1;4(1):zcac001. doi: 10.1093/narcan/zcac001. eCollection 2022 Mar.
9
MINTIE: identifying novel structural and splice variants in transcriptomes using RNA-seq data.MINTIE:利用 RNA-seq 数据鉴定转录组中的新型结构和剪接变体。
Genome Biol. 2021 Oct 22;22(1):296. doi: 10.1186/s13059-021-02507-8.
10
Impact of human gene annotations on RNA-seq differential expression analysis.人类基因注释对 RNA-seq 差异表达分析的影响。
BMC Genomics. 2021 Oct 8;22(1):730. doi: 10.1186/s12864-021-08038-7.
Nat Commun. 2017 Jul 5;8(1):59. doi: 10.1038/s41467-017-00050-4.
4
The Landscape of Isoform Switches in Human Cancers.人类癌症中异构体切换的全景。
Mol Cancer Res. 2017 Sep;15(9):1206-1220. doi: 10.1158/1541-7786.MCR-16-0459. Epub 2017 Jun 5.
5
IRFinder: assessing the impact of intron retention on mammalian gene expression.IRFinder:评估内含子保留对哺乳动物基因表达的影响。
Genome Biol. 2017 Mar 15;18(1):51. doi: 10.1186/s13059-017-1184-4.
6
Salmon provides fast and bias-aware quantification of transcript expression.鲑鱼提供快速且无偏倚的转录本表达定量。
Nat Methods. 2017 Apr;14(4):417-419. doi: 10.1038/nmeth.4197. Epub 2017 Mar 6.
7
Rail-RNA: scalable analysis of RNA-seq splicing and coverage.Rail-RNA:用于 RNA-seq 剪接和覆盖度分析的可扩展分析方法。
Bioinformatics. 2017 Dec 15;33(24):4033-4040. doi: 10.1093/bioinformatics/btw575.
8
Fast genotyping of known SNPs through approximate k-mer matching.通过近似k-mer匹配对已知单核苷酸多态性进行快速基因分型。
Bioinformatics. 2016 Sep 1;32(17):i538-i544. doi: 10.1093/bioinformatics/btw460.
9
Mash: fast genome and metagenome distance estimation using MinHash.Mash:使用MinHash进行快速的基因组和宏基因组距离估计。
Genome Biol. 2016 Jun 20;17(1):132. doi: 10.1186/s13059-016-0997-x.
10
A benchmark for RNA-seq quantification pipelines.RNA测序定量流程的一个基准。
Genome Biol. 2016 Apr 23;17:74. doi: 10.1186/s13059-016-0940-1.