• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

充分利用RNA测序:使用负鼠预处理测序数据以进行可靠的单核苷酸多态性(SNP)变体检测。

Making the most of RNA-seq: Pre-processing sequencing data with Opossum for reliable SNP variant detection.

作者信息

Oikkonen Laura, Lise Stefano

机构信息

Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK.

Centre for Evolution and Cancer, The Institute of Cancer Research, Sutton, UK.

出版信息

Wellcome Open Res. 2017 Jan 17;2:6. doi: 10.12688/wellcomeopenres.10501.2.

DOI:10.12688/wellcomeopenres.10501.2
PMID:28239666
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5322827/
Abstract

Identifying variants from RNA-seq (transcriptome sequencing) data is a cost-effective and versatile alternative to whole-genome sequencing. However, current variant callers do not generally behave well with RNA-seq data due to reads encompassing intronic regions. We have developed a software programme called Opossum to address this problem. Opossum pre-processes RNA-seq reads prior to variant calling, and although it has been designed to work specifically with Platypus, it can be used equally well with other variant callers such as GATK HaplotypeCaller. In this work, we show that using Opossum in conjunction with either Platypus or GATK HaplotypeCaller maintains precision and improves the sensitivity for SNP detection compared to the GATK Best Practices pipeline. In addition, using it in combination with Platypus offers a substantial reduction in run times compared to the GATK pipeline so it is ideal when there are only limited time or computational resources available.

摘要

从RNA测序(转录组测序)数据中识别变异体是一种经济高效且通用的全基因组测序替代方法。然而,由于读取的序列包含内含子区域,当前的变异体调用程序在处理RNA测序数据时通常表现不佳。我们开发了一个名为负鼠(Opossum)的软件程序来解决这个问题。负鼠在进行变异体调用之前会对RNA测序读取的数据进行预处理,虽然它是专门为与鸭嘴兽(Platypus)一起使用而设计的,但它也可以与其他变异体调用程序(如GATK HaplotypeCaller)同样出色地配合使用。在这项工作中,我们表明,与GATK最佳实践流程相比,将负鼠与鸭嘴兽或GATK HaplotypeCaller结合使用可保持精度并提高单核苷酸多态性(SNP)检测的灵敏度。此外,与GATK流程相比,将其与鸭嘴兽结合使用可大幅减少运行时间,因此在时间或计算资源有限时非常理想。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dea1/5446017/85c93db0a4fc/wellcomeopenres-2-12053-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dea1/5446017/a9e1feedee00/wellcomeopenres-2-12053-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dea1/5446017/b06ce84ac2b0/wellcomeopenres-2-12053-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dea1/5446017/85c93db0a4fc/wellcomeopenres-2-12053-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dea1/5446017/a9e1feedee00/wellcomeopenres-2-12053-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dea1/5446017/b06ce84ac2b0/wellcomeopenres-2-12053-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dea1/5446017/85c93db0a4fc/wellcomeopenres-2-12053-g0002.jpg

相似文献

1
Making the most of RNA-seq: Pre-processing sequencing data with Opossum for reliable SNP variant detection.充分利用RNA测序:使用负鼠预处理测序数据以进行可靠的单核苷酸多态性(SNP)变体检测。
Wellcome Open Res. 2017 Jan 17;2:6. doi: 10.12688/wellcomeopenres.10501.2.
2
Detailed comparison of two popular variant calling packages for exome and targeted exon studies.详细比较两种用于外显子组和靶向外显子研究的流行变异调用包。
PeerJ. 2014 Sep 30;2:e600. doi: 10.7717/peerj.600. eCollection 2014.
3
The GATK joint genotyping workflow is appropriate for calling variants in RNA-seq experiments.GATK联合基因分型工作流程适用于在RNA测序实验中检测变异。
J Anim Sci Biotechnol. 2019 Jun 21;10:44. doi: 10.1186/s40104-019-0359-0. eCollection 2019.
4
An analytical workflow for accurate variant discovery in highly divergent regions.一种用于在高度分化区域进行准确变异发现的分析流程。
BMC Genomics. 2016 Sep 2;17(1):703. doi: 10.1186/s12864-016-3045-z.
5
OVarFlow: a resource optimized GATK 4 based Open source Variant calling workFlow.OVarFlow:一种基于资源优化的 GATK4 的开源变异调用工作流程。
BMC Bioinformatics. 2021 Aug 13;22(1):402. doi: 10.1186/s12859-021-04317-y.
6
Impact of post-alignment processing in variant discovery from whole exome data.全外显子数据变异发现中比对后处理的影响
BMC Bioinformatics. 2016 Oct 3;17(1):403. doi: 10.1186/s12859-016-1279-z.
7
Variant Calling from RNA-seq Data Using the GATK Joint Genotyping Workflow.使用 GATK 联合基因分型工作流程进行 RNA-seq 数据的变异调用。
Methods Mol Biol. 2022;2493:205-233. doi: 10.1007/978-1-0716-2293-3_13.
8
Benchmarking variant callers in next-generation and third-generation sequencing analysis.在新一代和第三代测序分析中对变异调用程序进行基准测试。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa148.
9
Comparison of GATK and DeepVariant by trio sequencing.基于 trio 测序的 GATK 和 DeepVariant 比较。
Sci Rep. 2022 Feb 2;12(1):1809. doi: 10.1038/s41598-022-05833-4.
10
Halvade somatic: Somatic variant calling with Apache Spark.半体变异体调用:基于 Apache Spark 的半体变异体调用。
Gigascience. 2022 Jan 12;11(1). doi: 10.1093/gigascience/giab094.

引用本文的文献

1
A pan-cancer atlas of therapeutic T cell targets.一份治疗性T细胞靶点的泛癌图谱。
bioRxiv. 2025 Jul 7:2025.01.22.634237. doi: 10.1101/2025.01.22.634237.
2
No Evidence for Distinct Transcriptomic Subgroups of Devil Facial Tumor Disease (DFTD).无证据表明袋獾面部肿瘤病(DFTD)存在不同的转录组亚群。
Evol Appl. 2025 Apr 1;18(4):e70091. doi: 10.1111/eva.70091. eCollection 2025 Apr.
3
Polyploidization-driven transcriptomic dynamics in Medicago sativa neotetraploids: mRNA, smRNA and allele-specific gene expression.紫花苜蓿新四倍体中多倍体化驱动的转录组动态变化:mRNA、小RNA和等位基因特异性基因表达

本文引用的文献

1
Improving genetic diagnosis in Mendelian disease with transcriptome sequencing.通过转录组测序改善孟德尔疾病的基因诊断。
Sci Transl Med. 2017 Apr 19;9(386). doi: 10.1126/scitranslmed.aal5209.
2
Indel detection from RNA-seq data: tool evaluation and strategies for accurate detection of actionable mutations.从 RNA-seq 数据中检测插入缺失:工具评估和准确检测可操作突变的策略。
Brief Bioinform. 2017 Nov 1;18(6):973-983. doi: 10.1093/bib/bbw069.
3
HISAT: a fast spliced aligner with low memory requirements.HISAT:一种内存需求低的快速剪接比对器。
BMC Plant Biol. 2025 Jan 25;25(1):108. doi: 10.1186/s12870-025-06090-z.
4
Chemoproteogenomic stratification of the missense variant cysteinome.错义变异半胱氨酸组的化学蛋白质基因组分层分析。
Nat Commun. 2024 Oct 28;15(1):9284. doi: 10.1038/s41467-024-53520-x.
5
RNA-DNA differences in variant calls from cattle tissues result in erroneous eQTLs.来自牛组织的变异调用中的 RNA-DNA 差异导致错误的 eQTLs。
BMC Genomics. 2024 Aug 1;25(1):750. doi: 10.1186/s12864-024-10645-z.
6
A deep-learning-based RNA-seq germline variant caller.一种基于深度学习的RNA测序种系变异检测工具。
Bioinform Adv. 2023 Jun 13;3(1):vbad062. doi: 10.1093/bioadv/vbad062. eCollection 2023.
7
Striated muscle-specific base editing enables correction of mutations causing dilated cardiomyopathy.横纹肌特异性碱基编辑可纠正导致扩张型心肌病的突变。
Nat Commun. 2023 Jun 22;14(1):3714. doi: 10.1038/s41467-023-39352-1.
8
RNA-Seq-Pop: Exploiting the sequence in RNA sequencing-A Snakemake workflow reveals patterns of insecticide resistance in the malaria vector Anopheles gambiae.RNA-Seq-Pop:利用 RNA 测序中的序列——Snakemake 工作流程揭示疟疾传播媒介冈比亚按蚊的杀虫剂抗性模式。
Mol Ecol Resour. 2023 May;23(4):946-961. doi: 10.1111/1755-0998.13759. Epub 2023 Feb 10.
9
Antigens Expressed by Breast Cancer Cells Undergoing EMT Stimulate Cytotoxic CD8 T Cell Immunity.经历上皮-间质转化的乳腺癌细胞所表达的抗原刺激细胞毒性CD8 T细胞免疫。
Cancers (Basel). 2022 Sep 9;14(18):4397. doi: 10.3390/cancers14184397.
10
Identifying plant genes shaping microbiota composition in the barley rhizosphere.鉴定影响大麦根际微生物群落组成的植物基因。
Nat Commun. 2022 Jun 16;13(1):3443. doi: 10.1038/s41467-022-31022-y.
Nat Methods. 2015 Apr;12(4):357-60. doi: 10.1038/nmeth.3317. Epub 2015 Mar 9.
4
The eSNV-detect: a computational system to identify expressed single nucleotide variants from transcriptome sequencing data.eSNV-detect:一种从转录组测序数据中识别表达的单核苷酸变异的计算系统。
Nucleic Acids Res. 2014 Dec 16;42(22):e172. doi: 10.1093/nar/gku1005. Epub 2014 Oct 28.
5
Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications.整合基于图谱、组装和单倍型的方法以在临床测序应用中进行变异检测。
Nat Genet. 2014 Aug;46(8):912-918. doi: 10.1038/ng.3036. Epub 2014 Jul 13.
6
Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls.整合人类序列数据集提供了一个基准 SNP 和 indel 基因型调用资源。
Nat Biotechnol. 2014 Mar;32(3):246-51. doi: 10.1038/nbt.2835. Epub 2014 Feb 16.
7
Consistent errors in first strand cDNA due to random hexamer mispriming.由于随机六聚体错配引发的第一链cDNA中的一致性错误。
PLoS One. 2013 Dec 30;8(12):e85583. doi: 10.1371/journal.pone.0085583. eCollection 2013.
8
Systematic evaluation of spliced alignment programs for RNA-seq data.系统评估 RNA-seq 数据拼接比对程序。
Nat Methods. 2013 Dec;10(12):1185-91. doi: 10.1038/nmeth.2722. Epub 2013 Nov 3.
9
RADAR: a rigorously annotated database of A-to-I RNA editing.RADAR:一个经过严格注释的 A 到 I RNA 编辑数据库。
Nucleic Acids Res. 2014 Jan;42(Database issue):D109-13. doi: 10.1093/nar/gkt996. Epub 2013 Oct 25.
10
Reliable identification of genomic variants from RNA-seq data.从 RNA-seq 数据中可靠地识别基因组变异。
Am J Hum Genet. 2013 Oct 3;93(4):641-51. doi: 10.1016/j.ajhg.2013.08.008. Epub 2013 Sep 26.