• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 transIndel 从 DNA 和 RNA 测序数据中检测插入缺失

Indel detection from DNA and RNA sequencing data with transIndel.

机构信息

The Hormel Institute, University of Minnesota, 801 16th AVE NE, Austin, MN, 55912, USA.

Masonic Cancer Center, University of Minnesota, 420 Delaware St SE, Minneapolis, MN, 55455, USA.

出版信息

BMC Genomics. 2018 Apr 19;19(1):270. doi: 10.1186/s12864-018-4671-4.

DOI:10.1186/s12864-018-4671-4
PMID:29673323
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5909256/
Abstract

BACKGROUND

Insertions and deletions (indels) are a major class of genomic variation associated with human disease. Indels are primarily detected from DNA sequencing (DNA-seq) data but their transcriptional consequences remain unexplored due to challenges in discriminating medium-sized and large indels from splicing events in RNA-seq data.

RESULTS

Here, we developed transIndel, a splice-aware algorithm that parses the chimeric alignments predicted by a short read aligner and reconstructs the mid-sized insertions and large deletions based on the linear alignments of split reads from DNA-seq or RNA-seq data. TransIndel exhibits competitive or superior performance over eight state-of-the-art indel detection tools on benchmarks using both synthetic and real DNA-seq data. Additionally, we applied transIndel to DNA-seq and RNA-seq datasets from 333 primary prostate cancer patients from The Cancer Genome Atlas (TCGA) and 59 metastatic prostate cancer patients from AACR-PCF Stand-Up- To-Cancer (SU2C) studies. TransIndel enhanced the taxonomy of DNA- and RNA-level alterations in prostate cancer by identifying recurrent FOXA1 indels as well as exitron splicing in genes implicated in disease progression.

CONCLUSIONS

Our study demonstrates that transIndel is a robust tool for elucidation of medium- and large-sized indels from DNA-seq and RNA-seq data. Including RNA-seq in indel discovery efforts leads to significant improvements in sensitivity for identification of med-sized and large indels missed by DNA-seq, and reveals non-canonical RNA-splicing events in genes associated with disease pathology.

摘要

背景

插入和缺失(indels)是与人类疾病相关的主要基因组变异类型。indels 主要从 DNA 测序(DNA-seq)数据中检测到,但由于在 RNA-seq 数据中区分中等大小和大 indels 与剪接事件具有挑战性,其转录后果仍未得到探索。

结果

在这里,我们开发了 transIndel,这是一种拼接感知算法,它解析短读比对器预测的嵌合比对,并根据来自 DNA-seq 或 RNA-seq 数据的拆分读的线性比对,重建中型插入和大型缺失。transIndel 在使用合成和真实 DNA-seq 数据的基准测试中,在八个最先进的 indel 检测工具上表现出有竞争力或优越的性能。此外,我们将 transIndel 应用于来自 TCGA 的 333 名原发性前列腺癌患者的 DNA-seq 和 RNA-seq 数据集,以及 AACR-PCF Stand-Up-To-Cancer(SU2C)研究中的 59 名转移性前列腺癌患者的 DNA-seq 和 RNA-seq 数据集。transIndel 通过鉴定 FOXA1 indels 以及与疾病进展相关的基因中外显子剪接,增强了前列腺癌中 DNA 和 RNA 水平改变的分类。

结论

我们的研究表明,transIndel 是一种从 DNA-seq 和 RNA-seq 数据中阐明中等和大型 indels 的强大工具。在 indel 发现工作中包括 RNA-seq 可显著提高 DNA-seq 遗漏的中等和大型 indels 的检测灵敏度,并揭示与疾病病理学相关的基因中非典型的 RNA 剪接事件。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd56/5909256/8e290ea7a7bf/12864_2018_4671_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd56/5909256/2187ac6dd6aa/12864_2018_4671_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd56/5909256/77eb8930b73f/12864_2018_4671_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd56/5909256/e776bcf25c92/12864_2018_4671_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd56/5909256/089e9d2f1a20/12864_2018_4671_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd56/5909256/8e290ea7a7bf/12864_2018_4671_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd56/5909256/2187ac6dd6aa/12864_2018_4671_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd56/5909256/77eb8930b73f/12864_2018_4671_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd56/5909256/e776bcf25c92/12864_2018_4671_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd56/5909256/089e9d2f1a20/12864_2018_4671_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd56/5909256/8e290ea7a7bf/12864_2018_4671_Fig5_HTML.jpg

相似文献

1
Indel detection from DNA and RNA sequencing data with transIndel.使用 transIndel 从 DNA 和 RNA 测序数据中检测插入缺失
BMC Genomics. 2018 Apr 19;19(1):270. doi: 10.1186/s12864-018-4671-4.
2
Detecting Medium and Large Insertions and Deletions with transIndel.使用 transIndel 检测中型和大型插入和缺失
Methods Mol Biol. 2022;2493:67-75. doi: 10.1007/978-1-0716-2293-3_5.
3
Read-Split-Run: an improved bioinformatics pipeline for identification of genome-wide non-canonical spliced regions using RNA-Seq data.读取-分割-运行:一种利用RNA测序数据识别全基因组非经典剪接区域的改进型生物信息学流程。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):503. doi: 10.1186/s12864-016-2896-7.
4
Indel detection from RNA-seq data: tool evaluation and strategies for accurate detection of actionable mutations.从 RNA-seq 数据中检测插入缺失:工具评估和准确检测可操作突变的策略。
Brief Bioinform. 2017 Nov 1;18(6):973-983. doi: 10.1093/bib/bbw069.
5
Using high-throughput sequencing transcriptome data for INDEL detection: challenges for cancer drug discovery.利用高通量测序转录组数据进行插入缺失检测:癌症药物发现面临的挑战
Expert Opin Drug Discov. 2016;11(3):257-68. doi: 10.1517/17460441.2016.1143813. Epub 2016 Feb 6.
6
Identification and analysis of short indels inducing exon extension/shrinkage events.鉴定和分析导致exon 延伸/收缩事件的短插入缺失。
FEBS Open Bio. 2024 Oct;14(10):1682-1690. doi: 10.1002/2211-5463.13871. Epub 2024 Jul 31.
7
Indel sensitive and comprehensive variant/mutation detection from RNA sequencing data for precision medicine.从 RNA 测序数据中进行灵敏且全面的基因变异/突变检测,以实现精准医疗。
BMC Med Genomics. 2018 Sep 14;11(Suppl 3):67. doi: 10.1186/s12920-018-0391-5.
8
Identification of somatic mutations in human prostate cancer by RNA-Seq.通过 RNA-Seq 鉴定人前列腺癌中的体细胞突变。
Gene. 2013 May 1;519(2):343-7. doi: 10.1016/j.gene.2013.01.046. Epub 2013 Feb 19.
9
Detecting Somatic Insertions/Deletions (Indels) Using Tumor RNA-Seq Data.利用肿瘤 RNA-Seq 数据检测体细胞插入/缺失(Indels)。
Methods Mol Biol. 2024;2812:235-242. doi: 10.1007/978-1-0716-3886-6_13.
10
TAP: a targeted clinical genomics pipeline for detecting transcript variants using RNA-seq data.TAP:一种使用 RNA-seq 数据检测转录变体的靶向临床基因组学管道。
BMC Med Genomics. 2018 Sep 10;11(1):79. doi: 10.1186/s12920-018-0402-6.

引用本文的文献

1
Comprehensive analysis of 385 chloroplast genomes unveils phylogenetic relationships and evolutionary history in cassava.对385个叶绿体基因组的综合分析揭示了木薯的系统发育关系和进化历史。
BMC Plant Biol. 2025 Jul 3;25(1):858. doi: 10.1186/s12870-025-06883-2.
2
VISTA: an integrated framework for structural variant discovery.VISTA:一个用于结构变异发现的集成框架。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae462.
3
Improved inference of population histories by integrating genomic and epigenomic data.通过整合基因组和表观基因组数据来改进群体历史推断。

本文引用的文献

1
Truncation and constitutive activation of the androgen receptor by diverse genomic rearrangements in prostate cancer.前列腺癌中雄激素受体的截断和组成性激活通过多种基因组重排。
Nat Commun. 2016 Nov 29;7:13668. doi: 10.1038/ncomms13668.
2
novoBreak: local assembly for breakpoint detection in cancer genomes.novoBreak:用于癌症基因组中断点检测的局部组装
Nat Methods. 2017 Jan;14(1):65-67. doi: 10.1038/nmeth.4084. Epub 2016 Nov 28.
3
Direct Transcriptional Consequences of Somatic Mutation in Breast Cancer.乳腺癌体细胞突变的直接转录后果
Elife. 2024 Sep 12;12:RP89470. doi: 10.7554/eLife.89470.
4
Transcriptomic Profiling of Plasma Extracellular Vesicles Enables Reliable Annotation of the Cancer-Specific Transcriptome and Molecular Subtype.血浆细胞外囊泡转录组谱分析可实现对肿瘤特异性转录组和分子亚型的可靠注释。
Cancer Res. 2024 May 15;84(10):1719-1732. doi: 10.1158/0008-5472.CAN-23-4070.
5
Relevance of mutation-derived neoantigens and non-classical antigens for anticancer therapies.突变衍生的新抗原和非经典抗原与癌症治疗的相关性。
Hum Vaccin Immunother. 2024 Dec 31;20(1):2303799. doi: 10.1080/21645515.2024.2303799. Epub 2024 Feb 12.
6
Development of Gene-Based InDel Markers on Putative Drought Stress-Responsive Genes and Genetic Diversity of Durian (Durio zibethinus).基于推定耐旱相关基因的 InDel 标记的开发及榴莲(Durio zibethinus)遗传多样性分析。
Biochem Genet. 2024 Dec;62(6):4396-4407. doi: 10.1007/s10528-023-10638-9. Epub 2024 Feb 2.
7
Mastering DNA chromatogram analysis in Sanger sequencing for reliable clinical analysis.掌握桑格测序中的DNA色谱图分析以进行可靠的临床分析。
J Genet Eng Biotechnol. 2023 Nov 13;21(1):115. doi: 10.1186/s43141-023-00587-6.
8
ScanNeo2: a comprehensive workflow for neoantigen detection and immunogenicity prediction from diverse genomic and transcriptomic alterations.ScanNeo2:一种从多种基因组和转录组改变中检测新抗原和预测免疫原性的综合工作流程。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad659.
9
Genetic basis of the historical iron-accumulating dgl and brz mutants in pea.豌豆历史上铁积累的 dgl 和 brz 突变体的遗传基础。
Plant J. 2024 Jan;117(2):590-598. doi: 10.1111/tpj.16514. Epub 2023 Oct 26.
10
Multimodal CRISPR perturbations of GWAS loci associated with coronary artery disease in vascular endothelial cells.多模态 CRISPR 干扰与血管内皮细胞中冠心病相关的 GWAS 位点。
PLoS Genet. 2023 Mar 16;19(3):e1010680. doi: 10.1371/journal.pgen.1010680. eCollection 2023 Mar.
Cell Rep. 2016 Aug 16;16(7):2032-46. doi: 10.1016/j.celrep.2016.07.028. Epub 2016 Aug 4.
4
Indel detection from RNA-seq data: tool evaluation and strategies for accurate detection of actionable mutations.从 RNA-seq 数据中检测插入缺失:工具评估和准确检测可操作突变的策略。
Brief Bioinform. 2017 Nov 1;18(6):973-983. doi: 10.1093/bib/bbw069.
5
VarDict: a novel and versatile variant caller for next-generation sequencing in cancer research.VarDict:一种用于癌症研究中下一代测序的新型多功能变异检测工具。
Nucleic Acids Res. 2016 Jun 20;44(11):e108. doi: 10.1093/nar/gkw227. Epub 2016 Apr 7.
6
A survey of best practices for RNA-seq data analysis.RNA测序数据分析的最佳实践调查。
Genome Biol. 2016 Jan 26;17:13. doi: 10.1186/s13059-016-0881-8.
7
Using high-throughput sequencing transcriptome data for INDEL detection: challenges for cancer drug discovery.利用高通量测序转录组数据进行插入缺失检测:癌症药物发现面临的挑战
Expert Opin Drug Discov. 2016;11(3):257-68. doi: 10.1517/17460441.2016.1143813. Epub 2016 Feb 6.
8
UCSC Data Integrator and Variant Annotation Integrator.加州大学圣克鲁兹分校数据整合器与变异注释整合器
Bioinformatics. 2016 May 1;32(9):1430-2. doi: 10.1093/bioinformatics/btv766. Epub 2016 Jan 6.
9
ScanIndel: a hybrid framework for indel detection via gapped alignment, split reads and de novo assembly.ScanIndel:一种通过间隙比对、分割读段和从头组装进行插入缺失检测的混合框架。
Genome Med. 2015 Dec 7;7:127. doi: 10.1186/s13073-015-0251-2.
10
INTEGRATE: gene fusion discovery using whole genome and transcriptome data.整合:利用全基因组和转录组数据发现基因融合
Genome Res. 2016 Jan;26(1):108-18. doi: 10.1101/gr.186114.114. Epub 2015 Nov 10.