• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

IMSindel:一种准确的中等大小插入缺失检测工具,结合从头组装和缺口全局-局部比对以及拆分读分析。

IMSindel: An accurate intermediate-size indel detection tool incorporating de novo assembly and gapped global-local alignment with split read analysis.

机构信息

Department for Medical Genome Sciences, Medical Genome Center, National Center for Geriatrics and Gerontology, Aichi, Japan.

Department of Medical Science Mathematics, Medical Research Institute, Tokyo Medical and Dental University (TMDU), Tokyo, Japan.

出版信息

Sci Rep. 2018 Apr 4;8(1):5608. doi: 10.1038/s41598-018-23978-z.

DOI:10.1038/s41598-018-23978-z
PMID:29618752
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5884821/
Abstract

Insertions and deletions (indels) have been implicated in dozens of human diseases through the radical alteration of gene function by short frameshift indels as well as long indels. However, the accurate detection of these indels from next-generation sequencing data is still challenging. This is particularly true for intermediate-size indels (≥50 bp), due to the short DNA sequencing reads. Here, we developed a new method that predicts intermediate-size indels using BWA soft-clipped fragments (unmatched fragments in partially mapped reads) and unmapped reads. We report the performance comparison of our method, GATK, PINDEL and ScanIndel, using whole exome sequencing data from the same samples. False positive and false negative counts were determined through Sanger sequencing of all predicted indels across these four methods. The harmonic mean of the recall and precision, F-measure, was used to measure the performance of each method. Our method achieved the highest F-measure of 0.84 in one sample, compared to 0.56 for GATK, 0.52 for PINDEL and 0.46 for ScanIndel. Similar results were obtained in additional samples, demonstrating that our method was superior to the other methods for detecting intermediate-size indels. We believe that this methodology will contribute to the discovery of intermediate-size indels associated with human disease.

摘要

插入和缺失(indels)通过短的移码 indels 以及长的 indels 对基因功能产生剧烈改变,从而与数十种人类疾病相关。然而,从下一代测序数据中准确检测这些 indels 仍然具有挑战性。对于中等大小的 indels(≥50 bp),由于 DNA 测序读段较短,情况更是如此。在这里,我们开发了一种新的方法,该方法使用 BWA 软剪辑片段(部分映射读取中的未匹配片段)和未映射读取来预测中等大小的 indels。我们报告了使用来自相同样本的全外显子组测序数据,对我们的方法、GATK、PINDEL 和 ScanIndel 进行的性能比较。通过对这四种方法预测的所有 indels 进行 Sanger 测序,确定了假阳性和假阴性计数。我们使用每个方法的召回率和精度的调和平均值(F-measure)来衡量性能。在一个样本中,我们的方法的 F-measure 达到了最高的 0.84,而 GATK 为 0.56,PINDEL 为 0.52,ScanIndel 为 0.46。在其他样本中也得到了类似的结果,这表明我们的方法在检测中等大小的 indels 方面优于其他方法。我们相信,这种方法将有助于发现与人类疾病相关的中等大小的 indels。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5f4/5884821/b1aba6d355de/41598_2018_23978_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5f4/5884821/0d5220621e61/41598_2018_23978_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5f4/5884821/96145160babb/41598_2018_23978_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5f4/5884821/a0ce316af4fa/41598_2018_23978_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5f4/5884821/42dedd1ff2e9/41598_2018_23978_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5f4/5884821/b1aba6d355de/41598_2018_23978_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5f4/5884821/0d5220621e61/41598_2018_23978_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5f4/5884821/96145160babb/41598_2018_23978_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5f4/5884821/a0ce316af4fa/41598_2018_23978_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5f4/5884821/42dedd1ff2e9/41598_2018_23978_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5f4/5884821/b1aba6d355de/41598_2018_23978_Fig5_HTML.jpg

相似文献

1
IMSindel: An accurate intermediate-size indel detection tool incorporating de novo assembly and gapped global-local alignment with split read analysis.IMSindel:一种准确的中等大小插入缺失检测工具,结合从头组装和缺口全局-局部比对以及拆分读分析。
Sci Rep. 2018 Apr 4;8(1):5608. doi: 10.1038/s41598-018-23978-z.
2
The challenge of detecting indels in bacterial genomes from short-read sequencing data.从短读长测序数据中检测细菌基因组插入缺失的挑战。
J Biotechnol. 2017 May 20;250:11-15. doi: 10.1016/j.jbiotec.2017.02.026. Epub 2017 Mar 4.
3
ScanIndel: a hybrid framework for indel detection via gapped alignment, split reads and de novo assembly.ScanIndel:一种通过间隙比对、分割读段和从头组装进行插入缺失检测的混合框架。
Genome Med. 2015 Dec 7;7:127. doi: 10.1186/s13073-015-0251-2.
4
SOAPindel: efficient identification of indels from short paired reads.SOAPindel:从短配对读取中有效识别插入缺失。
Genome Res. 2013 Jan;23(1):195-200. doi: 10.1101/gr.132480.111. Epub 2012 Sep 12.
5
Indel detection from RNA-seq data: tool evaluation and strategies for accurate detection of actionable mutations.从 RNA-seq 数据中检测插入缺失:工具评估和准确检测可操作突变的策略。
Brief Bioinform. 2017 Nov 1;18(6):973-983. doi: 10.1093/bib/bbw069.
6
Optimized detection of insertions/deletions (INDELs) in whole-exome sequencing data.全外显子组测序数据中插入/缺失(INDELs)的优化检测
PLoS One. 2017 Aug 9;12(8):e0182272. doi: 10.1371/journal.pone.0182272. eCollection 2017.
7
Comparison of insertion/deletion calling algorithms on human next-generation sequencing data.人类下一代测序数据中插入/缺失检测算法的比较。
BMC Res Notes. 2014 Dec 1;7:864. doi: 10.1186/1756-0500-7-864.
8
INDELseek: detection of complex insertions and deletions from next-generation sequencing data.INDELseek:从下一代测序数据中检测复杂插入和缺失
BMC Genomics. 2017 Jan 5;18(1):16. doi: 10.1186/s12864-016-3449-9.
9
Uncovering missed indels by leveraging unmapped reads.利用未映射的读取来揭示遗漏的插入缺失。
Sci Rep. 2019 Jul 31;9(1):11093. doi: 10.1038/s41598-019-47405-z.
10
Amplicon Indel Hunter Is a Novel Bioinformatics Tool to Detect Large Somatic Insertion/Deletion Mutations in Amplicon-Based Next-Generation Sequencing Data.扩增子插入缺失猎手是一种用于检测基于扩增子的下一代测序数据中大型体细胞插入/缺失突变的新型生物信息学工具。
J Mol Diagn. 2015 Nov;17(6):635-43. doi: 10.1016/j.jmoldx.2015.06.005. Epub 2015 Aug 28.

引用本文的文献

1
Targeted long-read sequencing enables higher diagnostic yield of ADPKD by accurate PKD1 genetic analysis.靶向长读长测序通过准确的PKD1基因分析提高了常染色体显性多囊肾病(ADPKD)的诊断率。
NPJ Genom Med. 2025 Mar 11;10(1):22. doi: 10.1038/s41525-025-00477-5.
2
Identification of a large intra-exonic deletion in exon 18 in a pancreatic ductal adenocarcinoma.在一例胰腺导管腺癌中鉴定出第18外显子存在一个大的外显子内缺失。
Ther Adv Med Oncol. 2023 Jan 18;15:17588359221146132. doi: 10.1177/17588359221146132. eCollection 2023.
3
Optimizing Insertion and Deletion Detection Using Next-Generation Sequencing in the Clinical Laboratory.

本文引用的文献

1
SV-STAT accurately detects structural variation via alignment to reference-based assemblies.SV-STAT通过与基于参考的组装进行比对,准确检测结构变异。
Source Code Biol Med. 2016 Jun 18;11:8. doi: 10.1186/s13029-016-0051-0. eCollection 2016.
2
ScanIndel: a hybrid framework for indel detection via gapped alignment, split reads and de novo assembly.ScanIndel:一种通过间隙比对、分割读段和从头组装进行插入缺失检测的混合框架。
Genome Med. 2015 Dec 7;7:127. doi: 10.1186/s13073-015-0251-2.
3
A combination of targeted enrichment methodologies for whole-exome sequencing reveals novel pathogenic mutations.
利用下一代测序技术在临床实验室中优化插入和缺失检测。
J Mol Diagn. 2022 Dec;24(12):1217-1231. doi: 10.1016/j.jmoldx.2022.08.006. Epub 2022 Sep 24.
4
Plant pathogenic bacterium can rapidly evolve tolerance to an antimicrobial plant allelochemical.植物病原细菌能够迅速进化出对抗菌植物化感物质的耐受性。
Evol Appl. 2022 Mar 18;15(5):735-750. doi: 10.1111/eva.13363. eCollection 2022 May.
5
svBreak: A New Approach for the Detection of Structural Variant Breakpoints Based on Convolutional Neural Network.svBreak:一种基于卷积神经网络检测结构变异断点的新方法。
Biomed Res Int. 2022 Mar 19;2022:7196040. doi: 10.1155/2022/7196040. eCollection 2022.
6
Whole exome analysis of patients in Japan with hearing loss reveals high heterogeneity among responsible and novel candidate genes.对日本听力损失患者的全外显子组分析揭示了候选基因的高异质性
Orphanet J Rare Dis. 2022 Mar 5;17(1):114. doi: 10.1186/s13023-022-02262-4.
7
Detection of homozygous and hemizygous complete or partial exon deletions by whole-exome sequencing.通过全外显子组测序检测纯合子和半合子的完全或部分外显子缺失。
NAR Genom Bioinform. 2021 May 22;3(2):lqab037. doi: 10.1093/nargab/lqab037. eCollection 2021 Jun.
8
Characterization of intermediate-sized insertions using whole-genome sequencing data and analysis of their functional impact on gene expression.利用全基因组测序数据对中等大小插入进行特征分析,并分析它们对基因表达的功能影响。
Hum Genet. 2021 Aug;140(8):1201-1216. doi: 10.1007/s00439-021-02291-2. Epub 2021 May 12.
9
Rad51 tolerates mismatches in hybrid meiosis with diverse genome sequences.Rad51 容忍杂种减数分裂中具有不同基因组序列的错配。
Proc Natl Acad Sci U S A. 2021 Feb 23;118(8). doi: 10.1073/pnas.2007192118.
10
Identification of the novel COL5A1 c.3369_3431dup, p.(Glu1124_Gly1144dup) variant in a patient with incomplete classical Ehlers-Danlos syndrome: The importance of phenotype-guided genetic testing.在一名不完全性经典型埃勒斯-当洛斯综合征患者中鉴定出新的 COL5A1 c.3369_3431dup,p.(Glu1124_Gly1144dup)变异:表型指导下基因检测的重要性。
Mol Genet Genomic Med. 2020 Oct;8(10):e1422. doi: 10.1002/mgg3.1422. Epub 2020 Jul 28.
用于全外显子组测序的靶向富集方法组合揭示了新的致病突变。
Sci Rep. 2015 Mar 19;5:9331. doi: 10.1038/srep09331.
4
Comparison of insertion/deletion calling algorithms on human next-generation sequencing data.人类下一代测序数据中插入/缺失检测算法的比较。
BMC Res Notes. 2014 Dec 1;7:864. doi: 10.1186/1756-0500-7-864.
5
Accurate de novo and transmitted indel detection in exome-capture data using microassembly.利用微组装技术对捕获外显子组数据进行精确的从头和传递插入缺失检测。
Nat Methods. 2014 Oct;11(10):1033-6. doi: 10.1038/nmeth.3069. Epub 2014 Aug 17.
6
Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications.整合基于图谱、组装和单倍型的方法以在临床测序应用中进行变异检测。
Nat Genet. 2014 Aug;46(8):912-918. doi: 10.1038/ng.3036. Epub 2014 Jul 13.
7
Clinical whole-exome sequencing: are we there yet?临床全外显子组测序:我们到那儿了吗?
Genet Med. 2014 Sep;16(9):717-9. doi: 10.1038/gim.2014.10. Epub 2014 Feb 13.
8
Clinical whole-exome sequencing for the diagnosis of mendelian disorders.临床全外显子测序用于孟德尔疾病的诊断。
N Engl J Med. 2013 Oct 17;369(16):1502-11. doi: 10.1056/NEJMoa1306555. Epub 2013 Oct 2.
9
A practical method to detect SNVs and indels from whole genome and exome sequencing data.一种从全基因组和外显子组测序数据中检测 SNVs 和 indels 的实用方法。
Sci Rep. 2013;3:2161. doi: 10.1038/srep02161.
10
MAFFT multiple sequence alignment software version 7: improvements in performance and usability.MAFFT 多序列比对软件版本 7:性能和易用性的改进。
Mol Biol Evol. 2013 Apr;30(4):772-80. doi: 10.1093/molbev/mst010. Epub 2013 Jan 16.