• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ERINS:通过构建扩展参考来检测新的序列插入。

ERINS: Novel Sequence Insertion Detection by Constructing an Extended Reference.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):1893-1901. doi: 10.1109/TCBB.2019.2954315. Epub 2021 Oct 7.

DOI:10.1109/TCBB.2019.2954315
PMID:31751246
Abstract

Next generation sequencing technology has led to the development of methods for the detection of novel sequence insertions (nsINS). Multiple signatures from short reads are usually extracted to improve nsINS detection performance. However, characterization of nsINSs larger than the mean insert size is still challenging. This article presents a new method, ERINS, to detect nsINS contents and genotypes of full spectrum range size. It integrates the features of structural variations and mapping states of split reads to find nsINS breakpoints, and then adopts a left-most mapping strategy to infer nsINS content by iteratively extending the standard reference at each breakpoint. Finally, it realigns all reads to the extended reference and infers nsINS genotypes through statistical testing on read counts. We test and validate the performance of ERINS on simulation and real sequencing datasets. The simulation experimental results demonstrate that it outperforms several peer methods with respect to sensitivity and precision. The real data application indicates that ERINS obtains high consistent results with those of previously reported and detects nsINSs over 200 base pairs that many other methods fail. In conclusion, ERINS can be used as a supplement to existing tools and will become a routine approach for characterizing nsINSs.

摘要

下一代测序技术已经开发出了用于检测新序列插入(nsINS)的方法。通常会提取来自短读长的多个特征来提高 nsINS 检测性能。然而,大于平均插入大小的 nsINS 的特征描述仍然具有挑战性。本文提出了一种新的方法 ERINS,用于检测全谱范围大小的 nsINS 内容和基因型。它集成了结构变异的特征和分裂读的映射状态,以找到 nsINS 的断点,然后采用最左映射策略,通过在每个断点处迭代扩展标准参考来推断 nsINS 内容。最后,它将所有读重新比对到扩展的参考,并通过对读计数进行统计检验来推断 nsINS 基因型。我们在模拟和真实测序数据集上测试和验证了 ERINS 的性能。模拟实验结果表明,它在灵敏度和精度方面优于几个同类方法。实际数据应用表明,ERINS 与先前报道的结果高度一致,并检测到了许多其他方法无法检测到的超过 200 个碱基对的 nsINS。总之,ERINS 可以作为现有工具的补充,并将成为特征描述 nsINS 的常规方法。

相似文献

1
ERINS: Novel Sequence Insertion Detection by Constructing an Extended Reference.ERINS:通过构建扩展参考来检测新的序列插入。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):1893-1901. doi: 10.1109/TCBB.2019.2954315. Epub 2021 Oct 7.
2
SVSR: A Program to Simulate Structural Variations and Generate Sequencing Reads for Multiple Platforms.SVSR:一个用于模拟结构变异并为多个平台生成测序读数的程序。
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):1082-1091. doi: 10.1109/TCBB.2018.2876527. Epub 2018 Oct 17.
3
Identification of indels in next-generation sequencing data.下一代测序数据中插入缺失的鉴定。
BMC Bioinformatics. 2015 Feb 13;16(1):42. doi: 10.1186/s12859-015-0483-6.
4
Discovery and genotyping of novel sequence insertions in many sequenced individuals.在许多测序个体中发现和基因分型新的序列插入。
Bioinformatics. 2017 Jul 15;33(14):i161-i169. doi: 10.1093/bioinformatics/btx254.
5
Identification of genomic indels and structural variations using split reads.利用拆分读识别基因组插入/缺失和结构变异。
BMC Genomics. 2011 Jul 25;12:375. doi: 10.1186/1471-2164-12-375.
6
SInC: an accurate and fast error-model based simulator for SNPs, Indels and CNVs coupled with a read generator for short-read sequence data.SInC:一种准确且快速的基于错误模型的 SNP、Indel 和 CNV 模拟器,结合了用于短读序列数据的读取生成器。
BMC Bioinformatics. 2014 Feb 5;15:40. doi: 10.1186/1471-2105-15-40.
7
Optimized detection of insertions/deletions (INDELs) in whole-exome sequencing data.全外显子组测序数据中插入/缺失(INDELs)的优化检测
PLoS One. 2017 Aug 9;12(8):e0182272. doi: 10.1371/journal.pone.0182272. eCollection 2017.
8
ScanIndel: a hybrid framework for indel detection via gapped alignment, split reads and de novo assembly.ScanIndel:一种通过间隙比对、分割读段和从头组装进行插入缺失检测的混合框架。
Genome Med. 2015 Dec 7;7:127. doi: 10.1186/s13073-015-0251-2.
9
PRISM: pair-read informed split-read mapping for base-pair level detection of insertion, deletion and structural variants.PRISM:基于双读信息的分读比对算法,用于检测插入、缺失和结构变异的碱基对水平。
Bioinformatics. 2012 Oct 15;28(20):2576-83. doi: 10.1093/bioinformatics/bts484. Epub 2012 Jul 31.
10
Performance evaluation of indel calling tools using real short-read data.使用真实短读长数据对插入缺失(Indel)检测工具进行性能评估。
Hum Genomics. 2015 Aug 19;9(1):20. doi: 10.1186/s40246-015-0042-2.

引用本文的文献

1
A Density Peak-Based Method to Detect Copy Number Variations From Next-Generation Sequencing Data.一种基于密度峰值的方法,用于从下一代测序数据中检测拷贝数变异。
Front Genet. 2021 Jan 13;11:632311. doi: 10.3389/fgene.2020.632311. eCollection 2020.