• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Pash 2.0:用于下一代测序技术的可扩展序列锚定

Pash 2.0: scaleable sequence anchoring for next-generation sequencing technologies.

作者信息

Coarfa Cristian, Milosavljevic Aleksandar

机构信息

Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA.

出版信息

Pac Symp Biocomput. 2008:102-13.

PMID:18229679
Abstract

Many applications of next-generation sequencing technologies involve anchoring of a sequence fragment or a tag onto a corresponding position on a reference genome assembly. Positional Hashing method, implemented in the Pash 2.0 program, is specifically designed for the task of high-volume anchoring. In this article we present multi-diagonal gapped kmer collation and other improvements introduced in Pash 2.0 that further improve accuracy and speed of Positional Hashing. The goal of this article is to show that gapped kmer matching with cross-diagonal collation suffices for anchoring across close evolutionary distances and for the purpose of human resequencing. We propose a benchmark for evaluating the performance of anchoring programs that captures key parameters in specific applications, including duplicative structure of genomes of humans and other species. We demonstrate speedups of up to tenfold in large-scale anchoring experiments achieved by PASH 2.0 when compared to BLAT, another similarity search program frequently used for anchoring.

摘要

下一代测序技术的许多应用都涉及将序列片段或标签锚定到参考基因组组装的相应位置上。Pash 2.0程序中实现的位置哈希方法专门针对大量锚定任务而设计。在本文中,我们介绍了Pash 2.0中引入的多对角线间隔kmer核对及其他改进,这些改进进一步提高了位置哈希的准确性和速度。本文的目的是表明,采用交叉对角线核对的间隔kmer匹配足以跨越较近的进化距离进行锚定,并适用于人类重测序目的。我们提出了一个用于评估锚定程序性能的基准,该基准涵盖特定应用中的关键参数,包括人类和其他物种基因组的重复结构。与另一个常用于锚定的相似性搜索程序BLAT相比,我们展示了在大规模锚定实验中PASH 2.0实现了高达十倍的加速。

相似文献

1
Pash 2.0: scaleable sequence anchoring for next-generation sequencing technologies.Pash 2.0:用于下一代测序技术的可扩展序列锚定
Pac Symp Biocomput. 2008:102-13.
2
Pash: efficient genome-scale sequence anchoring by Positional Hashing.帕什:通过位置哈希实现高效的基因组规模序列锚定。
Genome Res. 2004 Apr;14(4):672-8. doi: 10.1101/gr.1963804.
3
LSHPlace: fast phylogenetic placement using locality-sensitive hashing.LSHPlace:使用局部敏感哈希进行快速系统发育定位
Pac Symp Biocomput. 2013:310-9.
4
Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing.Pash 3.0:一个通用软件包,用于使用大规模平行 DNA 测序进行读映射和基因组和表观基因组变异的综合分析。
BMC Bioinformatics. 2010 Nov 23;11:572. doi: 10.1186/1471-2105-11-572.
5
Evaluation of next-generation sequencing software in mapping and assembly.下一代测序软件在图谱绘制和组装方面的评估。
J Hum Genet. 2011 Jun;56(6):406-14. doi: 10.1038/jhg.2011.43. Epub 2011 Apr 28.
6
Short read DNA fragment anchoring algorithm.短读长DNA片段锚定算法
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S17. doi: 10.1186/1471-2105-10-S1-S17.
7
Analysis of high-throughput sequencing data.高通量测序数据的分析
Methods Mol Biol. 2011;678:1-11. doi: 10.1007/978-1-60761-682-5_1.
8
A survey of error-correction methods for next-generation sequencing.下一代测序错误纠正方法综述。
Brief Bioinform. 2013 Jan;14(1):56-66. doi: 10.1093/bib/bbs015. Epub 2012 Apr 6.
9
Analysis and prediction of functional sub-types from protein sequence alignments.基于蛋白质序列比对的功能亚类型分析与预测。
J Mol Biol. 2000 Oct 13;303(1):61-76. doi: 10.1006/jmbi.2000.4036.
10
Genome comparison without alignment using shortest unique substrings.使用最短唯一子串进行无需比对的基因组比较。
BMC Bioinformatics. 2005 May 23;6:123. doi: 10.1186/1471-2105-6-123.

引用本文的文献

1
Integrating Epigenomics into the Understanding of Biomedical Insight.将表观基因组学融入对生物医学见解的理解中。
Bioinform Biol Insights. 2016 Dec 4;10:267-289. doi: 10.4137/BBI.S38427. eCollection 2016.
2
Comparison and quantitative verification of mapping algorithms for whole-genome bisulfite sequencing.全基因组亚硫酸氢盐测序映射算法的比较与定量验证
Nucleic Acids Res. 2014 Apr;42(6):e43. doi: 10.1093/nar/gkt1325. Epub 2014 Jan 3.
3
High Performance Multiple Sequence Alignment System for Pyrosequencing Reads from Multiple Reference Genomes.
用于来自多个参考基因组的焦磷酸测序读数的高性能多序列比对系统。
J Parallel Distrib Comput. 2012 Jan;72(1):83-93. doi: 10.1016/j.jpdc.2011.08.001. Epub 2011 Sep 16.
4
Genomic hypomethylation in the human germline associates with selective structural mutability in the human genome.人类生殖细胞中的基因组低甲基化与人类基因组中选择性结构突变有关。
PLoS Genet. 2012;8(5):e1002692. doi: 10.1371/journal.pgen.1002692. Epub 2012 May 17.
5
Song exposure regulates known and novel microRNAs in the zebra finch auditory forebrain.歌曲暴露调控斑马雀听觉前脑的已知和新的 microRNAs。
BMC Genomics. 2011 May 31;12(1):277. doi: 10.1186/1471-2164-12-277.
6
Simrank: Rapid and sensitive general-purpose k-mer search tool.Simrank:快速且敏感的通用 k-mer 搜索工具。
BMC Ecol. 2011 Apr 27;11:11. doi: 10.1186/1472-6785-11-11.
7
Next generation sequencing based approaches to epigenomics.基于下一代测序的表观基因组学方法。
Brief Funct Genomics. 2010 Dec;9(5-6):455-65. doi: 10.1093/bfgp/elq035.
8
Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing.Pash 3.0:一个通用软件包,用于使用大规模平行 DNA 测序进行读映射和基因组和表观基因组变异的综合分析。
BMC Bioinformatics. 2010 Nov 23;11:572. doi: 10.1186/1471-2105-11-572.
9
Comparison of sequencing-based methods to profile DNA methylation and identification of monoallelic epigenetic modifications.基于测序的方法比较以描绘 DNA 甲基化并鉴定单等位基因表观遗传修饰。
Nat Biotechnol. 2010 Oct;28(10):1097-105. doi: 10.1038/nbt.1682. Epub 2010 Sep 19.
10
MicroRNA transcriptome in the newborn mouse ovaries determined by massive parallel sequencing.大规模平行测序技术鉴定新生期小鼠卵巢中的 microRNA 转录组
Mol Hum Reprod. 2010 Jul;16(7):463-71. doi: 10.1093/molehr/gaq017. Epub 2010 Mar 9.