Pash: efficient genome-scale sequence anchoring by Positional Hashing.

Authors

Kalafus Ken J, Jackson Andrew R, Milosavljevic Aleksandar

Affiliation

Program in Structural and Computational Biology and Molecular Biophysics, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, 77030, USA.

Publication information

Genome Res. 2004 Apr;14(4):672-8. doi: 10.1101/gr.1963804.

DOI: 10.1101/gr.1963804
PMID: 15060009
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC383312/

Abstract

Pash is a computer program for efficient, parallel, all-against-all comparison of very long DNA sequences. Pash implements Positional Hashing, a novel parallelizable method for sequence comparison based on k-mer representation of sequences. The Positional Hashing method breaks the comparison problem in a unique way that avoids the quadratic penalty encountered with other sensitive methods and confers inherent low-level parallelism. Furthermore, Positional Hashing allows one to readily and predictably trade between sensitivity and speed. In a simulated comparison task, anchoring computationally mutated reads onto a genome, the sensitivity of Pash was equal to or greater than that of BLAST and BLAT, with Pash outperforming these programs as the reads became shorter and less similar to the genome. Using modest computing resources, we employed Pash for two large-scale sequence comparison tasks: comparison of three mammalian genomes, and anchoring millions of chimpanzee whole-genome shotgun sequencing reads onto the human genome. The results of these comparisons by Pash agree with those computed by other methods that use more than an order of magnitude more computing resources. These results confirm the sensitivity of Positional Hashing.
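
The abstract does not spell out the internals of Positional Hashing, so the following is only a generic illustration of what k-mer based anchoring of a read onto a genome can look like. It is not Pash's algorithm. The function names `build_kmer_index` and `anchor_read`, the `step` sampling parameter, and the toy sequences are all assumptions made for this sketch. Each shared k-mer votes for a diagonal (genome position minus read position), and the diagonal with the most votes is reported as the anchor.

```python
from collections import Counter, defaultdict


def build_kmer_index(genome, k, step=1):
    """Map each sampled genome k-mer to the list of positions where it starts.

    `step` is a hypothetical knob for this sketch: a larger step stores fewer
    k-mers (smaller, faster index) at the cost of fewer potential hits.
    """
    index = defaultdict(list)
    for pos in range(0, len(genome) - k + 1, step):
        index[genome[pos:pos + k]].append(pos)
    return index


def anchor_read(read, index, k):
    """Return (genome_offset, supporting_kmer_hits) for the best diagonal.

    Returns None when the read shares no indexed k-mer with the genome.
    """
    diagonals = Counter()
    for read_pos in range(len(read) - k + 1):
        kmer = read[read_pos:read_pos + k]
        # .get avoids inserting empty entries into the defaultdict.
        for genome_pos in index.get(kmer, ()):
            diagonals[genome_pos - read_pos] += 1
    if not diagonals:
        return None
    return diagonals.most_common(1)[0]


# Toy data: the read is genome[9:23] with one substitution (C -> A).
genome = "ACGTTGCATGACCGTAGGCTAACGTTAGC"
read = "GACCGTAGGATAAC"

index = build_kmer_index(genome, k=4)
print(anchor_read(read, index, k=4))  # (9, 7)
```

In this toy case the read still anchors at offset 9 despite the mismatch, because seven of its eleven 4-mers are unaffected by the substitution. Building the index with `step=2` leaves only three supporting hits on the same diagonal, a simple picture of giving up sensitivity to reduce work. How Pash itself partitions and parallelizes the comparison is described in the full paper, not here.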
