• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

pblat:一种多线程 blat 算法,用于加速将序列与基因组对齐。

pblat: a multithread blat algorithm speeding up aligning sequences to genomes.

机构信息

Center for Bioinformatics, State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, 100871, People's Republic of China.

出版信息

BMC Bioinformatics. 2019 Jan 15;20(1):28. doi: 10.1186/s12859-019-2597-8.

DOI:10.1186/s12859-019-2597-8
PMID:30646844
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6334396/
Abstract

BACKGROUND

The blat is a widely used sequence alignment tool. It is especially useful for aligning long sequences and gapped mapping, which cannot be performed properly by other fast sequence mappers designed for short reads. However, the blat tool is single threaded and when used to map whole genome or whole transcriptome sequences to reference genomes this program can take days to finish, making it unsuitable for large scale sequencing projects and iterative analysis. Here, we present pblat (parallel blat), a parallelized blat algorithm with multithread and cluster computing support, which functions to rapidly fine map large scale DNA/RNA sequences against genomes.

RESULTS

The pblat algorithm takes advantage of modern multicore processors and significantly reduces the run time with the number of threads used. pblat utilizes almost equal amount of memory as when running blat. The results generated by pblat are identical with those generated by blat. The pblat tool is easy to install and can run on Linux and Mac OS systems. In addition, we provide a cluster version of pblat (pblat-cluster) running on computing clusters with MPI support.

CONCLUSION

pblat is open source and free available for non-commercial users. It is easy to install and easy to use. pblat and pblat-cluster would facilitate the high-throughput mapping of large scale genomic and transcript sequences to reference genomes with both high speed and high precision.

摘要

背景

blat 是一种广泛使用的序列比对工具。它特别适用于对齐长序列和缺口映射,而其他专为短读长设计的快速序列映射器无法正确执行这些操作。然而,blat 工具是单线程的,当用于将整个基因组或整个转录组序列映射到参考基因组时,该程序可能需要数天才能完成,因此不适合大规模测序项目和迭代分析。在这里,我们提出了 pblat(并行 blat),这是一种具有多线程和集群计算支持的并行 blat 算法,用于快速对大规模 DNA/RNA 序列进行精细映射到基因组上。

结果

pblat 算法利用现代多核处理器,通过使用的线程数量显著缩短运行时间。pblat 利用的内存与 blat 运行时几乎相同。pblat 生成的结果与 blat 生成的结果完全一致。pblat 工具易于安装,可以在 Linux 和 Mac OS 系统上运行。此外,我们提供了一个带有 MPI 支持的计算集群上运行的 pblat 集群版本(pblat-cluster)。

结论

pblat 是开源的,免费供非商业用户使用。它易于安装和使用。pblat 和 pblat-cluster 将有助于以高速和高精度将大规模基因组和转录序列映射到参考基因组。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff0b/6334396/9d70df8bb3f3/12859_2019_2597_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff0b/6334396/de2c0ecfd7cd/12859_2019_2597_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff0b/6334396/9d70df8bb3f3/12859_2019_2597_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff0b/6334396/de2c0ecfd7cd/12859_2019_2597_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff0b/6334396/9d70df8bb3f3/12859_2019_2597_Fig2_HTML.jpg

相似文献

1
pblat: a multithread blat algorithm speeding up aligning sequences to genomes.pblat:一种多线程 blat 算法,用于加速将序列与基因组对齐。
BMC Bioinformatics. 2019 Jan 15;20(1):28. doi: 10.1186/s12859-019-2597-8.
2
CLAST: CUDA implemented large-scale alignment search tool.CLAST:基于CUDA实现的大规模比对搜索工具。
BMC Bioinformatics. 2014 Dec 11;15(1):406. doi: 10.1186/s12859-014-0406-y.
3
Ψ-RA: a parallel sparse index for genomic read alignment.Ψ-RA:一种用于基因组读取比对的并行稀疏索引。
BMC Genomics. 2011;12 Suppl 2(Suppl 2):S7. doi: 10.1186/1471-2164-12-S2-S7. Epub 2011 Jul 27.
4
Multi-threading the generation of Burrows-Wheeler Alignment.多线程生成布罗-惠勒比对。
Genet Mol Res. 2016 May 23;15(2):gmr8650. doi: 10.4238/gmr.15028650.
5
Fast inexact mapping using advanced tree exploration on backward search methods.在反向搜索方法上使用高级树探索的快速不精确映射。
BMC Bioinformatics. 2015 Jan 28;16:18. doi: 10.1186/s12859-014-0438-3.
6
BFAST: an alignment tool for large scale genome resequencing.BFAST:用于大规模基因组重测序的比对工具。
PLoS One. 2009 Nov 11;4(11):e7767. doi: 10.1371/journal.pone.0007767.
7
Rapid detection and curation of conserved DNA via enhanced-BLAT and EvoPrinterHD analysis.通过增强型BLAT和EvoPrinterHD分析快速检测和整理保守DNA
BMC Genomics. 2008 Feb 28;9:106. doi: 10.1186/1471-2164-9-106.
8
RandAL: a randomized approach to aligning DNA sequences to reference genomes.RandAL:一种将DNA序列与参考基因组进行比对的随机方法。
BMC Genomics. 2014;15 Suppl 5(Suppl 5):S2. doi: 10.1186/1471-2164-15-S5-S2. Epub 2014 Jul 14.
9
A Long Fragment Aligner called ALFALFA.一个名为ALFALFA的长片段比对工具。
BMC Bioinformatics. 2015 May 15;16(1):159. doi: 10.1186/s12859-015-0533-0.
10
Anatomy of a hash-based long read sequence mapping algorithm for next generation DNA sequencing.基于哈希的下一代 DNA 测序长读序列映射算法剖析。
Bioinformatics. 2011 Jan 15;27(2):189-95. doi: 10.1093/bioinformatics/btq648. Epub 2010 Nov 18.

引用本文的文献

1
Get ready for short tandem repeats analysis using long reads-the challenges and the state of the art.为使用长读长进行短串联重复序列分析做好准备——挑战与当前技术水平
Front Genet. 2025 Jul 2;16:1610026. doi: 10.3389/fgene.2025.1610026. eCollection 2025.
2
Precise detection of differential RNA editing sites across varied biological conditions using the CADRES pipeline.使用CADRES流程精确检测不同生物学条件下的差异RNA编辑位点。
Sci Rep. 2025 Jun 4;15(1):19683. doi: 10.1038/s41598-025-04957-7.
3
Editome Disease Knowledgebase v2.0: an updated resource of editome-disease associations through literature curation and integrative analysis.

本文引用的文献

1
Nanopore long-read RNAseq reveals widespread transcriptional variation among the surface receptors of individual B cells.纳米孔长读 RNA 测序揭示了个体 B 细胞表面受体之间广泛的转录变异性。
Nat Commun. 2017 Jul 19;8:16027. doi: 10.1038/ncomms16027.
2
HISAT: a fast spliced aligner with low memory requirements.HISAT:一种内存需求低的快速剪接比对器。
Nat Methods. 2015 Apr;12(4):357-60. doi: 10.1038/nmeth.3317. Epub 2015 Mar 9.
3
Evaluation of alignment algorithms for discovery and identification of pathogens using RNA-Seq.使用RNA测序评估用于发现和鉴定病原体的比对算法。
编辑组疾病知识库v2.0:通过文献编目和综合分析更新的编辑组-疾病关联资源。
Bioinform Adv. 2025 Jan 25;5(1):vbaf012. doi: 10.1093/bioadv/vbaf012. eCollection 2025.
4
Enhancer Dynamics and Spatial Organization Drive Anatomically Restricted Cellular States in the Human Spinal Cord.增强子动力学和空间组织驱动人类脊髓中解剖学上受限的细胞状态。
bioRxiv. 2025 Jan 11:2025.01.10.632483. doi: 10.1101/2025.01.10.632483.
5
Adaptation in human immune cells residing in tissues at the frontline of infections.驻留在感染前线组织中的人类免疫细胞的适应性。
Nat Commun. 2024 Nov 28;15(1):10329. doi: 10.1038/s41467-024-54603-5.
6
Dissecting the invasion history of Spotted-Wing Drosophila (Drosophila suzukii) in Portugal using genomic data.利用基因组数据剖析斑翅果蝇(Drosophila suzukii)在葡萄牙的入侵历史。
BMC Genomics. 2024 Aug 29;25(1):813. doi: 10.1186/s12864-024-10739-8.
7
PxBLAT: an efficient python binding library for BLAT.PxBLAT:BLAT 的高效 Python 绑定库。
BMC Bioinformatics. 2024 Jun 19;25(1):219. doi: 10.1186/s12859-024-05844-0.
8
Lessons learned: overcoming common challenges in reconstructing the SARS-CoV-2 genome from short-read sequencing data via CoVpipe2.经验教训:通过CoVpipe2从短读长测序数据重建严重急性呼吸综合征冠状病毒2(SARS-CoV-2)基因组时克服常见挑战。
F1000Res. 2024 Apr 16;12:1091. doi: 10.12688/f1000research.136683.1. eCollection 2023.
9
Discovery of a polymorphic gene fusion via bottom-up chimeric RNA prediction.通过自下而上的嵌合 RNA 预测发现多态性基因融合。
Nucleic Acids Res. 2024 May 8;52(8):4409-4421. doi: 10.1093/nar/gkae258.
10
Chromosome-level genome assembly of milk thistle (Silybum marianum (L.) Gaertn.).奶蓟(水飞蓟(Silybum marianum(L.)Gaertn.))染色体水平基因组组装。
Sci Data. 2024 Apr 5;11(1):342. doi: 10.1038/s41597-024-03178-3.
PLoS One. 2013 Oct 30;8(10):e76935. doi: 10.1371/journal.pone.0076935. eCollection 2013.
4
STAR: ultrafast universal RNA-seq aligner.STAR:超快通用 RNA-seq 对齐工具。
Bioinformatics. 2013 Jan 1;29(1):15-21. doi: 10.1093/bioinformatics/bts635. Epub 2012 Oct 25.
5
Tools for mapping high-throughput sequencing data.高通量测序数据映射工具。
Bioinformatics. 2012 Dec 15;28(24):3169-77. doi: 10.1093/bioinformatics/bts605. Epub 2012 Oct 11.
6
GENCODE: the reference human genome annotation for The ENCODE Project.GENCODE:ENCODE 项目的人类参考基因组注释。
Genome Res. 2012 Sep;22(9):1760-74. doi: 10.1101/gr.135350.111.
7
A beginner's guide to eukaryotic genome annotation.真核生物基因组注释入门指南。
Nat Rev Genet. 2012 Apr 18;13(5):329-42. doi: 10.1038/nrg3174.
8
Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM).RNA-Seq 比对算法与 RNA-Seq 统一映射器(RUM)的比较分析。
Bioinformatics. 2011 Sep 15;27(18):2518-28. doi: 10.1093/bioinformatics/btr427. Epub 2011 Jul 19.
9
Fast and accurate short read alignment with Burrows-Wheeler transform.使用Burrows-Wheeler变换进行快速准确的短读比对。
Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18.
10
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.短DNA序列与人类基因组的超快速且内存高效比对。
Genome Biol. 2009;10(3):R25. doi: 10.1186/gb-2009-10-3-r25. Epub 2009 Mar 4.