用于计算肽段鉴定交叉相关得分的通信回避型微架构。

Communication-avoiding micro-architecture to compute Xcorr scores for peptide identification.

作者信息

Kumar Sumesh, Saeed Fahad

机构信息

Knight Foundation School of Computing and Information Sciences, Florida International University (FIU), Miami, FL USA 33199.

出版信息

Int Conf Field Program Log Appl. 2021 Aug-Sep;2021:99-103. doi: 10.1109/fpl53798.2021.00024. Epub 2021 Oct 12.

DOI:10.1109/fpl53798.2021.00024

PMID:35440952

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9015013/

Abstract

Database algorithms play a crucial part in systems biology studies by identifying proteins from mass spectrometry data. Many of these database search algorithms incur huge computational costs by computing similarity scores for each pair of sparse experimental spectrum and candidate theoretical spectrum vectors. Modern MS instrumentation techniques which are capable of generating high-resolution spectrometry data require comparison against an enormous search space, further emphasizing the need of efficient accelerators. Recent research has shown that the overall cost of scoring, and deducing peptides is dominated by the communication costs between different hierarchies of memory and processing units. However, these communication costs are seldom considered in accelerator-based architectures leading to inefficient DRAM accesses, and poor data-utilization due to irregular memory access patterns. In this paper, we propose a novel communication-avoiding micro-architecture to compute cross-correlation based similarity score by utilizing efficient local cache, and peptide pre-fetching to minimize DRAM accesses, and a custom-designed peptide broadcast bus to allow input reuse. An efficient bus arbitration scheme was designed, and implemented to minimize synchronization cost and exploit parallelism of processing elements. Our simulation results show that the proposed micro-architecture performs on average 24x better than a CPU implementation running on a 3.6 GHz Intel i7-4970 processor with 16GB memory.

摘要

数据库算法在系统生物学研究中发挥着关键作用，通过从质谱数据中识别蛋白质。许多此类数据库搜索算法通过计算每对稀疏实验光谱和候选理论光谱向量的相似性得分，产生了巨大的计算成本。能够生成高分辨率光谱数据的现代质谱仪器技术需要与巨大的搜索空间进行比较，这进一步凸显了高效加速器的必要性。最近的研究表明，评分和推导肽段的总体成本主要由不同层次的内存和处理单元之间的通信成本决定。然而，基于加速器的架构很少考虑这些通信成本，导致DRAM访问效率低下，以及由于不规则内存访问模式而导致的数据利用率低下。在本文中，我们提出了一种新颖的避免通信的微架构，通过利用高效的本地缓存来计算基于互相关的相似性得分，并进行肽段预取以最小化DRAM访问，以及定制设计的肽段广播总线以允许输入重用。设计并实现了一种高效的总线仲裁方案，以最小化同步成本并利用处理元件的并行性。我们的模拟结果表明，所提出的微架构平均性能比在具有16GB内存的3.6GHz英特尔i7-4970处理器上运行的CPU实现高出24倍。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c457/9015013/77e9212780f8/nihms-1794432-f0001.jpg

相似文献

Communication-avoiding micro-architecture to compute Xcorr scores for peptide identification.用于计算肽段鉴定交叉相关得分的通信回避型微架构。

Int Conf Field Program Log Appl. 2021 Aug-Sep;2021:99-103. doi: 10.1109/fpl53798.2021.00024. Epub 2021 Oct 12.

ProLuCID: An improved SEQUEST-like algorithm with enhanced sensitivity and specificity.ProLuCID：一种具有更高灵敏度和特异性的类似SEQUEST的改进算法。

J Proteomics. 2015 Nov 3;129:16-24. doi: 10.1016/j.jprot.2015.07.001. Epub 2015 Jul 11.

MCtandem: an efficient tool for large-scale peptide identification on many integrated core (MIC) architecture.MCtandem：一种在许多集成核心 (MIC) 架构上进行大规模肽鉴定的高效工具。

BMC Bioinformatics. 2019 Jul 17;20(1):397. doi: 10.1186/s12859-019-2980-5.

Tempest: GPU-CPU computing for high-throughput database spectral matching.Tempest：用于高通量数据库光谱匹配的 GPU-CPU 计算。

J Proteome Res. 2012 Jul 6;11(7):3581-91. doi: 10.1021/pr300338p. Epub 2012 Jun 8.

Optimizing performance of GATK workflows using Apache Arrow In-Memory data framework.使用 Apache Arrow 内存数据框架优化 GATK 工作流程的性能。

BMC Genomics. 2020 Nov 18;21(Suppl 10):683. doi: 10.1186/s12864-020-07013-y.

Towards a HPC-oriented parallel implementation of a learning algorithm for bioinformatics applications.面向高性能计算的生物信息学应用学习算法并行实现

BMC Bioinformatics. 2014;15 Suppl 5(Suppl 5):S2. doi: 10.1186/1471-2105-15-S5-S2. Epub 2014 May 6.

HiXCorr: a portable high-speed XCorr engine for high-resolution tandem mass spectrometry.HiXCorr：用于高分辨率串联质谱的便携式高速 XCorr 引擎。

Bioinformatics. 2015 Dec 15;31(24):4026-8. doi: 10.1093/bioinformatics/btv490. Epub 2015 Aug 26.

Learning score function parameters for improved spectrum identification in tandem mass spectrometry experiments.学习串联质谱实验中谱图识别的评分函数参数。

J Proteome Res. 2012 Sep 7;11(9):4499-508. doi: 10.1021/pr300234m. Epub 2012 Aug 15.

In-DRAM Cache Management for Low Latency and Low Power 3D-Stacked DRAMs.用于低延迟和低功耗3D堆叠DRAM的片上动态随机存取存储器缓存管理

Micromachines (Basel). 2019 Feb 14;10(2):124. doi: 10.3390/mi10020124.

Accelerating a cross-correlation score function to search modifications using a single GPU.使用单个 GPU 加速互相关评分函数以搜索修饰。

BMC Bioinformatics. 2018 Dec 12;19(1):480. doi: 10.1186/s12859-018-2559-6.

本文引用的文献

Communication Lower-Bounds for Distributed-Memory Computations for Mass Spectrometry based Omics Data.基于质谱的组学数据的分布式内存计算的通信下限

J Parallel Distrib Comput. 2022 Mar;161:37-47. doi: 10.1016/j.jpdc.2021.11.001. Epub 2021 Nov 17.

SW-Tandem: a highly efficient tool for large-scale peptide identification with parallel spectrum dot product on Sunway TaihuLight.SW-Tandem：在神威·太湖之光上通过并行谱点积进行大规模肽段鉴定的高效工具。

Bioinformatics. 2019 Oct 1;35(19):3861-3863. doi: 10.1093/bioinformatics/btz147.

MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics.MSFragger：基于质谱的蛋白质组学中实现超快速且全面的肽段鉴定

Nat Methods. 2017 May;14(5):513-520. doi: 10.1038/nmeth.4256. Epub 2017 Apr 10.

Crux: rapid open source protein tandem mass spectrometry analysis.关键：快速开源蛋白质串联质谱分析

J Proteome Res. 2014 Oct 3;13(10):4488-91. doi: 10.1021/pr500741y. Epub 2014 Sep 9.

Accelerating the scoring module of mass spectrometry-based peptide identification using GPUs.利用 GPU 加速基于质谱的肽鉴定的打分模块。

BMC Bioinformatics. 2014 Apr 28;15:121. doi: 10.1186/1471-2105-15-121.

An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.一种将肽的串联质谱数据与蛋白质数据库中氨基酸序列相关联的方法。

J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.

Tempest: GPU-CPU computing for high-throughput database spectral matching.Tempest：用于高通量数据库光谱匹配的 GPU-CPU 计算。

J Proteome Res. 2012 Jul 6;11(7):3581-91. doi: 10.1021/pr300338p. Epub 2012 Jun 8.

Fast parallel tandem mass spectral library searching using GPU hardware acceleration.利用 GPU 硬件加速进行快速并行串联质谱文库搜索。

J Proteome Res. 2011 Jun 3;10(6):2882-8. doi: 10.1021/pr200074h. Epub 2011 May 5.

An efficient parallelization of phosphorylated peptide and protein identification.高效的磷酸化肽和蛋白质鉴定并行化方法。

Rapid Commun Mass Spectrom. 2010 Jun 30;24(12):1791-8. doi: 10.1002/rcm.4578.

A fast SEQUEST cross correlation algorithm.一种快速的SEQUEST互相关算法。

J Proteome Res. 2008 Oct;7(10):4598-602. doi: 10.1021/pr800420s. Epub 2008 Sep 6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验