An Event-Driven Approach to Genotype Imputation on a Custom RISC-V Cluster.

Publication information

IEEE/ACM Trans Comput Biol Bioinform. 2024 Jan-Feb;21(1):26-35. doi: 10.1109/TCBB.2023.3328714. Epub 2024 Feb 5.

Abstract

This article proposes an event-driven solution to genotype imputation, a technique used to statistically infer missing genetic markers in DNA. The work implements the widely accepted Li and Stephens model, the primary contributor to the computational complexity of modern x86 solutions, to determine whether further investigation of the application is warranted in the event-driven domain. The model is implemented using graph-based Hidden Markov Modeling and executed as a customized forward/backward dynamic programming algorithm. The solution uses an event-driven paradigm to map the algorithm to thousands of concurrent cores, where events are small messages that carry both control and data within the algorithm. The design of a single processing element is discussed. This is then extended across multiple cores and executed on a custom RISC-V NoC cluster called POETS. Results demonstrate how the algorithm scales with increasing hardware resources, and a multi-core run demonstrates a 270× reduction in wall-clock processing time compared with a single-threaded x86 solution. Optimisation of the algorithm via linear interpolation is then introduced and tested, with results demonstrating a reduction in wall-clock time of ∼5 orders of magnitude compared with a similarly optimised x86 solution.
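For readers unfamiliar with the forward/backward computation the abstract refers to, the following is a minimal sequential sketch of haploid Li and Stephens imputation. It is not the authors' event-driven POETS implementation. The function name `impute`, the uniform switch probability `rho`, the mismatch probability `eps`, and the toy panel are all assumptions made for illustration only.

```python
def impute(panel, target, rho=0.05, eps=0.01):
    """Return P(allele == 1) at every marker of `target`.

    panel  : list of K reference haplotypes, each a list of M alleles (0/1)
    target : list of M alleles (0/1), with None where the marker is missing
    rho    : assumed per-marker probability of switching the copied haplotype
    eps    : assumed probability that an observed allele mismatches the copy
    """
    K, M = len(panel), len(panel[0])

    def emit(k, m):
        # Missing markers carry no information, so every state emits 1.
        if target[m] is None:
            return 1.0
        return 1.0 - eps if panel[k][m] == target[m] else eps

    def normalise(v):
        s = sum(v)
        return [x / s for x in v]

    # Forward pass: fwd[m][k] is proportional to P(obs[0..m], state_m = k).
    fwd = [normalise([emit(k, 0) / K for k in range(K)])]
    for m in range(1, M):
        prev = fwd[-1]
        total = sum(prev)
        fwd.append(normalise([
            emit(k, m) * ((1 - rho) * prev[k] + rho * total / K)
            for k in range(K)
        ]))

    # Backward pass: bwd[m][k] is proportional to P(obs[m+1..] | state_m = k).
    bwd = [[1.0] * K for _ in range(M)]
    for m in range(M - 2, -1, -1):
        w = [emit(k, m + 1) * bwd[m + 1][k] for k in range(K)]
        shared = rho * sum(w) / K
        bwd[m] = normalise([(1 - rho) * w[k] + shared for k in range(K)])

    # Posterior over copied haplotypes, collapsed to an allele-1 probability.
    dosages = []
    for m in range(M):
        post = normalise([fwd[m][k] * bwd[m][k] for k in range(K)])
        dosages.append(sum(p for k, p in enumerate(post) if panel[k][m] == 1))
    return dosages


# Toy example: three reference haplotypes, target missing its second marker.
panel = [[0, 0, 0, 0], [1, 1, 1, 1], [0, 1, 0, 1]]
target = [1, None, 1, 1]
dosages = impute(panel, target)
```

Each `fwd[m][k]` and `bwd[m][k]` depends only on values from the neighbouring marker, which is the kind of local dependency that a message-passing design could exploit. How the paper actually partitions this work across cores is not stated in the abstract.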

Similar articles

Evaluation of vicinity-based hidden Markov models for genotype imputation.

BMC Bioinformatics. 2022 Aug 29;23(1):356. doi: 10.1186/s12859-022-04896-4.

zipHMMlib: a highly optimised HMM library exploiting repetitions in the input to speed up the forward algorithm.

BMC Bioinformatics. 2013 Nov 22;14:339. doi: 10.1186/1471-2105-14-339.

FISH: fast and accurate diploid genotype imputation via segmental hidden Markov model.

Bioinformatics. 2014 Jul 1;30(13):1876-83. doi: 10.1093/bioinformatics/btu143. Epub 2014 Mar 10.

An average-case sublinear forward algorithm for the haploid Li and Stephens model.

Algorithms Mol Biol. 2019 Apr 2;14:11. doi: 10.1186/s13015-019-0144-9. eCollection 2019.

A Parallel Architecture for the Partitioning Around Medoids (PAM) Algorithm for Scalable Multi-Core Processor Implementation with Applications in Healthcare.

Sensors (Basel). 2018 Nov 25;18(12):4129. doi: 10.3390/s18124129.

Two-stage strategy using denoising autoencoders for robust reference-free genotype imputation with missing input genotypes.

J Hum Genet. 2024 Oct;69(10):511-518. doi: 10.1038/s10038-024-01261-6. Epub 2024 Jun 25.

Minimal positional substring cover is a haplotype threading alternative to Li and Stephens model.

Genome Res. 2023 Jul;33(7):1007-1014. doi: 10.1101/gr.277673.123. Epub 2023 Jun 14.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Evaluating Imputation Algorithms for Low-Depth Genotyping-By-Sequencing (GBS) Data.

PLoS One. 2016 Aug 18;11(8):e0160733. doi: 10.1371/journal.pone.0160733. eCollection 2016.
