• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 GOR 算法的蛋白质二级结构预测 FPGA 加速器。

FPGA accelerator for protein secondary structure prediction based on the GOR algorithm.

机构信息

National Laboratory for Parallel&Distributed Processing, Department of Computer Science, National University of Defense Technology, ChangSha, 410073, China.

出版信息

BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S5. doi: 10.1186/1471-2105-12-S1-S5.

DOI:10.1186/1471-2105-12-S1-S5
PMID:21342582
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3044307/
Abstract

BACKGROUND

Protein is an important molecule that performs a wide range of functions in biological systems. Recently, the protein folding attracts much more attention since the function of protein can be generally derived from its molecular structure. The GOR algorithm is one of the most successful computational methods and has been widely used as an efficient analysis tool to predict secondary structure from protein sequence. However, the execution time is still intolerable with the steep growth in protein database. Recently, FPGA chips have emerged as one promising application accelerator to accelerate bioinformatics algorithms by exploiting fine-grained custom design.

RESULTS

In this paper, we propose a complete fine-grained parallel hardware implementation on FPGA to accelerate the GOR-IV package for 2D protein structure prediction. To improve computing efficiency, we partition the parameter table into small segments and access them in parallel. We aggressively exploit data reuse schemes to minimize the need for loading data from external memory. The whole computation structure is carefully pipelined to overlap the sequence loading, computing and back-writing operations as much as possible. We implemented a complete GOR desktop system based on an FPGA chip XC5VLX330.

CONCLUSIONS

The experimental results show a speedup factor of more than 430x over the original GOR-IV version and 110x speedup over the optimized version with multi-thread SIMD implementation running on a PC platform with AMD Phenom 9650 Quad CPU for 2D protein structure prediction. However, the power consumption is only about 30% of that of current general-propose CPUs.

摘要

背景

蛋白质是一种重要的分子,在生物系统中执行广泛的功能。最近,蛋白质折叠引起了更多的关注,因为蛋白质的功能通常可以从其分子结构中推导出来。GOR 算法是最成功的计算方法之一,已被广泛用作从蛋白质序列预测二级结构的有效分析工具。然而,执行时间仍然难以忍受蛋白质数据库的急剧增长。最近,FPGA 芯片作为一种有前途的应用加速器出现,通过利用细粒度的定制设计来加速生物信息学算法。

结果

在本文中,我们提出了一种完整的细粒度并行硬件实现,用于在 FPGA 上加速 2D 蛋白质结构预测的 GOR-IV 包。为了提高计算效率,我们将参数表划分为小片段并并行访问它们。我们积极利用数据重用方案,以最大程度地减少从外部存储器加载数据的需求。整个计算结构被精心流水线化,以尽可能多地重叠序列加载、计算和回写操作。我们在 FPGA 芯片 XC5VLX330 上实现了一个完整的 GOR 桌面系统。

结论

实验结果表明,与原始的 GOR-IV 版本相比,速度提高了 430 多倍,与在 AMD Phenom 9650 Quad CPU 上运行的具有多线程 SIMD 实现的 PC 平台上的优化版本相比,速度提高了 110 倍,用于 2D 蛋白质结构预测。然而,功耗仅为当前通用 CPU 的约 30%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/796b/3044307/b494287f31e7/1471-2105-12-S1-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/796b/3044307/9cd1890f7d02/1471-2105-12-S1-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/796b/3044307/e58e5435e8c7/1471-2105-12-S1-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/796b/3044307/b494287f31e7/1471-2105-12-S1-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/796b/3044307/9cd1890f7d02/1471-2105-12-S1-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/796b/3044307/e58e5435e8c7/1471-2105-12-S1-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/796b/3044307/b494287f31e7/1471-2105-12-S1-S5-3.jpg

相似文献

1
FPGA accelerator for protein secondary structure prediction based on the GOR algorithm.基于 GOR 算法的蛋白质二级结构预测 FPGA 加速器。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S5. doi: 10.1186/1471-2105-12-S1-S5.
2
Fine-grained parallel RNAalifold algorithm for RNA secondary structure prediction on FPGA.用于在现场可编程门阵列(FPGA)上进行RNA二级结构预测的细粒度并行RNAalifold算法
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S37. doi: 10.1186/1471-2105-10-S1-S37.
3
Fine-grained parallelism accelerating for RNA secondary structure prediction with pseudoknots based on FPGA.基于现场可编程门阵列(FPGA)的带假结RNA二级结构预测的细粒度并行加速
J Bioinform Comput Biol. 2014 Jun;12(3):1450008. doi: 10.1142/S0219720014500085. Epub 2014 Jan 21.
4
The GOR Method of Protein Secondary Structure Prediction and Its Application as a Protein Aggregation Prediction Tool.蛋白质二级结构预测的GOR方法及其作为蛋白质聚集预测工具的应用。
Methods Mol Biol. 2017;1484:7-24. doi: 10.1007/978-1-4939-6406-2_2.
5
CPU-GPU hybrid accelerating the Zuker algorithm for RNA secondary structure prediction applications.CPU-GPU 混合加速 Zuker 算法在 RNA 二级结构预测中的应用。
BMC Genomics. 2012;13 Suppl 1(Suppl 1):S14. doi: 10.1186/1471-2164-13-S1-S14. Epub 2012 Jan 17.
6
FPGASW: Accelerating Large-Scale Smith-Waterman Sequence Alignment Application with Backtracking on FPGA Linear Systolic Array.FPGA 软核:在 FPGA 线性脉动阵列上回溯实现大规模 Smith-Waterman 序列比对应用的加速。
Interdiscip Sci. 2018 Mar;10(1):176-188. doi: 10.1007/s12539-017-0225-8. Epub 2017 Apr 21.
7
GenSeq+: A Scalable High-Performance Accelerator for Genome Sequencing.GenSeq+:一种用于基因组测序的可扩展高性能加速器
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jul-Aug;18(4):1512-1523. doi: 10.1109/TCBB.2019.2947059. Epub 2021 Aug 6.
8
Designing Deep Learning Hardware Accelerator and Efficiency Evaluation.深度学习硬件加速器设计与效率评估。
Comput Intell Neurosci. 2022 Jul 13;2022:1291103. doi: 10.1155/2022/1291103. eCollection 2022.
9
Hardware Acceleration of the STRIKE String Kernel Algorithm for Estimating Protein to Protein Interactions.硬件加速 STRIKE 字符串核算法估算蛋白质相互作用。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2272-2283. doi: 10.1109/TCBB.2021.3066591. Epub 2022 Aug 8.
10
Where the Name "GOR" Originates: A Story.“GOR”这个名称的起源:一个故事。
Methods Mol Biol. 2017;1484:1-5. doi: 10.1007/978-1-4939-6406-2_1.

引用本文的文献

1
Structure Prediction and Expression of Modified rCTLA4-Ig as a Blocker for B7 Molecules.作为B7分子阻断剂的修饰型rCTLA4-Ig的结构预测与表达
Iran J Pharm Res. 2020 Summer;19(3):329-348. doi: 10.22037/ijpr.2020.112959.14040.
2
Generation of monoclonal antibodies against MGA and comparison of their application in breast cancer detection by immunohistochemistry.抗MGA单克隆抗体的产生及其在乳腺癌免疫组织化学检测中的应用比较
Sci Rep. 2015 Aug 14;5:13073. doi: 10.1038/srep13073.

本文引用的文献

1
Bhageerath: an energy based web enabled computer software suite for limiting the search space of tertiary structures of small globular proteins.巴吉拉特:一种基于能量的支持网络的计算机软件套件,用于限制小型球状蛋白质三级结构的搜索空间。
Nucleic Acids Res. 2006;34(21):6195-204. doi: 10.1093/nar/gkl789. Epub 2006 Nov 7.
2
ClustalW-MPI: ClustalW analysis using distributed and parallel computing.ClustalW-MPI:使用分布式和并行计算的ClustalW分析。
Bioinformatics. 2003 Aug 12;19(12):1585-6. doi: 10.1093/bioinformatics/btg192.
3
Application of multiple sequence alignment profiles to improve protein secondary structure prediction.
应用多序列比对轮廓来改进蛋白质二级结构预测。
Proteins. 2000 Aug 15;40(3):502-11. doi: 10.1002/1097-0134(20000815)40:3<502::aid-prot170>3.0.co;2-q.
4
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.空位BLAST和位置特异性迭代BLAST:新一代蛋白质数据库搜索程序。
Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389.
5
GOR method for predicting protein secondary structure from amino acid sequence.用于从氨基酸序列预测蛋白质二级结构的GOR方法。
Methods Enzymol. 1996;266:540-53. doi: 10.1016/s0076-6879(96)66034-0.
6
An evolutionary approach to folding small alpha-helical proteins that uses sequence information and an empirical guiding fitness function.一种利用序列信息和经验指导适应度函数折叠小型α-螺旋蛋白的进化方法。
Proc Natl Acad Sci U S A. 1994 May 10;91(10):4436-40. doi: 10.1073/pnas.91.10.4436.
7
Principles of protein folding--a perspective from simple exact models.蛋白质折叠原理——来自简单精确模型的视角
Protein Sci. 1995 Apr;4(4):561-602. doi: 10.1002/pro.5560040401.
8
Prediction of protein conformation.蛋白质构象预测
Biochemistry. 1974 Jan 15;13(2):222-45. doi: 10.1021/bi00699a002.
9
Secondary structure prediction: combination of three different methods.
Protein Eng. 1988 Sep;2(3):185-91. doi: 10.1093/protein/2.3.185.
10
Machine learning approach for the prediction of protein secondary structure.用于预测蛋白质二级结构的机器学习方法。
J Mol Biol. 1990 Nov 20;216(2):441-57. doi: 10.1016/S0022-2836(05)80333-X.