高性能转录因子-DNA 对接与 GPU 计算。

High performance transcription factor-DNA docking with GPU computing.

机构信息

School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, Georgia, 30332, USA.

出版信息

Proteome Sci. 2012 Jun 21;10 Suppl 1(Suppl 1):S17. doi: 10.1186/1477-5956-10-S1-S17.

DOI:10.1186/1477-5956-10-S1-S17

PMID:22759575

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3380734/

Abstract

BACKGROUND

Protein-DNA docking is a very challenging problem in structural bioinformatics and has important implications in a number of applications, such as structure-based prediction of transcription factor binding sites and rational drug design. Protein-DNA docking is very computational demanding due to the high cost of energy calculation and the statistical nature of conformational sampling algorithms. More importantly, experiments show that the docking quality depends on the coverage of the conformational sampling space. It is therefore desirable to accelerate the computation of the docking algorithm, not only to reduce computing time, but also to improve docking quality.

METHODS

In an attempt to accelerate the sampling process and to improve the docking performance, we developed a graphics processing unit (GPU)-based protein-DNA docking algorithm. The algorithm employs a potential-based energy function to describe the binding affinity of a protein-DNA pair, and integrates Monte-Carlo simulation and a simulated annealing method to search through the conformational space. Algorithmic techniques were developed to improve the computation efficiency and scalability on GPU-based high performance computing systems.

RESULTS

The effectiveness of our approach is tested on a non-redundant set of 75 TF-DNA complexes and a newly developed TF-DNA docking benchmark. We demonstrated that the GPU-based docking algorithm can significantly accelerate the simulation process and thereby improving the chance of finding near-native TF-DNA complex structures. This study also suggests that further improvement in protein-DNA docking research would require efforts from two integral aspects: improvement in computation efficiency and energy function design.

CONCLUSIONS

We present a high performance computing approach for improving the prediction accuracy of protein-DNA docking. The GPU-based docking algorithm accelerates the search of the conformational space and thus increases the chance of finding more near-native structures. To the best of our knowledge, this is the first ad hoc effort of applying GPU or GPU clusters to the protein-DNA docking problem.

摘要

背景

蛋白质与 DNA 的对接是结构生物信息学中一个极具挑战性的问题，在许多应用中都具有重要意义，如基于结构的转录因子结合位点预测和合理药物设计。由于能量计算成本高和构象采样算法的统计性质，蛋白质与 DNA 的对接计算量非常大。更重要的是，实验表明对接质量取决于构象采样空间的覆盖范围。因此，不仅需要减少计算时间，还需要提高对接质量，从而加速对接算法的计算。

方法

为了加速采样过程并提高对接性能，我们开发了一种基于图形处理单元 (GPU) 的蛋白质与 DNA 对接算法。该算法采用基于势能的能量函数来描述蛋白质与 DNA 对的结合亲和力，并集成了蒙特卡罗模拟和模拟退火方法来搜索构象空间。开发了算法技术来提高基于 GPU 的高性能计算系统上的计算效率和可扩展性。

结果

我们的方法在一组 75 个非冗余 TF-DNA 复合物和新开发的 TF-DNA 对接基准上进行了有效性测试。我们证明了基于 GPU 的对接算法可以显著加速模拟过程，从而增加找到接近天然 TF-DNA 复合物结构的机会。这项研究还表明，要进一步提高蛋白质与 DNA 对接研究的准确性，需要从两个整体方面努力：提高计算效率和能量函数设计。

结论

我们提出了一种提高蛋白质与 DNA 对接预测准确性的高性能计算方法。基于 GPU 的对接算法加速了构象空间的搜索，从而增加了找到更多接近天然结构的机会。据我们所知，这是首次专门将 GPU 或 GPU 集群应用于蛋白质与 DNA 对接问题的研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/362c/3380734/72d717039367/1477-5956-10-S1-S17-1.jpg

相似文献

High performance transcription factor-DNA docking with GPU computing.

Proteome Sci. 2012 Jun 21;10 Suppl 1(Suppl 1):S17. doi: 10.1186/1477-5956-10-S1-S17.

A nonvoxel-based dose convolution/superposition algorithm optimized for scalable GPU architectures.

Med Phys. 2014 Oct;41(10):101711. doi: 10.1118/1.4895822.

A new open-source GPU-based microscopic Monte Carlo simulation tool for the calculations of DNA damages caused by ionizing radiation --- Part I: Core algorithm and validation.

Med Phys. 2020 Apr;47(4):1958-1970. doi: 10.1002/mp.14037. Epub 2020 Feb 14.

A knowledge-based orientation potential for transcription factor-DNA docking.

Bioinformatics. 2013 Feb 1;29(3):322-30. doi: 10.1093/bioinformatics/bts699. Epub 2012 Dec 5.

Parallel beamlet dose calculation via beamlet contexts in a distributed multi-GPU framework.

Med Phys. 2019 Aug;46(8):3719-3733. doi: 10.1002/mp.13651. Epub 2019 Jun 30.

GeauxDock: Accelerating Structure-Based Virtual Screening with Heterogeneous Computing.

PLoS One. 2016 Jul 15;11(7):e0158898. doi: 10.1371/journal.pone.0158898. eCollection 2016.

CPU-GPU hybrid accelerating the Zuker algorithm for RNA secondary structure prediction applications.

BMC Genomics. 2012;13 Suppl 1(Suppl 1):S14. doi: 10.1186/1471-2164-13-S1-S14. Epub 2012 Jan 17.

Benchmarks for flexible and rigid transcription factor-DNA docking.

BMC Struct Biol. 2011 Nov 1;11:45. doi: 10.1186/1472-6807-11-45.

GPU-Accelerated Flexible Molecular Docking.

J Phys Chem B. 2021 Feb 4;125(4):1049-1060. doi: 10.1021/acs.jpcb.0c09051. Epub 2021 Jan 26.

Protein docking by Rotation-Based Uniform Sampling (RotBUS) with fast computing of intermolecular contact distance and residue desolvation.

BMC Bioinformatics. 2010 Jun 28;11:352. doi: 10.1186/1471-2105-11-352.

引用本文的文献

DNA binding and transposition activity of the Sleeping Beauty transposase: role of structural stability of the primary DNA-binding domain.

Nucleic Acids Res. 2025 Jan 11;53(2). doi: 10.1093/nar/gkae1188.

An SVM-based method for assessment of transcription factor-DNA complex models.

BMC Bioinformatics. 2018 Dec 21;19(Suppl 20):506. doi: 10.1186/s12859-018-2538-y.

Aurora A is a prognostic marker for breast cancer arising in BRCA2 mutation carriers.

J Pathol Clin Res. 2014 Nov 7;1(1):33-40. doi: 10.1002/cjp2.6. eCollection 2015 Jan.

Stochastic simulation of notch signaling reveals novel factors that mediate the differentiation of neural stem cells.

J Comput Biol. 2014 Jul;21(7):548-67. doi: 10.1089/cmb.2014.0022. Epub 2014 May 5.

本文引用的文献

Benchmarks for flexible and rigid transcription factor-DNA docking.

BMC Struct Biol. 2011 Nov 1;11:45. doi: 10.1186/1472-6807-11-45.

Targeting Sp1 transcription factors in prostate cancer therapy.

Med Chem. 2011 Sep;7(5):518-25. doi: 10.2174/157340611796799203.

Parallel implementation of DNA sequences matching algorithms using PWM on GPU architecture.

Int J Bioinform Res Appl. 2011;7(2):202-15. doi: 10.1504/IJBRA.2011.040097.

Targeting transcription factor Stat5a/b as a therapeutic strategy for prostate cancer.

Am J Transl Res. 2011 Feb;3(2):133-8. Epub 2010 Nov 21.

GBOOST: a GPU-based tool for detecting gene-gene interactions in genome-wide case control studies.

Bioinformatics. 2011 May 1;27(9):1309-10. doi: 10.1093/bioinformatics/btr114. Epub 2011 Mar 3.

GPU accelerated biochemical network simulation.

Bioinformatics. 2011 Mar 15;27(6):874-6. doi: 10.1093/bioinformatics/btr015. Epub 2011 Jan 11.

Ultra-fast FFT protein docking on graphics processors.

Bioinformatics. 2010 Oct 1;26(19):2398-405. doi: 10.1093/bioinformatics/btq444. Epub 2010 Aug 4.

An adaptive Expectation-Maximization algorithm with GPU implementation for electron cryomicroscopy.

J Struct Biol. 2010 Sep;171(3):256-65. doi: 10.1016/j.jsb.2010.06.004. Epub 2010 Jun 9.

GPU computing for systems biology.

Brief Bioinform. 2010 May;11(3):323-33. doi: 10.1093/bib/bbq006. Epub 2010 Mar 7.

Mechanisms of transcription factor selectivity.

Trends Genet. 2010 Feb;26(2):75-83. doi: 10.1016/j.tig.2009.12.003. Epub 2010 Jan 13.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

高性能转录因子-DNA 对接与 GPU 计算。

High performance transcription factor-DNA docking with GPU computing.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献