• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GPURFSCREEN:一种基于图形处理器(GPU)并使用随机森林分类器的虚拟筛选工具。

GPURFSCREEN: a GPU based virtual screening tool using random forest classifier.

作者信息

Jayaraj P B, Ajay Mathias K, Nufail M, Gopakumar G, Jaleel U C A

机构信息

Department of Computer Science and Engineering, National Institute of Technology Calicut, NITC Campus, Calicut, Kerala 673601 India.

Center for Cheminformatics, Open Source Pharma, No. 22, World Trade Centre, Malleswaram, Bengaluru, Karnataka 560055 India.

出版信息

J Cheminform. 2016 Mar 1;8:12. doi: 10.1186/s13321-016-0124-8. eCollection 2016.

DOI:10.1186/s13321-016-0124-8
PMID:26933453
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4772510/
Abstract

BACKGROUND

In-silico methods are an integral part of modern drug discovery paradigm. Virtual screening, an in-silico method, is used to refine data models and reduce the chemical space on which wet lab experiments need to be performed. Virtual screening of a ligand data model requires large scale computations, making it a highly time consuming task. This process can be speeded up by implementing parallelized algorithms on a Graphical Processing Unit (GPU).

RESULTS

Random Forest is a robust classification algorithm that can be employed in the virtual screening. A ligand based virtual screening tool (GPURFSCREEN) that uses random forests on GPU systems has been proposed and evaluated in this paper. This tool produces optimized results at a lower execution time for large bioassay data sets. The quality of results produced by our tool on GPU is same as that on a regular serial environment.

CONCLUSION

Considering the magnitude of data to be screened, the parallelized virtual screening has a significantly lower running time at high throughput. The proposed parallel tool outperforms its serial counterpart by successfully screening billions of molecules in training and prediction phases.

摘要

背景

计算机模拟方法是现代药物发现范式的一个组成部分。虚拟筛选作为一种计算机模拟方法,用于优化数据模型并缩小需要进行湿实验室实验的化学空间。对配体数据模型进行虚拟筛选需要大规模计算,这使其成为一项耗时极长的任务。通过在图形处理单元(GPU)上实现并行算法,可以加快这一过程。

结果

随机森林是一种强大的分类算法,可用于虚拟筛选。本文提出并评估了一种在GPU系统上使用随机森林的基于配体的虚拟筛选工具(GPURFSCREEN)。该工具在较低的执行时间内为大型生物测定数据集产生了优化结果。我们的工具在GPU上产生的结果质量与在常规串行环境中相同。

结论

考虑到要筛选的数据量,并行化虚拟筛选在高吞吐量下的运行时间显著更低。所提出的并行工具在训练和预测阶段成功筛选了数十亿个分子,其性能优于串行工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3c/4772510/b5cda65d242d/13321_2016_124_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3c/4772510/f194e98a889d/13321_2016_124_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3c/4772510/b5cda65d242d/13321_2016_124_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3c/4772510/f194e98a889d/13321_2016_124_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d3c/4772510/b5cda65d242d/13321_2016_124_Fig2_HTML.jpg

相似文献

1
GPURFSCREEN: a GPU based virtual screening tool using random forest classifier.GPURFSCREEN:一种基于图形处理器(GPU)并使用随机森林分类器的虚拟筛选工具。
J Cheminform. 2016 Mar 1;8:12. doi: 10.1186/s13321-016-0124-8. eCollection 2016.
2
Ligand based virtual screening using SVM on GPU.基于配体的虚拟筛选:在 GPU 上使用 SVM
Comput Biol Chem. 2019 Dec;83:107143. doi: 10.1016/j.compbiolchem.2019.107143. Epub 2019 Nov 10.
3
Ligand Based Virtual Screening Using Self-organizing Maps.基于配体的自组织映射虚拟筛选
Protein J. 2022 Feb;41(1):44-54. doi: 10.1007/s10930-021-10030-9. Epub 2022 Jan 13.
4
eccCL: parallelized GPU implementation of Ensemble Classifier Chains.eccCL:集成分类器链的并行化GPU实现。
BMC Bioinformatics. 2017 Aug 17;18(1):371. doi: 10.1186/s12859-017-1783-9.
5
High performance computing for deformable image registration: towards a new paradigm in adaptive radiotherapy.用于可变形图像配准的高性能计算:迈向自适应放射治疗的新范式。
Med Phys. 2008 Aug;35(8):3546-53. doi: 10.1118/1.2948318.
6
Dynamic parallelism for synaptic updating in GPU-accelerated spiking neural network simulations.GPU加速的脉冲神经网络模拟中用于突触更新的动态并行性
Neurocomputing (Amst). 2018 May 2;302:55-65. doi: 10.1016/j.neucom.2018.04.007.
7
Fast parallel Markov clustering in bioinformatics using massively parallel computing on GPU with CUDA and ELLPACK-R sparse format.使用 GPU 上的大规模并行计算和 CUDA 以及 ELLPACK-R 稀疏格式进行生物信息学中的快速并行马尔可夫聚类。
IEEE/ACM Trans Comput Biol Bioinform. 2012 May-Jun;9(3):679-92. doi: 10.1109/TCBB.2011.68.
8
Uni-Dock: GPU-Accelerated Docking Enables Ultralarge Virtual Screening.Uni-Dock:GPU 加速对接实现超大规模虚拟筛选。
J Chem Theory Comput. 2023 Jun 13;19(11):3336-3345. doi: 10.1021/acs.jctc.2c01145. Epub 2023 Apr 26.
9
Vina-GPU 2.0: Further Accelerating AutoDock Vina and Its Derivatives with Graphics Processing Units.Vina-GPU 2.0:利用图形处理器进一步加速自动对接Vina及其衍生工具
J Chem Inf Model. 2023 Apr 10;63(7):1982-1998. doi: 10.1021/acs.jcim.2c01504. Epub 2023 Mar 20.
10
A fast forward projection using multithreads for multirays on GPUs in medical image reconstruction.基于 GPU 的医学图像重建中多线程快速前向投影的多射线算法。
Med Phys. 2011 Jul;38(7):4052-65. doi: 10.1118/1.3591994.

引用本文的文献

1
Predicting New Anti-Norovirus Inhibitor With the Help of Machine Learning Algorithms and Molecular Dynamics Simulation-Based Model.借助机器学习算法和基于分子动力学模拟的模型预测新型抗诺如病毒抑制剂
Front Chem. 2021 Nov 17;9:753427. doi: 10.3389/fchem.2021.753427. eCollection 2021.
2
MRI-based synthetic CT generation using semantic random forest with iterative refinement.基于语义随机森林的迭代细化的 MRI 合成 CT 生成。
Phys Med Biol. 2019 Apr 5;64(8):085001. doi: 10.1088/1361-6560/ab0b66.
3
MRI-based attenuation correction for brain PET/MRI based on anatomic signature and machine learning.

本文引用的文献

1
Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17.化学宇宙数据库 GDB-17 中 1660 亿个有机小分子的枚举。
J Chem Inf Model. 2012 Nov 26;52(11):2864-75. doi: 10.1021/ci300415d. Epub 2012 Nov 1.
2
Machine learning techniques and drug design.机器学习技术与药物设计。
Curr Med Chem. 2012;19(25):4289-97. doi: 10.2174/092986712802884259.
3
Virtual screening of bioassay data.生物测定数据的虚拟筛选。
基于解剖特征和机器学习的脑 PET/MRI 基于 MRI 的衰减校正。
Phys Med Biol. 2019 Jan 7;64(2):025001. doi: 10.1088/1361-6560/aaf5e0.
4
Prediction Methods of Herbal Compounds in Chinese Medicinal Herbs.中草药中草药化合物的预测方法。
Molecules. 2018 Sep 10;23(9):2303. doi: 10.3390/molecules23092303.
5
Using Big Data to Discover Diagnostics and Therapeutics for Gastrointestinal and Liver Diseases.利用大数据发现胃肠道和肝脏疾病的诊断方法与治疗手段。
Gastroenterology. 2017 Jan;152(1):53-67.e3. doi: 10.1053/j.gastro.2016.09.065. Epub 2016 Oct 20.
6
Construction of Metabolism Prediction Models for CYP450 3A4, 2D6, and 2C9 Based on Microsomal Metabolic Reaction System.基于微粒体代谢反应系统构建CYP450 3A4、2D6和2C9的代谢预测模型
Int J Mol Sci. 2016 Oct 9;17(10):1686. doi: 10.3390/ijms17101686.
J Cheminform. 2009 Dec 22;1:21. doi: 10.1186/1758-2946-1-21.
4
In silico pharmacology for drug discovery: methods for virtual ligand screening and profiling.用于药物发现的计算机模拟药理学:虚拟配体筛选和分析方法
Br J Pharmacol. 2007 Sep;152(1):9-20. doi: 10.1038/sj.bjp.0707305. Epub 2007 Jun 4.
5
Evaluation of machine-learning methods for ligand-based virtual screening.基于配体的虚拟筛选中机器学习方法的评估
J Comput Aided Mol Des. 2007 Jan-Mar;21(1-3):53-62. doi: 10.1007/s10822-006-9096-5. Epub 2007 Jan 5.
6
PowerMV: a software environment for molecular viewing, descriptor generation, data analysis and hit evaluation.PowerMV:一个用于分子可视化、描述符生成、数据分析和命中评估的软件环境。
J Chem Inf Model. 2005 Mar-Apr;45(2):515-22. doi: 10.1021/ci049847v.
7
Random forest: a classification and regression tool for compound classification and QSAR modeling.随机森林:一种用于化合物分类和定量构效关系建模的分类与回归工具。
J Chem Inf Comput Sci. 2003 Nov-Dec;43(6):1947-58. doi: 10.1021/ci034160g.
8
Quantum-Chemical Descriptors in QSAR/QSPR Studies.定量构效关系/定量构性关系研究中的量子化学描述符
Chem Rev. 1996 May 9;96(3):1027-1044. doi: 10.1021/cr950202r.
9
Drug design by machine learning: support vector machines for pharmaceutical data analysis.基于机器学习的药物设计:用于药物数据分析的支持向量机
Comput Chem. 2001 Dec;26(1):5-14. doi: 10.1016/s0097-8485(01)00094-8.