Protein Conformational States-A First Principles Bayesian Method.

Author information

Rogers David M

Affiliation

National Center for Computational Sciences, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA.

Publication information

Entropy (Basel). 2020 Oct 31;22(11):1242. doi: 10.3390/e22111242.

DOI:10.3390/e22111242

PMID:33287010

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC7712966/

Abstract

Automated identification of protein conformational states from simulation of an ensemble of structures is a hard problem because it requires teaching a computer to recognize shapes. We adapt the naïve Bayes classifier from the machine learning community for use on atom-to-atom pairwise contacts. The result is an unsupervised learning algorithm that samples a 'distribution' over potential classification schemes. We apply the classifier to a series of test structures and one real protein, showing that it identifies the conformational transition with >95% accuracy in most cases. A nontrivial feature of our adaptation is a new connection to information entropy that allows us to vary the level of structural detail without spoiling the categorization. This is confirmed by comparing results as the number of atoms and time-samples are varied over 1.5 orders of magnitude. Further, the method's derivation from Bayesian analysis on the set of inter-atomic contacts makes it easy to understand and extend to more complex cases.
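
As a rough illustration of the idea in the abstract, the sketch below turns each simulation frame into binary atom-to-atom contact features and groups frames with a naive Bayes (Bernoulli mixture) model. It is a minimal sketch under stated assumptions, not the paper's algorithm: the abstract describes sampling a distribution over classification schemes, whereas this example substitutes a plain EM point estimate. The function names `contact_map` and `fit_bernoulli_mixture`, the 5.0 distance cutoff, and the add-one smoothing are all hypothetical choices made for illustration.

```python
import numpy as np

def contact_map(coords, cutoff=5.0):
    """Binary contact features for one frame (hypothetical helper).

    coords: (n_atoms, 3) array. Returns a flat vector with one entry per
    atom pair i < j, equal to 1.0 when the pair lies within `cutoff`.
    The cutoff value is an assumption, not taken from the paper.
    """
    coords = np.asarray(coords, dtype=float)
    dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    upper = np.triu_indices(len(coords), k=1)
    return (dist[upper] < cutoff).astype(float)

def fit_bernoulli_mixture(X, k=2, n_iter=100, seed=0):
    """Cluster binary feature rows with a naive Bayes mixture fit by EM.

    X: (n_frames, n_contacts) array of 0/1 values.
    Returns (labels, pi, theta): hard state assignments, state weights,
    and per-state contact probabilities. EM is a stand-in here for the
    sampling scheme mentioned in the abstract.
    """
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    resp = rng.dirichlet(np.ones(k), size=n)  # random soft assignments
    for _ in range(n_iter):
        # M-step: smoothed state weights and contact probabilities.
        nk = resp.sum(axis=0)
        pi = (nk + 1.0) / (n + k)
        theta = (resp.T @ X + 1.0) / (nk[:, None] + 2.0)
        # E-step: contacts are treated as independent given the state.
        log_p = np.log(pi) + X @ np.log(theta).T + (1.0 - X) @ np.log1p(-theta).T
        log_p -= log_p.max(axis=1, keepdims=True)  # numerical stability
        resp = np.exp(log_p)
        resp /= resp.sum(axis=1, keepdims=True)
    return resp.argmax(axis=1), pi, theta
```

In use, one would stack `contact_map(frame)` over all frames of a trajectory into `X` and call `fit_bernoulli_mixture(X, k=2)`; the returned labels are only defined up to a permutation of the state indices.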

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56dc/7712966/0177b234f886/entropy-22-01242-g001.jpg
