• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多种拉氏构象分布的蛋白质结构分类与环建模

Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions.

作者信息

Najibi Seyed Morteza, Maadooliat Mehdi, Zhou Lan, Huang Jianhua Z, Gao Xin

机构信息

Department of Statistics, College of Sciences, Shiraz University, Shiraz, Iran.

Department of Mathematics, Statistics and Computer Science, Marquette University, WI 53201-1881, USA; Center for Human Genetics, Marshfield Clinic Research Institute, Marshfield, WI 54449, USA.

出版信息

Comput Struct Biotechnol J. 2017 Feb 8;15:243-254. doi: 10.1016/j.csbj.2017.01.011. eCollection 2017.

DOI:10.1016/j.csbj.2017.01.011
PMID:28280526
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5331158/
Abstract

Recently, the study of protein structures using angular representations has attracted much attention among structural biologists. The main challenge is how to efficiently model the continuous conformational space of the protein structures based on the differences and similarities between different Ramachandran plots. Despite the presence of statistical methods for modeling angular data of proteins, there is still a substantial need for more sophisticated and faster statistical tools to model the large-scale circular datasets. To address this need, we have developed a nonparametric method for collective estimation of multiple bivariate density functions for a collection of populations of protein backbone angles. The proposed method takes into account the circular nature of the angular data using trigonometric spline which is more efficient compared to existing methods. This collective density estimation approach is widely applicable when there is a need to estimate multiple density functions from different populations with common features. Moreover, the coefficients of adaptive basis expansion for the fitted densities provide a low-dimensional representation that is useful for visualization, clustering, and classification of the densities. The proposed method provides a novel and unique perspective to two important and challenging problems in protein structure research: structure-based protein classification and angular-sampling-based protein loop structure prediction.

摘要

最近,使用角度表示法研究蛋白质结构在结构生物学家中引起了广泛关注。主要挑战在于如何基于不同拉马钱德兰图之间的差异和相似性,有效地对蛋白质结构的连续构象空间进行建模。尽管存在用于对蛋白质角度数据进行建模的统计方法,但仍然迫切需要更复杂、更快的统计工具来对大规模圆形数据集进行建模。为满足这一需求,我们开发了一种非参数方法,用于对蛋白质主链角度群体集合的多个二元密度函数进行集体估计。所提出的方法使用三角样条考虑了角度数据的圆形性质,与现有方法相比效率更高。当需要从具有共同特征的不同群体中估计多个密度函数时,这种集体密度估计方法具有广泛的适用性。此外,拟合密度的自适应基展开系数提供了一种低维表示,可用于密度的可视化、聚类和分类。所提出的方法为蛋白质结构研究中的两个重要且具有挑战性的问题提供了新颖独特的视角:基于结构的蛋白质分类和基于角度采样的蛋白质环结构预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2a8/5331158/e55ca149ed0f/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2a8/5331158/efa0965fda05/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2a8/5331158/041458f0eeba/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2a8/5331158/e55ca149ed0f/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2a8/5331158/efa0965fda05/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2a8/5331158/041458f0eeba/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2a8/5331158/e55ca149ed0f/gr3.jpg

相似文献

1
Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions.基于多种拉氏构象分布的蛋白质结构分类与环建模
Comput Struct Biotechnol J. 2017 Feb 8;15:243-254. doi: 10.1016/j.csbj.2017.01.011. eCollection 2017.
2
Nonparametric collective spectral density estimation with an application to clustering the brain signals.非参数总体谱密度估计及其在脑信号聚类中的应用。
Stat Med. 2018 Dec 30;37(30):4789-4806. doi: 10.1002/sim.7972. Epub 2018 Sep 26.
3
Density Estimation for Protein Conformation Angles Using a Bivariate von Mises Distribution and Bayesian Nonparametrics.使用二元冯·米塞斯分布和贝叶斯非参数方法进行蛋白质构象角的密度估计
J Am Stat Assoc. 2009 Jun 1;104(486):586-596. doi: 10.1198/jasa.2009.0024.
4
New efficient statistical sequence-dependent structure prediction of short to medium-sized protein loops based on an exhaustive loop classification.基于详尽的环分类对短至中等大小蛋白质环进行新型高效的统计序列依赖性结构预测。
J Mol Biol. 1999 Jun 25;289(5):1469-90. doi: 10.1006/jmbi.1999.2826.
5
Modeling angles in proteins and circular genomes using multivariate angular distributions based on multiple nonnegative trigonometric sums.基于多个非负三角和,使用多元角分布对蛋白质和环状基因组中的角度进行建模。
Stat Appl Genet Mol Biol. 2014 Feb;13(1):1-18. doi: 10.1515/sagmb-2012-0012.
6
Assessing protein conformational sampling methods based on bivariate lag-distributions of backbone angles.基于主链角度双变量滞后分布评估蛋白质构象抽样方法。
Brief Bioinform. 2013 Nov;14(6):724-36. doi: 10.1093/bib/bbs052. Epub 2012 Aug 27.
7
Protein bioinformatics and mixtures of bivariate von Mises distributions for angular data.用于角度数据的蛋白质生物信息学和二元冯·米塞斯分布混合模型
Biometrics. 2007 Jun;63(2):505-12. doi: 10.1111/j.1541-0420.2006.00682.x.
8
Estimation of Probability Distribution and Its Application in Bayesian Classification and Maximum Likelihood Regression.概率分布的估计及其在贝叶斯分类和最大似然回归中的应用。
Interdiscip Sci. 2019 Sep;11(3):559-574. doi: 10.1007/s12539-019-00343-w. Epub 2019 Jul 17.
9
Performance of a Computational Model of the Mammalian Olfactory System哺乳动物嗅觉系统计算模型的性能
10
Objectively judging the quality of a protein structure from a Ramachandran plot.从拉氏图客观判断蛋白质结构的质量。
Comput Appl Biosci. 1997 Aug;13(4):425-30. doi: 10.1093/bioinformatics/13.4.425.

引用本文的文献

1
Exploring the Inhibitory Potential of Compounds Against -Acetylglucosamine (Mur) Receptor: Insights Into Antibacterial Activity and Drug-Likeness.探索化合物对β-乙酰氨基葡萄糖(Mur)受体的抑制潜力:对抗菌活性和类药性的见解。
ScientificWorldJournal. 2024 Nov 30;2024:3569811. doi: 10.1155/tswj/3569811. eCollection 2024.
2
In silico study reveals unconventional interactions between MDC1 of DDR and Beclin-1 of autophagy.计算机研究揭示 DDR 的 MDC1 和自噬的 Beclin-1 之间的非常规相互作用。
Mol Divers. 2023 Dec;27(6):2789-2802. doi: 10.1007/s11030-022-10579-2. Epub 2022 Dec 8.
3
Prediction of Protein Backbone Torsion Angles Using Deep Residual Inception Neural Networks.

本文引用的文献

1
Assessing protein conformational sampling methods based on bivariate lag-distributions of backbone angles.基于主链角度双变量滞后分布评估蛋白质构象抽样方法。
Brief Bioinform. 2013 Nov;14(6):724-36. doi: 10.1093/bib/bbs052. Epub 2012 Aug 27.
2
P.R.E.S.S.--an R-package for exploring residual-level protein structural statistics.P.R.E.S.S.——一个用于探索残基水平蛋白质结构统计数据的R包。
J Bioinform Comput Biol. 2012 Jun;10(3):1242007. doi: 10.1142/S0219720012420073.
3
A stochastic evolutionary model for protein structure alignment and phylogeny.
使用深度残差 inception 神经网络预测蛋白质主链扭转角
IEEE/ACM Trans Comput Biol Bioinform. 2018 Mar 12. doi: 10.1109/TCBB.2018.2814586.
一种用于蛋白质结构比对和系统发生的随机进化模型。
Mol Biol Evol. 2012 Nov;29(11):3575-87. doi: 10.1093/molbev/mss167. Epub 2012 Jun 21.
4
A protein-dependent side-chain rotamer library.一种依赖于蛋白质的侧链构象文库。
BMC Bioinformatics. 2011 Dec 14;12 Suppl 14(Suppl 14):S10. doi: 10.1186/1471-2105-12-S14-S10.
5
Near-native protein loop sampling using nonparametric density estimation accommodating sparcity.使用非参数密度估计和稀疏性适应进行近天然蛋白质环采样。
PLoS Comput Biol. 2011 Oct;7(10):e1002234. doi: 10.1371/journal.pcbi.1002234. Epub 2011 Oct 20.
6
A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions.基于自适应核密度估计和回归的蛋白质平滑骨架相关构象文库。
Structure. 2011 Jun 8;19(6):844-58. doi: 10.1016/j.str.2011.03.019.
7
A DIRICHLET PROCESS MIXTURE OF HIDDEN MARKOV MODELS FOR PROTEIN STRUCTURE PREDICTION.用于蛋白质结构预测的隐马尔可夫模型的狄利克雷过程混合模型
Ann Appl Stat. 2010 Jun 1;4(2):916-942. doi: 10.1214/09-AOAS296.
8
A probabilistic and continuous model of protein conformational space for template-free modeling.一种用于无模板建模的蛋白质构象空间的概率性连续模型。
J Comput Biol. 2010 Jun;17(6):783-98. doi: 10.1089/cmb.2009.0235.
9
Dali server: conservation mapping in 3D.大理服务器:三维保护图谱构建。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W545-9. doi: 10.1093/nar/gkq366. Epub 2010 May 10.
10
Neighbor-dependent Ramachandran probability distributions of amino acids developed from a hierarchical Dirichlet process model.基于层次狄利克雷过程模型开发的依赖于邻居的氨基酸拉马钱德兰概率分布。
PLoS Comput Biol. 2010 Apr 29;6(4):e1000763. doi: 10.1371/journal.pcbi.1000763.