• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于谱密度比的蛋白质序列二元分割聚类方法:一项比较研究。

Spectral density ratio based clustering methods for the binary segmentation of protein sequences: a comparative study.

作者信息

Ioannou Alexis, Fokianos Konstantinos, Promponas Vasilis J

机构信息

Department of Mathematics & Statistics, University of Cyprus, Nicosia, Cyprus.

出版信息

Biosystems. 2010 May;100(2):132-43. doi: 10.1016/j.biosystems.2010.02.008. Epub 2010 Mar 4.

DOI:10.1016/j.biosystems.2010.02.008
PMID:20206663
Abstract

We compare several spectral domain based clustering methods for partitioning protein sequence data. The main instrument for this exercise is the spectral density ratio model, which specifies that the logarithmic ratio of two or more unknown spectral density functions has a parametric linear combination of cosines. Maximum likelihood inference is worked out in detail and it is shown that its output yields several distance measures among independent stationary time series. These similarity indices are suitable for clustering time series data based on their second order properties. Other spectral domain based distances are investigated as well; and we compare all methods and distances to the problem of producing segmentations of bacterial outer membrane proteins consistent with their transmembrane topology. Protein sequences are transformed to time series data by employing numerical scales of physicochemical parameters. We also present interesting results on the prediction of transmembrane beta-strands, based on the clustering outcome, for a representative set of bacterial outer membrane proteins with given three-dimensional structure.

摘要

我们比较了几种基于谱域的聚类方法,用于对蛋白质序列数据进行划分。此项研究的主要工具是谱密度比模型,该模型规定两个或多个未知谱密度函数的对数比具有余弦的参数线性组合。详细推导了最大似然推断,并表明其输出产生了独立平稳时间序列之间的几种距离度量。这些相似性指标适用于基于时间序列数据的二阶特性对其进行聚类。还研究了其他基于谱域的距离;并且我们将所有方法和距离与产生与细菌外膜蛋白跨膜拓扑结构一致的分割问题进行比较。通过采用物理化学参数的数值尺度,将蛋白质序列转换为时间序列数据。对于一组具有给定三维结构的代表性细菌外膜蛋白,我们还基于聚类结果给出了关于跨膜β链预测的有趣结果。

相似文献

1
Spectral density ratio based clustering methods for the binary segmentation of protein sequences: a comparative study.基于谱密度比的蛋白质序列二元分割聚类方法:一项比较研究。
Biosystems. 2010 May;100(2):132-43. doi: 10.1016/j.biosystems.2010.02.008. Epub 2010 Mar 4.
2
On the quality of tree-based protein classification.论基于树的蛋白质分类的质量。
Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.
3
A similarity network approach for the analysis and comparison of protein sequence/structure sets.相似网络分析方法在蛋白质序列/结构组分析和比较中的应用。
J Biomed Inform. 2010 Apr;43(2):257-67. doi: 10.1016/j.jbi.2010.01.005. Epub 2010 Jan 25.
4
Evolution of outer membrane beta-barrels from an ancestral beta beta hairpin.从祖先的β-β发夹到外膜β-桶的进化。
Mol Biol Evol. 2010 Jun;27(6):1348-58. doi: 10.1093/molbev/msq017. Epub 2010 Jan 27.
5
Blast sampling for structural and functional analyses.用于结构和功能分析的胚细胞采样。
BMC Bioinformatics. 2007 Feb 23;8:62. doi: 10.1186/1471-2105-8-62.
6
Incremental generation of summarized clustering hierarchy for protein family analysis.用于蛋白质家族分析的汇总聚类层次结构的增量生成。
Bioinformatics. 2004 Nov 1;20(16):2586-96. doi: 10.1093/bioinformatics/bth290. Epub 2004 May 6.
7
Clustering protein sequences with a novel metric transformed from sequence similarity scores and sequence alignments with neural networks.使用从序列相似性得分转换而来的新度量以及神经网络进行的序列比对来对蛋白质序列进行聚类。
BMC Bioinformatics. 2005 Oct 3;6:242. doi: 10.1186/1471-2105-6-242.
8
Natural similarity measures between position frequency matrices with an application to clustering.位置频率矩阵之间的自然相似性度量及其在聚类中的应用。
Bioinformatics. 2008 Feb 1;24(3):350-7. doi: 10.1093/bioinformatics/btm610. Epub 2008 Jan 2.
9
Scoredist: a simple and robust protein sequence distance estimator.Scoredist:一种简单且强大的蛋白质序列距离估计器。
BMC Bioinformatics. 2005 Apr 27;6:108. doi: 10.1186/1471-2105-6-108.
10
Alignment and structure prediction of divergent protein families: periplasmic and outer membrane proteins of bacterial efflux pumps.不同蛋白质家族的比对与结构预测:细菌外排泵的周质和外膜蛋白
J Mol Biol. 1999 Apr 2;287(3):695-715. doi: 10.1006/jmbi.1999.2630.

引用本文的文献

1
A Methodology for Discriminant Time Series Analysis Applied to Microclimate Monitoring of Fresco Paintings.一种判别时间序列分析方法及其在壁画微气候监测中的应用。
Sensors (Basel). 2021 Jan 9;21(2):436. doi: 10.3390/s21020436.