• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用一种结合了序列和二级结构相似性得分的核方法进行远程同源性检测。

Remote homology detection using a kernel method that combines sequence and secondary-structure similarity scores.

作者信息

Wieser Daniela, Niranjan Mahesan

机构信息

The European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.

出版信息

In Silico Biol. 2009;9(3):89-103.

PMID:19795568
Abstract

Distant evolutionary relationships between proteins with low sequence similarity are difficult to recognise by computational methods. Consequently, many sequences obtained from large-scale sequencing projects cannot be assigned to any known proteins or families despite being evolutionarily related. To boost sensitivity, various sequence-based methods have been modified to make use of the better conserved secondary structure. Most of these methods are instance-based or generative. Here, we introduce a kernel-based remote homology detection method that allows for a combination of sequence and secondary-structure similarity scores in a discriminative approach. We studied the ability of the method to predict superfamily membership as defined by the SCOP database. We show that a kernel method that combined sequence similarity scores with predicted secondary-structure similarity scores performed similar to a classifier that used scores calculated from sequences and true secondary structures, but performed better than a sequence-only based classifier and achieved a better mean than recently published results on the same data-set. It can be concluded that SVM classifiers trained to predict homology between distantly related proteins, become more accurate, if a joint sequence/secondary-structure similarity score approach is used.

摘要

序列相似性较低的蛋白质之间的远缘进化关系很难通过计算方法识别。因此,尽管从大规模测序项目中获得的许多序列在进化上相关,但它们无法被归类到任何已知的蛋白质或蛋白家族中。为了提高敏感性,各种基于序列的方法已被改进,以利用保守性更好的二级结构。这些方法大多基于实例或生成式。在此,我们介绍一种基于核的远程同源性检测方法,该方法允许在判别式方法中结合序列和二级结构相似性得分。我们研究了该方法预测由SCOP数据库定义的超家族成员的能力。我们表明,一种将序列相似性得分与预测的二级结构相似性得分相结合的核方法,其表现与使用从序列和真实二级结构计算出的得分的分类器相似,但比仅基于序列的分类器表现更好,并且在同一数据集上比最近发表的结果有更好的平均值。可以得出结论,如果使用联合序列/二级结构相似性得分方法,训练用于预测远缘相关蛋白质之间同源性的支持向量机分类器会变得更加准确。

相似文献

1
Remote homology detection using a kernel method that combines sequence and secondary-structure similarity scores.使用一种结合了序列和二级结构相似性得分的核方法进行远程同源性检测。
In Silico Biol. 2009;9(3):89-103.
2
Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores.评估基因组学中的注释转移:通过传统分数和概率分数量化蛋白质序列、结构与功能之间的关系。
J Mol Biol. 2000 Mar 17;297(1):233-49. doi: 10.1006/jmbi.2000.3550.
3
Remote protein homology detection and fold recognition using two-layer support vector machine classifiers.使用两层支持向量机分类器进行远程蛋白质同源检测和折叠识别。
Comput Biol Med. 2011 Aug;41(8):687-99. doi: 10.1016/j.compbiomed.2011.06.004. Epub 2011 Jun 25.
4
PFRES: protein fold classification by using evolutionary information and predicted secondary structure.PFRES:利用进化信息和预测的二级结构进行蛋白质折叠分类
Bioinformatics. 2007 Nov 1;23(21):2843-50. doi: 10.1093/bioinformatics/btm475. Epub 2007 Oct 17.
5
SVM-based detection of distant protein structural relationships using pairwise probabilistic suffix trees.基于支持向量机,利用成对概率后缀树检测远距离蛋白质结构关系。
Comput Biol Chem. 2006 Aug;30(4):292-9. doi: 10.1016/j.compbiolchem.2006.05.001.
6
Kernel methods for predicting protein-protein interactions.用于预测蛋白质-蛋白质相互作用的核方法。
Bioinformatics. 2005 Jun;21 Suppl 1:i38-46. doi: 10.1093/bioinformatics/bti1016.
7
Sequence-based protein structure prediction using a reduced state-space hidden Markov model.使用简化状态空间隐马尔可夫模型进行基于序列的蛋白质结构预测。
Comput Biol Med. 2007 Sep;37(9):1211-24. doi: 10.1016/j.compbiomed.2006.10.014. Epub 2006 Dec 11.
8
Surface map comparison: studying function diversity of homologous proteins.表面图谱比较:研究同源蛋白的功能多样性。
J Mol Biol. 2001 Jun 8;309(3):793-806. doi: 10.1006/jmbi.2001.4630.
9
Remote homolog detection using local sequence-structure correlations.利用局部序列-结构相关性进行远程同源物检测。
Proteins. 2004 Nov 15;57(3):518-30. doi: 10.1002/prot.20221.
10
Beyond the Twilight Zone: automated prediction of structural properties of proteins by recursive neural networks and remote homology information.超越模糊地带:利用递归神经网络和远程同源信息自动预测蛋白质的结构特性
Proteins. 2009 Oct;77(1):181-90. doi: 10.1002/prot.22429.

引用本文的文献

1
An evaluation of different classification algorithms for protein sequence-based reverse vaccinology prediction.基于蛋白质序列的反向疫苗学预测的不同分类算法评估。
PLoS One. 2019 Dec 13;14(12):e0226256. doi: 10.1371/journal.pone.0226256. eCollection 2019.
2
Examining marginal sequence similarities between bacterial type III secretion system components and Trypanosoma cruzi surface proteins: horizontal gene transfer or convergent evolution?检测细菌 III 型分泌系统组件与克氏锥虫表面蛋白之间的边缘序列相似性:水平基因转移还是趋同进化?
Front Genet. 2013 Aug 16;4:143. doi: 10.3389/fgene.2013.00143. eCollection 2013.
3
Searching remote homology with spectral clustering with symmetry in neighborhood cluster kernels.
基于邻域聚类核的谱聚类对称性搜索远程同源性。
PLoS One. 2013;8(2):e46468. doi: 10.1371/journal.pone.0046468. Epub 2013 Feb 15.