• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于蛋白质折叠分类的新型氨基酸特性选择方法。

A Novel Amino Acid Properties Selection Method for Protein Fold Classification.

作者信息

Zhang Lichao, Kong Liang

机构信息

School of Mathematics and Statistics, Northeastern University at Qinhuangdao, Qinhuangdao, China.

College of Sciences, Northeastern University, Shenyang, China.

出版信息

Protein Pept Lett. 2020;27(4):287-294. doi: 10.2174/0929866526666190718151753.

DOI:10.2174/0929866526666190718151753
PMID:32207399
Abstract

BACKGROUND

Amino acid physicochemical properties encoded in protein primary structure play a crucial role in protein folding. However, it is not yet clear which of the properties are the most suitable for protein fold classification.

OBJECTIVE

To avoid exhaustively searching the total properties space, an amino acid properties selection method was proposed in this study to rapidly obtain a suitable properties combination for protein fold classification.

METHODS

The proposed amino acid properties selection method was based on sequential floating forward selection strategy. Beginning with an empty set, variable number of features were added iteratively until achieving the iteration termination condition.

RESULTS

The experimental results indicate that the proposed method improved prediction accuracies by 0.26-5% on a widely used benchmark dataset with appropriately selected amino acid properties.

CONCLUSION

The proposed properties selection method can be extended to other biomolecule property related classification problems in bioinformatics.

摘要

背景

蛋白质一级结构中编码的氨基酸物理化学性质在蛋白质折叠中起着至关重要的作用。然而,目前尚不清楚哪些性质最适合用于蛋白质折叠分类。

目的

为避免详尽搜索整个性质空间,本研究提出一种氨基酸性质选择方法,以快速获得适合蛋白质折叠分类的性质组合。

方法

所提出的氨基酸性质选择方法基于序列浮动前向选择策略。从空集开始,迭代添加可变数量的特征,直到达到迭代终止条件。

结果

实验结果表明,在广泛使用的基准数据集上,通过适当选择氨基酸性质,所提出的方法可将预测准确率提高0.26%-5%。

结论

所提出的性质选择方法可扩展到生物信息学中其他与生物分子性质相关的分类问题。

相似文献

1
A Novel Amino Acid Properties Selection Method for Protein Fold Classification.一种用于蛋白质折叠分类的新型氨基酸特性选择方法。
Protein Pept Lett. 2020;27(4):287-294. doi: 10.2174/0929866526666190718151753.
2
Protein fold classification with genetic algorithms and feature selection.基于遗传算法和特征选择的蛋白质折叠分类
J Bioinform Comput Biol. 2009 Oct;7(5):773-88. doi: 10.1142/s0219720009004321.
3
Support Vector Machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs.基于支持向量机,利用氨基酸残基和氨基酸残基对的结构特性对蛋白质折叠进行分类。
Bioinformatics. 2007 Dec 15;23(24):3320-7. doi: 10.1093/bioinformatics/btm527. Epub 2007 Nov 7.
4
Predicting and analyzing DNA-binding domains using a systematic approach to identifying a set of informative physicochemical and biochemical properties.使用系统方法预测和分析 DNA 结合域,以确定一组有意义的物理化学和生化特性。
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S47. doi: 10.1186/1471-2105-12-S1-S47.
5
Computational methods for ubiquitination site prediction using physicochemical properties of protein sequences.利用蛋白质序列的物理化学性质进行泛素化位点预测的计算方法。
BMC Bioinformatics. 2016 Mar 3;17:116. doi: 10.1186/s12859-016-0959-z.
6
Predicting protein fold types by the general form of Chou's pseudo amino acid composition: approached from optimal feature extractions.基于周氏伪氨基酸组成的一般形式预测蛋白质折叠类型:从最优特征提取入手
Protein Pept Lett. 2012 Apr;19(4):439-49. doi: 10.2174/092986612799789378.
7
A novel fusion based on the evolutionary features for protein fold recognition using support vector machines.一种基于进化特征的新型融合方法,用于使用支持向量机进行蛋白质折叠识别。
Sci Rep. 2020 Sep 1;10(1):14368. doi: 10.1038/s41598-020-71172-x.
8
Predicting protein structural class by incorporating patterns of over-represented k-mers into the general form of Chou's PseAAC.通过将高丰度k聚体模式纳入周氏伪氨基酸组成的一般形式来预测蛋白质结构类
Protein Pept Lett. 2012 Apr;19(4):388-97. doi: 10.2174/092986612799789350.
9
An ensemble approach to protein fold classification by integration of template-based assignment and support vector machine classifier.一种通过整合基于模板的分配和支持向量机分类器进行蛋白质折叠分类的集成方法。
Bioinformatics. 2017 Mar 15;33(6):863-870. doi: 10.1093/bioinformatics/btw768.
10
A Computational Method for the Identification of Endolysins and Autolysins.一种用于鉴定内溶素和自溶素的计算方法。
Protein Pept Lett. 2020;27(4):329-336. doi: 10.2174/0929866526666191002104735.

引用本文的文献

1
Sequence-Based Prediction of Plant Allergenic Proteins: Machine Learning Classification Approach.基于序列的植物变应原蛋白预测:机器学习分类方法
ACS Omega. 2023 Jan 20;8(4):3698-3704. doi: 10.1021/acsomega.2c02842. eCollection 2023 Jan 31.
2
Protein music of enhanced musicality by music style guided exploration of diverse amino acid properties.通过音乐风格引导探索多样氨基酸特性实现增强音乐性的蛋白质音乐。
Heliyon. 2021 Sep 29;7(9):e07933. doi: 10.1016/j.heliyon.2021.e07933. eCollection 2021 Sep.