• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 Chou 的 5 步法则结合进化信息预测 DNA 结合蛋白。

Use Chou's 5-Step Rule to Predict DNA-Binding Proteins with Evolutionary Information.

机构信息

School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China.

Suzhou Key Laboratory of Virtual Reality Intelligent Interaction and Application Technology, Suzhou University of Science and Technology, Suzhou 215009, China.

出版信息

Biomed Res Int. 2020 Jul 27;2020:6984045. doi: 10.1155/2020/6984045. eCollection 2020.

DOI:10.1155/2020/6984045
PMID:32775434
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7407024/
Abstract

The knowledge of DNA-binding proteins would help to understand the functions of proteins better in cellular biological processes. Research on the prediction of DNA-binding proteins can promote the research of drug proteins and computer acidified drugs. In recent years, methods based on machine learning are usually used to predict proteins. Although great predicted performance can be achieved via current methods, researchers still need to invest more research in terms of the improvement of predicted performance. In this study, the prediction of DNA-binding proteins is studied from the perspective of evolutionary information and the support vector machine method. One machine learning model for predicting DNA-binding proteins based on evolutionary features by using Chou's 5-step rule is put forward. The results show that great predicted performance is obtained on benchmark dataset PDB1075 and independent dataset PDB186, achieving the accuracy of 86.05% and 75.30%, respectively. Thus, the method proposed is comparable to a certain degree, and it may work even better than other methods to some extent.

摘要

DNA 结合蛋白的知识有助于更好地理解蛋白质在细胞生物过程中的功能。对 DNA 结合蛋白的预测研究可以促进药物蛋白和计算机酸化药物的研究。近年来,基于机器学习的方法通常用于预测蛋白质。尽管目前的方法可以达到很好的预测性能,但研究人员仍需要在提高预测性能方面投入更多的研究。在这项研究中,从进化信息和支持向量机方法的角度研究了 DNA 结合蛋白的预测。提出了一种基于进化特征和 Chou 的 5 步规则的 DNA 结合蛋白预测的机器学习模型。结果表明,在基准数据集 PDB1075 和独立数据集 PDB186 上均获得了很好的预测性能,准确率分别达到 86.05%和 75.30%。因此,所提出的方法具有一定的可比性,在某些方面可能比其他方法效果更好。

相似文献

1
Use Chou's 5-Step Rule to Predict DNA-Binding Proteins with Evolutionary Information.利用 Chou 的 5 步法则结合进化信息预测 DNA 结合蛋白。
Biomed Res Int. 2020 Jul 27;2020:6984045. doi: 10.1155/2020/6984045. eCollection 2020.
2
DPP-PseAAC: A DNA-binding protein prediction model using Chou's general PseAAC.DPP-PseAAC:一种基于 Chou 的通用 PseAAC 的 DNA 结合蛋白预测模型。
J Theor Biol. 2018 Sep 7;452:22-34. doi: 10.1016/j.jtbi.2018.05.006. Epub 2018 May 16.
3
FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model for Identifying DNA-Binding Proteins by Multi-View Sequence Features via Chou's Five-Step Rule.FKRR-MVSF:一种基于模糊核岭回归模型的多视图序列特征方法,通过周的五步法则识别 DNA 结合蛋白。
Int J Mol Sci. 2019 Aug 26;20(17):4175. doi: 10.3390/ijms20174175.
4
MsDBP: Exploring DNA-Binding Proteins by Integrating Multiscale Sequence Information via Chou's Five-Step Rule.MsDBP:通过整合多尺度序列信息和周的五步法则探索 DNA 结合蛋白
J Proteome Res. 2019 Aug 2;18(8):3119-3132. doi: 10.1021/acs.jproteome.9b00226. Epub 2019 Jul 17.
5
PseDNA-Pro: DNA-Binding Protein Identification by Combining Chou's PseAAC and Physicochemical Distance Transformation.PseDNA-Pro:结合周氏伪氨基酸组成和物理化学距离变换的DNA结合蛋白鉴定方法
Mol Inform. 2015 Jan;34(1):8-17. doi: 10.1002/minf.201400025. Epub 2014 Sep 26.
6
Identification of DNA-binding proteins by Kernel Sparse Representation via L-matrix norm.基于 L 矩阵范数的核稀疏表示鉴定 DNA 结合蛋白
Comput Biol Med. 2023 Jun;159:106849. doi: 10.1016/j.compbiomed.2023.106849. Epub 2023 Apr 11.
7
Effective DNA binding protein prediction by using key features via Chou's general PseAAC.利用周元的通用 PseAAC 算法通过关键特征预测有效 DNA 结合蛋白。
J Theor Biol. 2019 Jan 7;460:64-78. doi: 10.1016/j.jtbi.2018.10.027. Epub 2018 Oct 11.
8
DP-BINDER: machine learning model for prediction of DNA-binding proteins by fusing evolutionary and physicochemical information.DP-BINDER:一种通过融合进化和物理化学信息来预测 DNA 结合蛋白的机器学习模型。
J Comput Aided Mol Des. 2019 Jul;33(7):645-658. doi: 10.1007/s10822-019-00207-x. Epub 2019 May 23.
9
Predicting apoptosis protein subcellular localization by integrating auto-cross correlation and PSSM into Chou's PseAAC.通过将自相关和 PSSM 整合到 Chou 的 PseAAC 中,预测细胞凋亡蛋白的亚细胞定位。
J Theor Biol. 2018 Nov 14;457:163-169. doi: 10.1016/j.jtbi.2018.08.042. Epub 2018 Sep 1.
10
Improved detection of DNA-binding proteins via compression technology on PSSM information.通过基于位置特异性得分矩阵(PSSM)信息的压缩技术改进DNA结合蛋白的检测。
PLoS One. 2017 Sep 29;12(9):e0185587. doi: 10.1371/journal.pone.0185587. eCollection 2017.

引用本文的文献

1
StackedEnC-AOP: prediction of antioxidant proteins using transform evolutionary and sequential features based multi-scale vector with stacked ensemble learning.StackedEnC-AOP:基于多尺度向量的转换进化和序列特征与堆叠集成学习预测抗氧化蛋白。
BMC Bioinformatics. 2024 Aug 4;25(1):256. doi: 10.1186/s12859-024-05884-6.
2
Hybrid_DBP: Prediction of DNA-binding proteins using hybrid features and convolutional neural networks.Hybrid_DBP:利用混合特征和卷积神经网络预测DNA结合蛋白。
Front Pharmacol. 2022 Oct 10;13:1031759. doi: 10.3389/fphar.2022.1031759. eCollection 2022.
3
Research on DNA-Binding Protein Identification Method Based on LSTM-CNN Feature Fusion.

本文引用的文献

1
Jackknife Model Averaging Prediction Methods for Complex Phenotypes with Gene Expression Levels by Integrating External Pathway Information.通过整合外部通路信息对具有基因表达水平的复杂表型进行折刀法模型平均预测方法
Comput Math Methods Med. 2019 Apr 8;2019:2807470. doi: 10.1155/2019/2807470. eCollection 2019.
2
Identification of DNA-protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information.基于序列信息的多尺度局部平均块识别 DNA-蛋白质结合位点。
Molecules. 2017 Nov 28;22(12):2079. doi: 10.3390/molecules22122079.
3
PseDNA-Pro: DNA-Binding Protein Identification by Combining Chou's PseAAC and Physicochemical Distance Transformation.
基于 LSTM-CNN 特征融合的 DNA 结合蛋白识别方法研究。
Comput Math Methods Med. 2022 Jun 2;2022:9705275. doi: 10.1155/2022/9705275. eCollection 2022.
4
Identifying Membrane Protein Types Based on Lifelong Learning With Dynamically Scalable Networks.基于动态可扩展网络的终身学习识别膜蛋白类型
Front Genet. 2022 Mar 14;12:834488. doi: 10.3389/fgene.2021.834488. eCollection 2021.
5
KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest.KK-DBP:一种基于随机森林的用于DNA结合蛋白识别的多特征融合方法
Front Genet. 2021 Nov 29;12:811158. doi: 10.3389/fgene.2021.811158. eCollection 2021.
PseDNA-Pro:结合周氏伪氨基酸组成和物理化学距离变换的DNA结合蛋白鉴定方法
Mol Inform. 2015 Jan;34(1):8-17. doi: 10.1002/minf.201400025. Epub 2014 Sep 26.
4
DFLpred: High-throughput prediction of disordered flexible linker regions in protein sequences.DFLpred:蛋白质序列中无序柔性连接区的高通量预测
Bioinformatics. 2016 Jun 15;32(12):i341-i350. doi: 10.1093/bioinformatics/btw280.
5
RVMAB: Using the Relevance Vector Machine Model Combined with Average Blocks to Predict the Interactions of Proteins from Protein Sequences.RVMAB:使用相关向量机模型结合平均块从蛋白质序列预测蛋白质相互作用
Int J Mol Sci. 2016 May 18;17(5):757. doi: 10.3390/ijms17050757.
6
Computational learning on specificity-determining residue-nucleotide interactions.关于特异性决定残基-核苷酸相互作用的计算学习
Nucleic Acids Res. 2015 Dec 2;43(21):10180-9. doi: 10.1093/nar/gkv1134. Epub 2015 Nov 2.
7
DNA binding protein identification by combining pseudo amino acid composition and profile-based protein representation.通过结合伪氨基酸组成和基于轮廓的蛋白质表示来鉴定DNA结合蛋白
Sci Rep. 2015 Oct 20;5:15479. doi: 10.1038/srep15479.
8
Identifying DNA-binding proteins by combining support vector machine and PSSM distance transformation.通过结合支持向量机和位置特异性得分矩阵距离变换来识别DNA结合蛋白。
BMC Syst Biol. 2015;9 Suppl 1(Suppl 1):S10. doi: 10.1186/1752-0509-9-S1-S10. Epub 2015 Feb 6.
9
Identification of DNA-binding proteins by incorporating evolutionary information into pseudo amino acid composition via the top-n-gram approach.通过 top-n-gram 方法将进化信息纳入伪氨基酸组成,从而鉴定 DNA 结合蛋白。
J Biomol Struct Dyn. 2015;33(8):1720-30. doi: 10.1080/07391102.2014.968624. Epub 2014 Oct 28.
10
iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition.iDNA-Prot|dis:通过将氨基酸距离对和简化字母表概况纳入通用伪氨基酸组成来鉴定DNA结合蛋白。
PLoS One. 2014 Sep 3;9(9):e106691. doi: 10.1371/journal.pone.0106691. eCollection 2014.