• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从序列中提高 DNA 结合域的预测和理解能力。

Boosting the prediction and understanding of DNA-binding domains from sequence.

机构信息

Department of Bioengineering, University of Illinois at Chicago, Chicago, IL 60612, USA.

出版信息

Nucleic Acids Res. 2010 Jun;38(10):3149-58. doi: 10.1093/nar/gkq061. Epub 2010 Feb 15.

DOI:10.1093/nar/gkq061
PMID:20156993
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2879530/
Abstract

DNA-binding proteins perform vital functions related to transcription, repair and replication. We have developed a new sequence-based machine learning protocol to identify DNA-binding proteins. We compare our method with an extensive benchmark of previously published structure-based machine learning methods as well as a standard sequence alignment technique, BLAST. Furthermore, we elucidate important feature interactions found in a learned model and analyze how specific rules capture general mechanisms that extend across DNA-binding motifs. This analysis is carried out using the malibu machine learning workbench available at http://proteomics.bioengr.uic.edu/malibu and the corresponding data sets and features are available at http://proteomics.bioengr.uic.edu/dna.

摘要

DNA 结合蛋白执行与转录、修复和复制相关的重要功能。我们开发了一种新的基于序列的机器学习协议来识别 DNA 结合蛋白。我们将我们的方法与广泛的先前发表的基于结构的机器学习方法的基准以及标准序列比对技术 BLAST 进行了比较。此外,我们阐明了在学习模型中发现的重要特征相互作用,并分析了特定规则如何捕获跨 DNA 结合基序延伸的一般机制。这项分析是使用可在 http://proteomics.bioengr.uic.edu/malibu 上获得的 malibu 机器学习工作台以及可在 http://proteomics.bioengr.uic.edu/dna 上获得的相应数据集和特征来进行的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c3d/2879530/615036eeab3b/gkq061f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c3d/2879530/6f4418b2df4d/gkq061f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c3d/2879530/b0c701eb24a6/gkq061f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c3d/2879530/97686b3b921d/gkq061f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c3d/2879530/615036eeab3b/gkq061f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c3d/2879530/6f4418b2df4d/gkq061f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c3d/2879530/b0c701eb24a6/gkq061f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c3d/2879530/97686b3b921d/gkq061f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c3d/2879530/615036eeab3b/gkq061f4.jpg

相似文献

1
Boosting the prediction and understanding of DNA-binding domains from sequence.从序列中提高 DNA 结合域的预测和理解能力。
Nucleic Acids Res. 2010 Jun;38(10):3149-58. doi: 10.1093/nar/gkq061. Epub 2010 Feb 15.
2
Intelligible machine learning with malibu.使用马里布的可理解机器学习。
Annu Int Conf IEEE Eng Med Biol Soc. 2008;2008:3795-8. doi: 10.1109/IEMBS.2008.4650035.
3
Learning to translate sequence and structure to function: identifying DNA binding and membrane binding proteins.学习将序列和结构转化为功能:识别DNA结合蛋白和膜结合蛋白。
Ann Biomed Eng. 2007 Jun;35(6):1043-52. doi: 10.1007/s10439-007-9312-z. Epub 2007 Apr 13.
4
A structure-based protocol for learning the family-specific mechanisms of membrane-binding domains.基于结构的膜结合结构域家族特异性作用机制学习方案
Bioinformatics. 2012 Sep 15;28(18):i431-i437. doi: 10.1093/bioinformatics/bts409.
5
NAPS: a residue-level nucleic acid-binding prediction server.NAPS:一种残基水平的核酸结合预测服务器。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W431-5. doi: 10.1093/nar/gkq361. Epub 2010 May 16.
6
DP-Bind: a web server for sequence-based prediction of DNA-binding residues in DNA-binding proteins.DP-Bind:一个用于基于序列预测DNA结合蛋白中DNA结合残基的网络服务器。
Bioinformatics. 2007 Mar 1;23(5):634-6. doi: 10.1093/bioinformatics/btl672. Epub 2007 Jan 19.
7
Prediction of DNA-binding residues from sequence features.基于序列特征预测DNA结合残基。
J Bioinform Comput Biol. 2006 Dec;4(6):1141-58. doi: 10.1142/s0219720006002387.
8
DoBo: Protein domain boundary prediction by integrating evolutionary signals and machine learning.多宝:通过整合进化信号和机器学习进行蛋白质结构域边界预测。
BMC Bioinformatics. 2011 Feb 1;12:43. doi: 10.1186/1471-2105-12-43.
9
A novel sequence-based method of predicting protein DNA-binding residues, using a machine learning approach.一种基于序列的新型方法,用于使用机器学习方法预测蛋白质 DNA 结合残基。
Mol Cells. 2010 Aug;30(2):99-105. doi: 10.1007/s10059-010-0093-0. Epub 2010 Jul 23.
10
Building an automated classification of DNA-binding protein domains.构建DNA结合蛋白结构域的自动分类。
Bioinformatics. 2002;18 Suppl 2:S192-201. doi: 10.1093/bioinformatics/18.suppl_2.s192.

引用本文的文献

1
Benchmarking recent computational tools for DNA-binding protein identification.对近期用于DNA结合蛋白识别的计算工具进行基准测试。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae634.
2
ProkDBP: Toward more precise identification of prokaryotic DNA binding proteins.ProkDBP:致力于更精确地识别原核 DNA 结合蛋白。
Protein Sci. 2024 Jun;33(6):e5015. doi: 10.1002/pro.5015.
3
DNAPred_Prot: Identification of DNA-Binding Proteins Using Composition- and Position-Based Features.DNAPred_Prot:利用基于组成和位置的特征识别DNA结合蛋白。

本文引用的文献

1
On the divalent metal ion dependence of DNA cleavage by restriction endonucleases of the EcoRI family.关于EcoRI家族限制性内切核酸酶切割DNA对二价金属离子的依赖性
J Mol Biol. 2009 Oct 16;393(1):140-60. doi: 10.1016/j.jmb.2009.08.011. Epub 2009 Aug 13.
2
From nonspecific DNA-protein encounter complexes to the prediction of DNA-protein interactions.从非特异性DNA-蛋白质相遇复合物到DNA-蛋白质相互作用的预测
PLoS Comput Biol. 2009 Mar;5(3):e1000341. doi: 10.1371/journal.pcbi.1000341. Epub 2009 Apr 3.
3
Identification of DNA-binding proteins using structural, electrostatic and evolutionary features.
Appl Bionics Biomech. 2022 Apr 13;2022:5483115. doi: 10.1155/2022/5483115. eCollection 2022.
4
PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method.PredDBP-Stack:基于堆叠集成方法的使用 HMM 轮廓预测 DNA 结合蛋白
Biomed Res Int. 2020 Apr 13;2020:7297631. doi: 10.1155/2020/7297631. eCollection 2020.
5
HMMPred: Accurate Prediction of DNA-Binding Proteins Based on HMM Profiles and XGBoost Feature Selection.HMMPred:基于 HMM 轮廓和 XGBoost 特征选择的 DNA 结合蛋白精确预测。
Comput Math Methods Med. 2020 Mar 28;2020:1384749. doi: 10.1155/2020/1384749. eCollection 2020.
6
Functional Site Discovery From Incomplete Training Data: A Case Study With Nucleic Acid-Binding Proteins.从不完整训练数据中发现功能位点:以核酸结合蛋白为例的研究
Front Genet. 2019 Aug 30;10:729. doi: 10.3389/fgene.2019.00729. eCollection 2019.
7
HMMBinder: DNA-Binding Protein Prediction Using HMM Profile Based Features.HMMBinder:基于 HMM -profile 特征的 DNA 结合蛋白预测。
Biomed Res Int. 2017;2017:4590609. doi: 10.1155/2017/4590609. Epub 2017 Nov 14.
8
iDNAProt-ES: Identification of DNA-binding Proteins Using Evolutionary and Structural Features.iDNAProt-ES:利用进化和结构特征鉴定 DNA 结合蛋白。
Sci Rep. 2017 Nov 2;7(1):14938. doi: 10.1038/s41598-017-14945-1.
9
Identification of DNA-binding proteins using multi-features fusion and binary firefly optimization algorithm.基于多特征融合和二进制萤火虫优化算法的DNA结合蛋白识别
BMC Bioinformatics. 2016 Aug 26;17(1):323. doi: 10.1186/s12859-016-1201-8.
10
DNA-binding protein prediction using plant specific support vector machines: validation and application of a new genome annotation tool.使用植物特异性支持向量机进行DNA结合蛋白预测:一种新的基因组注释工具的验证与应用
Nucleic Acids Res. 2015 Dec 15;43(22):e158. doi: 10.1093/nar/gkv805. Epub 2015 Aug 24.
利用结构、静电和进化特征鉴定DNA结合蛋白。
J Mol Biol. 2009 Apr 10;387(4):1040-53. doi: 10.1016/j.jmb.2009.02.023. Epub 2009 Feb 20.
4
Intelligible machine learning with malibu.使用马里布的可理解机器学习。
Annu Int Conf IEEE Eng Med Biol Soc. 2008;2008:3795-8. doi: 10.1109/IEMBS.2008.4650035.
5
Proteome-wide prediction of novel DNA/RNA-binding proteins using amino acid composition and periodicity in the hyperthermophilic archaeon Pyrococcus furiosus.利用嗜热古菌激烈火球菌中的氨基酸组成和周期性对新型DNA/RNA结合蛋白进行全蛋白质组预测。
DNA Res. 2007 Jun 30;14(3):91-102. doi: 10.1093/dnares/dsm011. Epub 2007 Jun 15.
6
Learning to translate sequence and structure to function: identifying DNA binding and membrane binding proteins.学习将序列和结构转化为功能:识别DNA结合蛋白和膜结合蛋白。
Ann Biomed Eng. 2007 Jun;35(6):1043-52. doi: 10.1007/s10439-007-9312-z. Epub 2007 Apr 13.
7
Superfamily assignments for the yeast proteome through integration of structure prediction with the gene ontology.通过将结构预测与基因本体相结合对酵母蛋白质组进行超家族分类。
PLoS Biol. 2007 Apr;5(4):e76. doi: 10.1371/journal.pbio.0050076.
8
Residue-level prediction of DNA-binding sites and its application on DNA-binding protein predictions.DNA结合位点的残基水平预测及其在DNA结合蛋白预测中的应用。
FEBS Lett. 2007 Mar 6;581(5):1058-66. doi: 10.1016/j.febslet.2007.01.086. Epub 2007 Feb 7.
9
DISPLAR: an accurate method for predicting DNA-binding sites on protein surfaces.DISPLAR:一种预测蛋白质表面DNA结合位点的精确方法。
Nucleic Acids Res. 2007;35(5):1465-77. doi: 10.1093/nar/gkm008. Epub 2007 Feb 6.
10
Structure Based Prediction of Binding Residues on DNA-binding Proteins.基于结构预测DNA结合蛋白上的结合残基
Conf Proc IEEE Eng Med Biol Soc. 2005;2005:2611-4. doi: 10.1109/IEMBS.2005.1617004.