• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于目标序列的小鼠同源域亚家族分类反映了它们在体外的 DNA 结合偏好。

Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences.

机构信息

Molecular Structure and Function Program, Hospital for Sick Children, Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.

出版信息

Nucleic Acids Res. 2010 Dec;38(22):7927-42. doi: 10.1093/nar/gkq714. Epub 2010 Aug 12.

DOI:10.1093/nar/gkq714
PMID:20705649
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3001082/
Abstract

Classifying proteins into subgroups with similar molecular function on the basis of sequence is an important step in deriving reliable functional annotations computationally. So far, however, available classification procedures have been evaluated against protein subgroups that are defined by experts using mainly qualitative descriptions of molecular function. Recently, in vitro DNA-binding preferences to all possible 8-nt DNA sequences have been measured for 178 mouse homeodomains using protein-binding microarrays, offering the unprecedented opportunity of evaluating the classification methods against quantitative measures of molecular function. To this end, we automatically derive homeodomain subtypes from the DNA-binding data and independently group the same domains using sequence information alone. We test five sequence-based methods, which use different sequence-similarity measures and algorithms to group sequences. Results show that methods that optimize the classification robustness reflect well the detailed functional specificity revealed by the experimental data. In some of these classifications, 73-83% of the subfamilies exactly correspond to, or are completely contained in, the function-based subtypes. Our findings demonstrate that certain sequence-based classifications are capable of yielding very specific molecular function annotations. The availability of quantitative descriptions of molecular function, such as DNA-binding data, will be a key factor in exploiting this potential in the future.

摘要

将蛋白质按其分子功能相似性分类到亚组中,是从计算上推导出可靠功能注释的重要步骤。然而,到目前为止,可用的分类程序是针对专家使用主要基于分子功能的定性描述来定义的蛋白质亚组进行评估的。最近,使用蛋白质结合微阵列测量了 178 个小鼠同源域对所有可能的 8 个核苷酸 DNA 序列的体外 DNA 结合偏好性,为评估分类方法对分子功能的定量测量提供了前所未有的机会。为此,我们从 DNA 结合数据中自动推导出同源域亚型,并仅使用序列信息独立地对同一结构域进行分组。我们测试了五种基于序列的方法,这些方法使用不同的序列相似性度量和算法来对序列进行分组。结果表明,优化分类稳健性的方法很好地反映了实验数据揭示的详细功能特异性。在这些分类中的某些分类中,73%-83%的亚科与基于功能的亚型完全对应或完全包含在其中。我们的发现表明,某些基于序列的分类能够产生非常具体的分子功能注释。将来,定量描述分子功能(如 DNA 结合数据)的可用性将是利用这一潜力的关键因素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/185a/3001082/98ff38683e9d/gkq714f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/185a/3001082/7fab06610f83/gkq714f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/185a/3001082/40a58b667920/gkq714f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/185a/3001082/3c3fcb7cbc77/gkq714f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/185a/3001082/2631d7cfc259/gkq714f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/185a/3001082/98ff38683e9d/gkq714f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/185a/3001082/7fab06610f83/gkq714f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/185a/3001082/40a58b667920/gkq714f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/185a/3001082/3c3fcb7cbc77/gkq714f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/185a/3001082/2631d7cfc259/gkq714f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/185a/3001082/98ff38683e9d/gkq714f5.jpg

相似文献

1
Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences.基于目标序列的小鼠同源域亚家族分类反映了它们在体外的 DNA 结合偏好。
Nucleic Acids Res. 2010 Dec;38(22):7927-42. doi: 10.1093/nar/gkq714. Epub 2010 Aug 12.
2
Variation in homeodomain DNA binding revealed by high-resolution analysis of sequence preferences.通过对序列偏好的高分辨率分析揭示的同源域DNA结合变异。
Cell. 2008 Jun 27;133(7):1266-76. doi: 10.1016/j.cell.2008.05.024.
3
Homeodomain proteins: what governs their ability to recognize specific DNA sequences?同源结构域蛋白:是什么决定了它们识别特定DNA序列的能力?
J Mol Biol. 1995 Jul 28;250(5):595-608. doi: 10.1006/jmbi.1995.0401.
4
Improving Hox protein classification across the major model organisms.改进主要模式生物中的 Hox 蛋白分类。
PLoS One. 2010 May 25;5(5):e10820. doi: 10.1371/journal.pone.0010820.
5
A molecular code dictates sequence-specific DNA recognition by homeodomains.一种分子编码决定了同源结构域对特定序列DNA的识别。
EMBO J. 1996 Sep 16;15(18):4992-5000.
6
Functional interference between contacting amino acids of homeodomains.同源结构域中相互接触的氨基酸之间的功能干扰。
FEBS Lett. 1997 May 5;407(3):320-4. doi: 10.1016/s0014-5793(97)00368-2.
7
Insights into nonspecific binding of homeodomains from a structure of MATalpha2 bound to DNA.从与DNA结合的MATalpha2结构洞察同源结构域的非特异性结合。
Proteins. 2003 Jun 1;51(4):544-51. doi: 10.1002/prot.10375.
8
Disordered tails of homeodomains facilitate DNA recognition by providing a trade-off between folding and specific binding.无序的同源域尾部通过在折叠和特异性结合之间提供一种权衡,从而促进 DNA 的识别。
J Am Chem Soc. 2009 Oct 28;131(42):15084-5. doi: 10.1021/ja9052784.
9
[Structural aspects of homeodomain interactions with DNA].[同源结构域与DNA相互作用的结构方面]
Mol Biol (Mosk). 2001 Sep-Oct;35(5):764-77.
10
An automated phylogenetic key for classifying homeoboxes.一种用于对同源异型框进行分类的自动化系统发育检索表。
Mol Phylogenet Evol. 2002 Sep;24(3):388-99. doi: 10.1016/s1055-7903(02)00259-2.

引用本文的文献

1
Automated genomic context analysis and experimental validation platform for discovery of prokaryote transcriptional regulator functions.用于发现原核生物转录调节因子功能的自动化基因组背景分析与实验验证平台。
BMC Genomics. 2014 Dec 18;15(1):1142. doi: 10.1186/1471-2164-15-1142.
2
Reverse engineering systems models of regulation: discovery, prediction and mechanisms.逆向工程调控系统模型:发现、预测和机制。
Curr Opin Biotechnol. 2012 Aug;23(4):598-603. doi: 10.1016/j.copbio.2011.12.005. Epub 2011 Dec 28.

本文引用的文献

1
An atlas of combinatorial transcriptional regulation in mouse and man.《小鼠和人类组合转录调控图谱》
Cell. 2010 Mar 5;140(5):744-52. doi: 10.1016/j.cell.2010.01.044.
2
The Pfam protein families database.Pfam 蛋白质家族数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D211-22. doi: 10.1093/nar/gkp985. Epub 2009 Nov 17.
3
Gene3D: merging structure and function for a Thousand genomes.Gene3D:整合结构与功能的千基因组。
Nucleic Acids Res. 2010 Jan;38(Database issue):D296-300. doi: 10.1093/nar/gkp987. Epub 2009 Nov 11.
4
Diversity and complexity in DNA recognition by transcription factors.转录因子对DNA识别的多样性与复杂性
Science. 2009 Jun 26;324(5935):1720-3. doi: 10.1126/science.1162327. Epub 2009 May 14.
5
Markov clustering versus affinity propagation for the partitioning of protein interaction graphs.用于蛋白质相互作用图划分的马尔可夫聚类与亲和传播算法
BMC Bioinformatics. 2009 Mar 30;10:99. doi: 10.1186/1471-2105-10-99.
6
Domain-based and family-specific sequence identity thresholds increase the levels of reliable protein function transfer.基于结构域和家族特异性的序列同一性阈值提高了可靠蛋白质功能转移的水平。
J Mol Biol. 2009 Mar 27;387(2):416-30. doi: 10.1016/j.jmb.2008.12.045. Epub 2008 Dec 25.
7
SUPERFAMILY--sophisticated comparative genomics, data mining, visualization and phylogeny.超级家族——精密的比较基因组学、数据挖掘、可视化及系统发育学。
Nucleic Acids Res. 2009 Jan;37(Database issue):D380-6. doi: 10.1093/nar/gkn762. Epub 2008 Nov 26.
8
AmiGO: online access to ontology and annotation data.AmiGO:在线访问本体和注释数据。
Bioinformatics. 2009 Jan 15;25(2):288-9. doi: 10.1093/bioinformatics/btn615. Epub 2008 Nov 25.
9
InterPro: the integrative protein signature database.InterPro:综合蛋白质特征数据库。
Nucleic Acids Res. 2009 Jan;37(Database issue):D211-5. doi: 10.1093/nar/gkn785. Epub 2008 Oct 21.
10
HomeoDB: a database of homeobox gene diversity.同源异型框基因数据库(HomeoDB):一个关于同源异型框基因多样性的数据库。
Evol Dev. 2008 Sep-Oct;10(5):516-8. doi: 10.1111/j.1525-142X.2008.00266.x.