• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

迈向结构基因组学中基于结构的全自动化功能预测:一个案例研究

Towards fully automated structure-based function prediction in structural genomics: a case study.

作者信息

Watson James D, Sanderson Steve, Ezersky Alexandra, Savchenko Alexei, Edwards Aled, Orengo Christine, Joachimiak Andrzej, Laskowski Roman A, Thornton Janet M

机构信息

EMBL--European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.

出版信息

J Mol Biol. 2007 Apr 13;367(5):1511-22. doi: 10.1016/j.jmb.2007.01.063. Epub 2007 Jan 30.

DOI:10.1016/j.jmb.2007.01.063
PMID:17316683
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2566530/
Abstract

As the global Structural Genomics projects have picked up pace, the number of structures annotated in the Protein Data Bank as hypothetical protein or unknown function has grown significantly. A major challenge now involves the development of computational methods to assign functions to these proteins accurately and automatically. As part of the Midwest Center for Structural Genomics (MCSG) we have developed a fully automated functional analysis server, ProFunc, which performs a battery of analyses on a submitted structure. The analyses combine a number of sequence-based and structure-based methods to identify functional clues. After the first stage of the Protein Structure Initiative (PSI), we review the success of the pipeline and the importance of structure-based function prediction. As a dataset, we have chosen all structures solved by the MCSG during the 5 years of the first PSI. Our analysis suggests that two of the structure-based methods are particularly successful and provide examples of local similarity that is difficult to identify using current sequence-based methods. No one method is successful in all cases, so, through the use of a number of complementary sequence and structural approaches, the ProFunc server increases the chances that at least one method will find a significant hit that can help elucidate function. Manual assessment of the results is a time-consuming process and subject to individual interpretation and human error. We present a method based on the Gene Ontology (GO) schema using GO-slims that can allow the automated assessment of hits with a success rate approaching that of expert manual assessment.

摘要

随着全球结构基因组学项目的加速推进,蛋白质数据库中注释为假设蛋白或功能未知的结构数量显著增加。当前的一项重大挑战是开发计算方法,以准确、自动地为这些蛋白质赋予功能。作为中西部结构基因组学中心(MCSG)的一部分,我们开发了一个全自动功能分析服务器ProFunc,它对提交的结构进行一系列分析。这些分析结合了多种基于序列和基于结构的方法来识别功能线索。在蛋白质结构计划(PSI)的第一阶段之后,我们回顾了该流程的成功之处以及基于结构的功能预测的重要性。作为一个数据集,我们选择了MCSG在PSI第一阶段的5年中解析的所有结构。我们的分析表明,两种基于结构的方法特别成功,并提供了使用当前基于序列的方法难以识别的局部相似性示例。没有一种方法在所有情况下都能成功,因此,通过使用多种互补的序列和结构方法,ProFunc服务器增加了至少一种方法找到有助于阐明功能的显著匹配的机会。对结果进行人工评估是一个耗时的过程,并且容易受到个人解读和人为错误的影响。我们提出了一种基于基因本体(GO)模式并使用GO精简版的方法,该方法可以对匹配结果进行自动评估,成功率接近专家人工评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6244/2566530/e64297a99204/nihms20879f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6244/2566530/7f18d4a7ba8f/nihms20879f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6244/2566530/30e6859aa593/nihms20879f2a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6244/2566530/b1824a9ddb1d/nihms20879f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6244/2566530/e64297a99204/nihms20879f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6244/2566530/7f18d4a7ba8f/nihms20879f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6244/2566530/30e6859aa593/nihms20879f2a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6244/2566530/b1824a9ddb1d/nihms20879f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6244/2566530/e64297a99204/nihms20879f4.jpg

相似文献

1
Towards fully automated structure-based function prediction in structural genomics: a case study.迈向结构基因组学中基于结构的全自动化功能预测:一个案例研究
J Mol Biol. 2007 Apr 13;367(5):1511-22. doi: 10.1016/j.jmb.2007.01.063. Epub 2007 Jan 30.
2
1,000 structures and more from the MCSG.来自MCSG的1000种及更多结构
BMC Struct Biol. 2011 Jan 10;11:2. doi: 10.1186/1472-6807-11-2.
3
Identification of similar regions of protein structures using integrated sequence and structure analysis tools.使用综合序列和结构分析工具鉴定蛋白质结构的相似区域。
BMC Struct Biol. 2006 Mar 9;6:4. doi: 10.1186/1472-6807-6-4.
4
Progress of structural genomics initiatives: an analysis of solved target structures.结构基因组学计划的进展:已解析目标结构的分析
J Mol Biol. 2005 May 20;348(5):1235-60. doi: 10.1016/j.jmb.2005.03.037. Epub 2005 Apr 2.
5
Pcons.net: protein structure prediction meta server.Pcons.net:蛋白质结构预测元服务器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W369-74. doi: 10.1093/nar/gkm319. Epub 2007 Jun 21.
6
ProFunc: a server for predicting protein function from 3D structure.ProFunc:一个通过三维结构预测蛋白质功能的服务器。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W89-93. doi: 10.1093/nar/gki414.
7
The ProFunc Function Prediction Server.ProFunc功能预测服务器。
Methods Mol Biol. 2017;1611:75-95. doi: 10.1007/978-1-4939-7015-5_7.
8
Target selection for structural genomics: an overview.结构基因组学的靶点选择:综述
Methods Mol Biol. 2008;426:3-25. doi: 10.1007/978-1-60327-058-8_1.
9
PROSPECT-PSPP: an automatic computational pipeline for protein structure prediction.PROSPECT-PSPP:一种用于蛋白质结构预测的自动计算流程。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W522-5. doi: 10.1093/nar/gkh414.
10
Predicted role for the archease protein family based on structural and sequence analysis of TM1083 and MTH1598, two proteins structurally characterized through structural genomics efforts.基于TM1083和MTH1598这两种通过结构基因组学研究确定了结构特征的蛋白质的结构和序列分析,对解旋酶蛋白家族的预测作用。
Proteins. 2004 Jul 1;56(1):19-27. doi: 10.1002/prot.20141.

引用本文的文献

1
Predicting Protein Function in the AI and Big Data Era.人工智能与大数据时代的蛋白质功能预测
Biochemistry. 2025 Jun 3;64(11):2345-2352. doi: 10.1021/acs.biochem.5c00186. Epub 2025 May 17.
2
Functional prediction of proteins from the human gut archaeome.来自人类肠道古菌组的蛋白质功能预测
ISME Commun. 2024 Jan 10;4(1):ycad014. doi: 10.1093/ismeco/ycad014. eCollection 2024 Jan.
3
Isolation, characterization and in silico analysis of 3-Hydroxy-3-methylglutaryl-coenzyme A reductase (HMGR) gene from Andrographis paniculata (Burm. f) Nees.

本文引用的文献

1
Analysis of metabolic and regulatory pathways through Gene Ontology-derived semantic similarity measures.通过基因本体论衍生的语义相似性度量分析代谢和调控途径。
AMIA Annu Symp Proc. 2005;2005:972.
2
Protein classification using probabilistic chain graphs and the Gene Ontology structure.使用概率链图和基因本体结构进行蛋白质分类。
Bioinformatics. 2006 Aug 1;22(15):1871-8. doi: 10.1093/bioinformatics/btl187. Epub 2006 May 16.
3
Gene3D: modelling protein structure, function and evolution.基因3D:蛋白质结构、功能及进化建模
从穿心莲(Burm. f) Nees 中分离、鉴定和计算机分析 3-羟基-3-甲基戊二酰辅酶 A 还原酶(HMGR)基因。
Mol Biol Rep. 2020 Jan;47(1):639-654. doi: 10.1007/s11033-019-05172-0. Epub 2019 Nov 28.
4
Functional Prediction of Hypothetical Transcription Factors of K-12 Based on Expression Data.基于表达数据对K-12假定转录因子的功能预测
Comput Struct Biotechnol J. 2018 Mar 27;16:157-166. doi: 10.1016/j.csbj.2018.03.003. eCollection 2018.
5
A Novel Geometry-Based Approach to Infer Protein Interface Similarity.一种基于新几何结构的蛋白质界面相似性推断方法。
Sci Rep. 2018 May 29;8(1):8192. doi: 10.1038/s41598-018-26497-z.
6
The impact of structural genomics: the first quindecennial.结构基因组学的影响:首个十五年。
J Struct Funct Genomics. 2016 Mar;17(1):1-16. doi: 10.1007/s10969-016-9201-5. Epub 2016 Mar 2.
7
Use of Nanostructure-Initiator Mass Spectrometry to Deduce Selectivity of Reaction in Glycoside Hydrolases.利用纳米结构引发质谱推断糖苷水解酶反应的选择性。
Front Bioeng Biotechnol. 2015 Oct 27;3:165. doi: 10.3389/fbioe.2015.00165. eCollection 2015.
8
Characterization and structure prediction of partial length protein sequences of pcoA, pcoR and chrB genes from heavy metal resistant bacteria from the Klip River, South Africa.来自南非克利普河重金属抗性细菌的pcoA、pcoR和chrB基因部分长度蛋白质序列的表征与结构预测
Int J Mol Sci. 2015 Apr 1;16(4):7352-74. doi: 10.3390/ijms16047352.
9
Covalent docking predicts substrates for haloalkanoate dehalogenase superfamily phosphatases.共价对接预测卤代烷酸脱卤酶超家族磷酸酶的底物。
Biochemistry. 2015 Jan 20;54(2):528-37. doi: 10.1021/bi501140k. Epub 2015 Jan 5.
10
Systematic structural characterization of metabolites in Arabidopsis via candidate substrate-product pair networks.通过候选底物-产物对网络对拟南芥代谢物进行系统的结构表征。
Plant Cell. 2014 Mar;26(3):929-45. doi: 10.1105/tpc.113.122242. Epub 2014 Mar 31.
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D281-4. doi: 10.1093/nar/gkj057.
4
Protein function prediction using local 3D templates.使用局部三维模板进行蛋白质功能预测。
J Mol Biol. 2005 Aug 19;351(3):614-26. doi: 10.1016/j.jmb.2005.05.067.
5
ProFunc: a server for predicting protein function from 3D structure.ProFunc:一个通过三维结构预测蛋白质功能的服务器。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W89-93. doi: 10.1093/nar/gki414.
6
Who tangos with GOA?-Use of Gene Ontology Annotation (GOA) for biological interpretation of '-omics' data and for validation of automatic annotation tools.谁与基因本体注释(GOA)共舞?——利用基因本体注释(GOA)对“组学”数据进行生物学解读及验证自动注释工具
In Silico Biol. 2005;5(1):5-8.
7
Predicting protein function from sequence and structural data.从序列和结构数据预测蛋白质功能。
Curr Opin Struct Biol. 2005 Jun;15(3):275-84. doi: 10.1016/j.sbi.2005.04.003.
8
A procedure for assessing GO annotation consistency.一种评估基因本体(GO)注释一致性的程序。
Bioinformatics. 2005 Jun;21 Suppl 1:i136-43. doi: 10.1093/bioinformatics/bti1019.
9
GenBank.基因银行
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D34-8. doi: 10.1093/nar/gki063.
10
The Universal Protein Resource (UniProt).通用蛋白质资源(UniProt)。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D154-9. doi: 10.1093/nar/gki070.