Suppr超能文献

PSI-2:用于覆盖蛋白质结构域家族空间的结构基因组学。

PSI-2: structural genomics to cover protein domain family space.

作者信息

Dessailly Benoît H, Nair Rajesh, Jaroszewski Lukasz, Fajardo J Eduardo, Kouranov Andrei, Lee David, Fiser Andras, Godzik Adam, Rost Burkhard, Orengo Christine

机构信息

Department of Structural and Molecular Biology, University College of London, London WC1E6BT, UK.

出版信息

Structure. 2009 Jun 10;17(6):869-81. doi: 10.1016/j.str.2009.03.015.

Abstract

One major objective of structural genomics efforts, including the NIH-funded Protein Structure Initiative (PSI), has been to increase the structural coverage of protein sequence space. Here, we present the target selection strategy used during the second phase of PSI (PSI-2). This strategy, jointly devised by the bioinformatics groups associated with the PSI-2 large-scale production centers, targets representatives from large, structurally uncharacterized protein domain families, and from structurally uncharacterized subfamilies in very large and diverse families with incomplete structural coverage. These very large families are extremely diverse both structurally and functionally, and are highly overrepresented in known proteomes. On the basis of several metrics, we then discuss to what extent PSI-2, during its first 3 years, has increased the structural coverage of genomes, and contributed structural and functional novelty. Together, the results presented here suggest that PSI-2 is successfully meeting its objectives and provides useful insights into structural and functional space.

摘要

包括美国国立卫生研究院资助的蛋白质结构计划(PSI)在内的结构基因组学研究的一个主要目标,是增加蛋白质序列空间的结构覆盖范围。在此,我们展示了PSI第二阶段(PSI-2)所采用的靶点选择策略。该策略由与PSI-2大规模生产中心相关的生物信息学团队共同设计,目标是来自大型、结构未表征的蛋白质结构域家族,以及来自结构覆盖不完整的非常大且多样的家族中的结构未表征亚家族的代表。这些非常大的家族在结构和功能上极其多样,并且在已知蛋白质组中高度富集。基于多项指标,我们接着讨论了PSI-2在其头3年里在多大程度上增加了基因组的结构覆盖范围,并贡献了结构和功能上的新颖性。总体而言,此处呈现的结果表明PSI-2正在成功实现其目标,并为结构和功能空间提供了有用的见解。

相似文献

2
Structural genomics is the largest contributor of novel structural leverage.结构基因组学是新型结构杠杆作用的最大贡献者。
J Struct Funct Genomics. 2009 Apr;10(2):181-91. doi: 10.1007/s10969-008-9055-6. Epub 2009 Feb 5.
8
The impact of structural genomics: the first quindecennial.结构基因组学的影响:首个十五年。
J Struct Funct Genomics. 2016 Mar;17(1):1-16. doi: 10.1007/s10969-016-9201-5. Epub 2016 Mar 2.
10
Structure-based functional inference in structural genomics.结构基因组学中基于结构的功能推断
J Struct Funct Genomics. 2003;4(2-3):129-35. doi: 10.1023/a:1026200610644.

引用本文的文献

1
Novel machine learning approaches revolutionize protein knowledge.新型机器学习方法彻底改变了蛋白质知识。
Trends Biochem Sci. 2023 Apr;48(4):345-359. doi: 10.1016/j.tibs.2022.11.001. Epub 2022 Dec 9.
2
Contrastive learning on protein embeddings enlightens midnight zone.蛋白质嵌入的对比学习照亮了午夜区。
NAR Genom Bioinform. 2022 Jun 11;4(2):lqac043. doi: 10.1093/nargab/lqac043. eCollection 2022 Jun.
5
Fine Sampling of Sequence Space for Membrane Protein Structural Biology.对膜蛋白结构生物学序列空间的精细采样。
J Mol Biol. 2021 Jul 23;433(15):167055. doi: 10.1016/j.jmb.2021.167055. Epub 2021 May 20.
8
Anticipating innovations in structural biology.预测结构生物学的创新。
Q Rev Biophys. 2018 Jan;51:e8. doi: 10.1017/S0033583518000057.

本文引用的文献

1
The protein structure initiative structural genomics knowledgebase.蛋白质结构创新计划结构基因组学知识库。
Nucleic Acids Res. 2009 Jan;37(Database issue):D365-8. doi: 10.1093/nar/gkn790. Epub 2008 Nov 14.
2
Arrangements in the modular evolution of proteins.蛋白质模块化进化中的排列方式。
Trends Biochem Sci. 2008 Sep;33(9):444-51. doi: 10.1016/j.tibs.2008.05.008. Epub 2008 Jul 24.
4
Exploring the structure and function paradigm.探索结构与功能范式。
Curr Opin Struct Biol. 2008 Jun;18(3):394-402. doi: 10.1016/j.sbi.2008.05.007.
5
Target selection for structural genomics: an overview.结构基因组学的靶点选择:综述
Methods Mol Biol. 2008;426:3-25. doi: 10.1007/978-1-60327-058-8_1.
6
The structure of protein evolution and the evolution of protein structure.蛋白质进化的结构与蛋白质结构的进化。
Curr Opin Struct Biol. 2008 Apr;18(2):170-7. doi: 10.1016/j.sbi.2008.01.006. Epub 2008 Mar 6.
7
Update on the protein structure initiative.蛋白质结构启动计划最新进展。
Structure. 2007 Dec;15(12):1519-22. doi: 10.1016/j.str.2007.11.004.
8
The universal protein resource (UniProt).通用蛋白质资源(UniProt)。
Nucleic Acids Res. 2008 Jan;36(Database issue):D190-5. doi: 10.1093/nar/gkm895. Epub 2007 Nov 27.
9
The Pfam protein families database.Pfam蛋白质家族数据库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D281-8. doi: 10.1093/nar/gkm960. Epub 2007 Nov 26.
10
Gene3D: comprehensive structural and functional annotation of genomes.基因3D:基因组的全面结构与功能注释
Nucleic Acids Res. 2008 Jan;36(Database issue):D414-8. doi: 10.1093/nar/gkm1019. Epub 2007 Nov 21.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验