• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因组学 2 蛋白质门户:将遗传筛选结果与蛋白质序列和结构联系起来的资源和发现工具。

Genomics 2 Proteins portal: a resource and discovery tool for linking genetic screening outputs to protein sequences and structures.

机构信息

Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

PATTERN, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

出版信息

Nat Methods. 2024 Oct;21(10):1947-1957. doi: 10.1038/s41592-024-02409-0. Epub 2024 Sep 18.

DOI:10.1038/s41592-024-02409-0
PMID:39294369
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11466821/
Abstract

Recent advances in AI-based methods have revolutionized the field of structural biology. Concomitantly, high-throughput sequencing and functional genomics have generated genetic variants at an unprecedented scale. However, efficient tools and resources are needed to link disparate data types-to 'map' variants onto protein structures, to better understand how the variation causes disease, and thereby design therapeutics. Here we present the Genomics 2 Proteins portal ( https://g2p.broadinstitute.org/ ): a human proteome-wide resource that maps 20,076,998 genetic variants onto 42,413 protein sequences and 77,923 structures, with a comprehensive set of structural and functional features. Additionally, the Genomics 2 Proteins portal allows users to interactively upload protein residue-wise annotations (for example, variants and scores) as well as the protein structure beyond databases to establish the connection between genomics to proteins. The portal serves as an easy-to-use discovery tool for researchers and scientists to hypothesize the structure-function relationship between natural or synthetic variations and their molecular phenotypes.

摘要

基于人工智能的方法的最新进展彻底改变了结构生物学领域。与此同时,高通量测序和功能基因组学以前所未有的规模产生了遗传变异。然而,需要有效的工具和资源将不同的数据类型联系起来,将变体映射到蛋白质结构上,以更好地理解变异如何导致疾病,从而设计治疗方法。在这里,我们介绍基因组学 2 蛋白质门户 ( https://g2p.broadinstitute.org/ ):一个人类蛋白质组范围的资源,它将 20,076,998 个遗传变体映射到 42,413 个蛋白质序列和 77,923 个结构上,并具有全面的结构和功能特征。此外,基因组学 2 蛋白质门户允许用户交互式地上传蛋白质残基注释(例如变体和分数)以及数据库之外的蛋白质结构,以建立基因组学与蛋白质之间的联系。该门户作为一个易于使用的发现工具,供研究人员和科学家假设自然或合成变异与其分子表型之间的结构-功能关系。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/fa7c1e5aad0a/41592_2024_2409_Fig9_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/c52f39397439/41592_2024_2409_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/2581d2bbe453/41592_2024_2409_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/478afabb9374/41592_2024_2409_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/dcd77751fa2a/41592_2024_2409_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/e7426551940f/41592_2024_2409_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/cebd12c8587d/41592_2024_2409_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/f6c6b034de0b/41592_2024_2409_Fig7_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/86f8173aeba9/41592_2024_2409_Fig8_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/fa7c1e5aad0a/41592_2024_2409_Fig9_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/c52f39397439/41592_2024_2409_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/2581d2bbe453/41592_2024_2409_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/478afabb9374/41592_2024_2409_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/dcd77751fa2a/41592_2024_2409_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/e7426551940f/41592_2024_2409_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/cebd12c8587d/41592_2024_2409_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/f6c6b034de0b/41592_2024_2409_Fig7_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/86f8173aeba9/41592_2024_2409_Fig8_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe64/11466821/fa7c1e5aad0a/41592_2024_2409_Fig9_ESM.jpg

相似文献

1
Genomics 2 Proteins portal: a resource and discovery tool for linking genetic screening outputs to protein sequences and structures.基因组学 2 蛋白质门户:将遗传筛选结果与蛋白质序列和结构联系起来的资源和发现工具。
Nat Methods. 2024 Oct;21(10):1947-1957. doi: 10.1038/s41592-024-02409-0. Epub 2024 Sep 18.
2
Genomics 2 Proteins portal: A resource and discovery tool for linking genetic screening outputs to protein sequences and structures.基因组学2蛋白质门户:一个将基因筛查结果与蛋白质序列和结构相联系的资源及发现工具。
bioRxiv. 2024 Jan 2:2024.01.02.573913. doi: 10.1101/2024.01.02.573913.
3
PhyreRisk: A Dynamic Web Application to Bridge Genomics, Proteomics and 3D Structural Data to Guide Interpretation of Human Genetic Variants.PhyreRisk:一个动态的 Web 应用程序,将基因组学、蛋白质组学和 3D 结构数据联系起来,指导人类遗传变异的解释。
J Mol Biol. 2019 Jun 14;431(13):2460-2466. doi: 10.1016/j.jmb.2019.04.043. Epub 2019 May 7.
4
G23D: Online tool for mapping and visualization of genomic variants on 3D protein structures.G23D:用于在三维蛋白质结构上绘制和可视化基因组变异的在线工具。
BMC Genomics. 2016 Aug 26;17(1):681. doi: 10.1186/s12864-016-3028-0.
5
The Structural Biology Knowledgebase: a portal to protein structures, sequences, functions, and methods.结构生物学知识库:蛋白质结构、序列、功能及方法的门户。
J Struct Funct Genomics. 2011 Jul;12(2):45-54. doi: 10.1007/s10969-011-9106-2. Epub 2011 Apr 7.
6
ProtVar: mapping and contextualizing human missense variation.ProtVar:人类错义变异的映射和情境化分析。
Nucleic Acids Res. 2024 Jul 5;52(W1):W140-W147. doi: 10.1093/nar/gkae413.
7
3D-GENOMICS: a database to compare structural and functional annotations of proteins between sequenced genomes.3D基因组学:一个用于比较已测序基因组之间蛋白质的结构和功能注释的数据库。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D245-50. doi: 10.1093/nar/gkh064.
8
OpenProt 2.0 builds a path to the functional characterization of alternative proteins.OpenProt 2.0 为探索替代蛋白的功能特性开辟了道路。
Nucleic Acids Res. 2024 Jan 5;52(D1):D522-D528. doi: 10.1093/nar/gkad1050.
9
Next Generation Protein Structure Predictions and Genetic Variant Interpretation.下一代蛋白质结构预测与基因变异解读
J Mol Biol. 2021 Oct 1;433(20):167180. doi: 10.1016/j.jmb.2021.167180. Epub 2021 Aug 4.
10
MODBASE, a database of annotated comparative protein structure models, and associated resources.MODBASE,一个带注释的比较蛋白质结构模型数据库及相关资源。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D217-22. doi: 10.1093/nar/gkh095.

引用本文的文献

1
Functional evidence for variant classification from mutational scanning.来自突变扫描的变异分类功能证据。
bioRxiv. 2025 Aug 15:2025.08.11.669723. doi: 10.1101/2025.08.11.669723.
2
CDKN2A and matrix metalloproteinases: key regulators of cellular senescence in squamous cell carcinoma.CDKN2A与基质金属蛋白酶:鳞状细胞癌中细胞衰老的关键调节因子
Am J Transl Res. 2025 Jun 15;17(6):4573-4589. doi: 10.62347/FKDS7259. eCollection 2025.
3
Mapping MAVE data for use in human genomics applications.映射用于人类基因组学应用的MAVE数据。

本文引用的文献

1
Generalized biomolecular modeling and design with RoseTTAFold All-Atom.基于 RoseTTAFold All-Atom 的广义生物分子建模与设计。
Science. 2024 Apr 19;384(6693):eadl2528. doi: 10.1126/science.adl2528.
2
Accurate proteome-wide missense variant effect prediction with AlphaMissense.使用 AlphaMissense 进行精确的全蛋白质错义变异效应预测。
Science. 2023 Sep 22;381(6664):eadg7492. doi: 10.1126/science.adg7492.
3
Genomic newborn screening for rare diseases.针对罕见病的基因组新生儿筛查。
Genome Biol. 2025 Jun 25;26(1):179. doi: 10.1186/s13059-025-03647-x.
4
StructMAn 2.0 Web: a web server for structural annotation of protein sequences and mutations.StructMAn 2.0网络版:一个用于蛋白质序列和突变结构注释的网络服务器。
Nucleic Acids Res. 2025 Jul 7;53(W1):W528-W533. doi: 10.1093/nar/gkaf381.
5
Identifying Sex Differences in Lung Adenocarcinoma Using Multi-Omics Integrative Protein Signaling Networks.利用多组学整合蛋白信号网络识别肺腺癌中的性别差异。
bioRxiv. 2025 Feb 7:2025.02.03.636354. doi: 10.1101/2025.02.03.636354.
6
Databases of ligand-binding pockets and protein-ligand interactions.配体结合口袋和蛋白质-配体相互作用的数据库。
Comput Struct Biotechnol J. 2024 Mar 24;23:1320-1338. doi: 10.1016/j.csbj.2024.03.015. eCollection 2024 Dec.
Nat Rev Genet. 2023 Nov;24(11):755-766. doi: 10.1038/s41576-023-00621-w. Epub 2023 Jun 29.
4
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
5
Predicting locations of cryptic pockets from single protein structures using the PocketMiner graph neural network.利用 PocketMiner 图神经网络从单个蛋白质结构预测隐匿口袋的位置。
Nat Commun. 2023 Mar 1;14(1):1177. doi: 10.1038/s41467-023-36699-3.
6
Third-generation computational approaches for genetic variant interpretation.用于基因变异解读的第三代计算方法。
Brain. 2023 Feb 13;146(2):411-412. doi: 10.1093/brain/awad011.
7
AlphaFill: enriching AlphaFold models with ligands and cofactors.AlphaFill:利用配体和辅因子丰富 AlphaFold 模型。
Nat Methods. 2023 Feb;20(2):205-213. doi: 10.1038/s41592-022-01685-y. Epub 2022 Nov 24.
8
Base editor scanning charts the DNMT3A activity landscape.碱基编辑器扫描图谱描绘 DNMT3A 活性景观。
Nat Chem Biol. 2023 Feb;19(2):176-186. doi: 10.1038/s41589-022-01167-4. Epub 2022 Oct 20.
9
Search and sequence analysis tools services from EMBL-EBI in 2022.2022 年 EMBL-EBI 的搜索和序列分析工具服务。
Nucleic Acids Res. 2022 Jul 5;50(W1):W276-W279. doi: 10.1093/nar/gkac240.
10
Circuit topology predicts pathogenicity of missense mutations.电路拓扑结构可预测错义突变的致病性。
Proteins. 2022 Sep;90(9):1634-1644. doi: 10.1002/prot.26342. Epub 2022 Apr 23.