• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MAVE-NN:从变异效应的多重分析中学习基因型-表型图谱。

MAVE-NN: learning genotype-phenotype maps from multiplex assays of variant effect.

作者信息

Tareen Ammar, Kooshkbaghi Mahdi, Posfai Anna, Ireland William T, McCandlish David M, Kinney Justin B

机构信息

Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, 11724, NY, USA.

Present Address: Regeneron Pharmaceuticals, Inc., Tarrytown, 10591, NY, USA.

出版信息

Genome Biol. 2022 Apr 15;23(1):98. doi: 10.1186/s13059-022-02661-7.

DOI:10.1186/s13059-022-02661-7
PMID:35428271
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9011994/
Abstract

Multiplex assays of variant effect (MAVEs) are a family of methods that includes deep mutational scanning experiments on proteins and massively parallel reporter assays on gene regulatory sequences. Despite their increasing popularity, a general strategy for inferring quantitative models of genotype-phenotype maps from MAVE data is lacking. Here we introduce MAVE-NN, a neural-network-based Python package that implements a broadly applicable information-theoretic framework for learning genotype-phenotype maps-including biophysically interpretable models-from MAVE datasets. We demonstrate MAVE-NN in multiple biological contexts, and highlight the ability of our approach to deconvolve mutational effects from otherwise confounding experimental nonlinearities and noise.

摘要

变异效应多重分析(MAVEs)是一类方法,包括对蛋白质的深度突变扫描实验和对基因调控序列的大规模平行报告基因检测。尽管它们越来越受欢迎,但缺乏一种从MAVE数据推断基因型-表型图谱定量模型的通用策略。在这里,我们介绍了MAVE-NN,这是一个基于神经网络的Python软件包,它实现了一个广泛适用的信息论框架,用于从MAVE数据集中学习基因型-表型图谱,包括具有生物物理可解释性的模型。我们在多个生物学背景下展示了MAVE-NN,并强调了我们的方法从其他混淆的实验非线性和噪声中解卷积突变效应的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/6b56c3748932/13059_2022_2661_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/ba007a894c57/13059_2022_2661_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/8b2d3bea8d13/13059_2022_2661_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/4f91d7f5a7e2/13059_2022_2661_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/5c4a775101b7/13059_2022_2661_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/2784cca5ad4f/13059_2022_2661_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/6b56c3748932/13059_2022_2661_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/ba007a894c57/13059_2022_2661_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/8b2d3bea8d13/13059_2022_2661_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/4f91d7f5a7e2/13059_2022_2661_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/5c4a775101b7/13059_2022_2661_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/2784cca5ad4f/13059_2022_2661_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27a0/9011994/6b56c3748932/13059_2022_2661_Fig6_HTML.jpg

相似文献

1
MAVE-NN: learning genotype-phenotype maps from multiplex assays of variant effect.MAVE-NN:从变异效应的多重分析中学习基因型-表型图谱。
Genome Biol. 2022 Apr 15;23(1):98. doi: 10.1186/s13059-022-02661-7.
2
MaveDB: an open-source platform to distribute and interpret data from multiplexed assays of variant effect.MaveDB:一个开源平台,用于分发和解释来自变异效应多重分析的数据。
Genome Biol. 2019 Nov 4;20(1):223. doi: 10.1186/s13059-019-1845-6.
3
Minimum information and guidelines for reporting a Multiplexed Assay of Variant Effect.多重变异效应分析报告的最低信息及指南
ArXiv. 2023 Jun 26:arXiv:2306.15113v1.
4
satmut_utils: a simulation and variant calling package for multiplexed assays of variant effect.satmut_utils:用于多重变异效应分析的模拟和变异调用包。
Genome Biol. 2023 Apr 20;24(1):82. doi: 10.1186/s13059-023-02922-z.
5
Minimum information and guidelines for reporting a multiplexed assay of variant effect.用于报告变异效应多重分析的最低信息和指南。
Genome Biol. 2024 Apr 19;25(1):100. doi: 10.1186/s13059-024-03223-9.
6
Massively Parallel Assays and Quantitative Sequence-Function Relationships.大规模平行分析与定量序列功能关系。
Annu Rev Genomics Hum Genet. 2019 Aug 31;20:99-127. doi: 10.1146/annurev-genom-083118-014845. Epub 2019 May 15.
7
Deciphering regulatory DNA sequences and noncoding genetic variants using neural network models of massively parallel reporter assays.利用大规模平行报告基因实验的神经网络模型来破译调控 DNA 序列和非编码遗传变异。
PLoS One. 2019 Jun 17;14(6):e0218073. doi: 10.1371/journal.pone.0218073. eCollection 2019.
8
Mapping MAVE data for use in human genomics applications.映射用于人类基因组学应用的MAVE数据。
bioRxiv. 2024 Jun 30:2023.06.20.545702. doi: 10.1101/2023.06.20.545702.
9
Sampling Strategies for Experimentally Mapping Molecular Fitness Landscapes Using High-Throughput Methods.使用高通量方法实验性绘制分子适应性景观的采样策略。
J Mol Evol. 2024 Aug;92(4):402-414. doi: 10.1007/s00239-024-10179-8. Epub 2024 Jun 17.
10
Technical Note: PYRO-NN: Python reconstruction operators in neural networks.技术说明:PYRO-NN:神经网络中的 Python 重建算子。
Med Phys. 2019 Nov;46(11):5110-5115. doi: 10.1002/mp.13753. Epub 2019 Aug 27.

引用本文的文献

1
Learning sequence-function relationships with scalable, interpretable Gaussian processes.通过可扩展、可解释的高斯过程学习序列-函数关系。
bioRxiv. 2025 Aug 19:2025.08.15.670613. doi: 10.1101/2025.08.15.670613.
2
Symmetry, gauge freedoms, and the interpretability of sequence-function relationships.对称性、规范自由度与序列-功能关系的可解释性。
Phys Rev Res. 2025 Apr-Jun;7(2). doi: 10.1103/physrevresearch.7.023005. Epub 2025 Apr 2.
3
Variant scoring tools for deep mutational scanning.用于深度突变扫描的变异评分工具。

本文引用的文献

1
Interpretable modeling of genotype-phenotype landscapes with state-of-the-art predictive power.具有先进预测能力的基因型-表型景观的可解释建模。
Proc Natl Acad Sci U S A. 2022 Jun 28;119(26):e2114021119. doi: 10.1073/pnas.2114021119. Epub 2022 Jun 21.
2
Mapping the energetic and allosteric landscapes of protein binding domains.绘制蛋白质结合域的能量和别构景观。
Nature. 2022 Apr;604(7904):175-183. doi: 10.1038/s41586-022-04586-4. Epub 2022 Apr 6.
3
Neural networks to learn protein sequence-function relationships from deep mutational scanning data.
Mol Syst Biol. 2025 Aug 8. doi: 10.1038/s44320-025-00137-x.
4
On learning functions over biological sequence space: relating Gaussian process priors, regularization, and gauge fixing.关于生物序列空间上的学习函数:关联高斯过程先验、正则化和规范固定。
bioRxiv. 2025 Jul 11:2025.04.26.650699. doi: 10.1101/2025.04.26.650699.
5
On learning functions over biological sequence space: relating Gaussian process priors, regularization, and gauge fixing.关于生物序列空间上的学习函数:关联高斯过程先验、正则化和规范固定。
ArXiv. 2025 Jul 11:arXiv:2504.19034v2.
6
Gauge fixing for sequence-function relationships.序列-功能关系的规范固定
PLoS Comput Biol. 2025 Mar 20;21(3):e1012818. doi: 10.1371/journal.pcbi.1012818. eCollection 2025.
7
Interpreting -regulatory mechanisms from genomic deep neural networks using surrogate models.使用替代模型从基因组深度神经网络解释调控机制。
Nat Mach Intell. 2024 Jun;6(6):701-713. doi: 10.1038/s42256-024-00851-5. Epub 2024 Jun 21.
8
Predictive Modeling of Gene Expression and Localization of DNA Binding Site Using Deep Convolutional Neural Networks.使用深度卷积神经网络对基因表达进行预测建模及DNA结合位点定位
bioRxiv. 2024 Dec 20:2024.12.17.629042. doi: 10.1101/2024.12.17.629042.
9
MoCHI: neural networks to fit interpretable models and quantify energies, energetic couplings, epistasis, and allostery from deep mutational scanning data.MoCHI:用于拟合可解释模型并从深度突变扫描数据中量化能量、能量耦合、上位性和变构的神经网络。
Genome Biol. 2024 Dec 2;25(1):303. doi: 10.1186/s13059-024-03444-y.
10
Importance of higher-order epistasis in large protein sequence-function relationships.高阶上位性在大型蛋白质序列-功能关系中的重要性。
bioRxiv. 2024 Sep 24:2024.09.22.614318. doi: 10.1101/2024.09.22.614318.
神经网络从深度突变扫描数据中学习蛋白质序列-功能关系。
Proc Natl Acad Sci U S A. 2021 Nov 30;118(48). doi: 10.1073/pnas.2104878118.
4
The genetic landscape for amyloid beta fibril nucleation accurately discriminates familial Alzheimer's disease mutations.淀粉样β纤维核形成的遗传景观能准确地区分家族性阿尔茨海默病突变。
Elife. 2021 Feb 1;10:e63364. doi: 10.7554/eLife.63364.
5
DiMSum: an error model and pipeline for analyzing deep mutational scanning data and diagnosing common experimental pathologies.DiMSum:一种用于分析深度突变扫描数据和诊断常见实验病理学的误差模型和分析流程。
Genome Biol. 2020 Aug 17;21(1):207. doi: 10.1186/s13059-020-02091-3.
6
Gene Regulation in and out of Equilibrium.基因调控的平衡与失衡。
Annu Rev Biophys. 2020 May 6;49:199-226. doi: 10.1146/annurev-biophys-121219-081542.
7
Logomaker: beautiful sequence logos in Python.Logomaker:用 Python 绘制优美的序列 logo。
Bioinformatics. 2020 Apr 1;36(7):2272-2274. doi: 10.1093/bioinformatics/btz921.
8
The mutational landscape of a prion-like domain.朊病毒样结构域的突变景观。
Nat Commun. 2019 Sep 13;10(1):4162. doi: 10.1038/s41467-019-12101-z.
9
MPRAnalyze: statistical framework for massively parallel reporter assays.MPRAnalyze:大规模平行报告基因分析的统计框架。
Genome Biol. 2019 Sep 2;20(1):183. doi: 10.1186/s13059-019-1787-z.
10
Protein stability engineering insights revealed by domain-wide comprehensive mutagenesis.通过全域全面突变揭示蛋白质稳定性工程的见解。
Proc Natl Acad Sci U S A. 2019 Aug 13;116(33):16367-16377. doi: 10.1073/pnas.1903888116. Epub 2019 Aug 1.