文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

Learning sequence-function relationships with scalable, interpretable Gaussian processes.

作者信息

Zhou Juannan, Martí-Gómez Carlos, Petti Samantha, McCandlish David M

出版信息

bioRxiv. 2025 Aug 19:2025.08.15.670613. doi: 10.1101/2025.08.15.670613.


DOI:10.1101/2025.08.15.670613
PMID:40894759
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12393447/
Abstract

Understanding the relationship between biological sequences, such as DNA, RNA or protein sequences, and their resulting phenotypes is one of the central goals of genetics. This task is complicated by epistasis, i.e., the context dependence of mutational effects. Advances in high-throughput phenotyping now make it possible to study these relationships at unprecedented scale, generating large datasets that measure phenotypes for tens or hundreds of thousands of sequences. However, standard regression models for analyzing such datasets often make unrealistic assumptions about the generalizability of mutational effects and epistatic coefficients across genetic backgrounds. Deep neural networks offer greater flexibility but suffer from limited interpretability and lack uncertainty quantification. Here, we introduce a family of interpretable Gaussian process models for sequence-function relationships that capture epistasis through flexible prior distributions that generalize classical theoretical models from the fitness landscape literature. In particular, these priors are parameterized by interpretable site-, allele-, and mutation-specific factors controlling the degree to which specific mutations decrease the predictability of the effects of other mutations. Using GPU acceleration to scale to large protein, RNA, and genome-wide SNP datasets, our models consistently deliver superior predictive performance while yielding interpretable parameters that both recover known features and uncover novel epistatic interactions. Overall, our methods provide new insights into the structure of the genotype-phenotype map and offer scalable, interpretable approaches for exploring complex genetic interactions across diverse biological systems.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa1/12393447/abd117d30976/nihpp-2025.08.15.670613v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa1/12393447/3a96585a3c07/nihpp-2025.08.15.670613v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa1/12393447/dd7f511ec154/nihpp-2025.08.15.670613v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa1/12393447/9100f492386d/nihpp-2025.08.15.670613v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa1/12393447/abd117d30976/nihpp-2025.08.15.670613v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa1/12393447/3a96585a3c07/nihpp-2025.08.15.670613v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa1/12393447/dd7f511ec154/nihpp-2025.08.15.670613v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa1/12393447/9100f492386d/nihpp-2025.08.15.670613v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa1/12393447/abd117d30976/nihpp-2025.08.15.670613v1-f0004.jpg

相似文献

[1]
Learning sequence-function relationships with scalable, interpretable Gaussian processes.

bioRxiv. 2025-8-19

[2]
Prescription of Controlled Substances: Benefits and Risks

2025-1

[3]
Short-Term Memory Impairment

2025-1

[4]
Genetic determinants of testicular sperm extraction outcomes: insights from a large multicentre study of men with non-obstructive azoospermia.

Hum Reprod Open. 2025-8-29

[5]
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.

Cochrane Database Syst Rev. 2022-10-4

[6]
Measures implemented in the school setting to contain the COVID-19 pandemic.

Cochrane Database Syst Rev. 2022-1-17

[7]
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.

Clin Orthop Relat Res. 2024-12-1

[8]
Sexual Harassment and Prevention Training

2025-1

[9]
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.

Comput Methods Programs Biomed. 2025-6-21

[10]
The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.

Autism Adulthood. 2024-12-2

本文引用的文献

[1]
Refining the resolution of the yeast genotype-phenotype map using single-cell RNA-sequencing.

Elife. 2025-7-28

[2]
Massive experimental quantification allows interpretable deep learning of protein aggregation.

Sci Adv. 2025-5-2

[3]
Gauge fixing for sequence-function relationships.

PLoS Comput Biol. 2025-3-20

[4]
Massively parallel characterization of transcriptional regulatory elements.

Nature. 2025-3

[5]
Site-saturation mutagenesis of 500 human protein domains.

Nature. 2025-1

[6]
MoCHI: neural networks to fit interpretable models and quantify energies, energetic couplings, epistasis, and allostery from deep mutational scanning data.

Genome Biol. 2024-12-2

[7]
Density estimation for ordinal biological sequences and its applications.

Phys Rev E. 2024-10

[8]
The genetic architecture of protein stability.

Nature. 2024-10

[9]
Addressing epistasis in the design of protein function.

Proc Natl Acad Sci U S A. 2024-8-20

[10]
Neural network extrapolation to distant regions of the protein fitness landscape.

Nat Commun. 2024-7-30

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索