• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大样本量和非线性稀疏模型概述了炎症性肠病中的上位效应。

Large sample size and nonlinear sparse models outline epistatic effects in inflammatory bowel disease.

机构信息

Department of of Electrical Engineering, Katholieke Universiteit Leuven, Leuven, Belgium.

出版信息

Genome Biol. 2023 Oct 5;24(1):224. doi: 10.1186/s13059-023-03064-y.

DOI:10.1186/s13059-023-03064-y
PMID:37798735
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10552306/
Abstract

BACKGROUND

Despite clear evidence of nonlinear interactions in the molecular architecture of polygenic diseases, linear models have so far appeared optimal in genotype-to-phenotype modeling. A key bottleneck for such modeling is that genetic data intrinsically suffers from underdetermination ([Formula: see text]). Millions of variants are present in each individual while the collection of large, homogeneous cohorts is hindered by phenotype incidence, sequencing cost, and batch effects.

RESULTS

We demonstrate that when we provide enough training data and control the complexity of nonlinear models, a neural network outperforms additive approaches in whole exome sequencing-based inflammatory bowel disease case-control prediction. To do so, we propose a biologically meaningful sparsified neural network architecture, providing empirical evidence for positive and negative epistatic effects present in the inflammatory bowel disease pathogenesis.

CONCLUSIONS

In this paper, we show that underdetermination is likely a major driver for the apparent optimality of additive modeling in clinical genetics today.

摘要

背景

尽管多基因疾病的分子结构中存在明显的非线性相互作用,但线性模型在基因型到表型建模中似乎迄今为止一直是最优的。这种建模的一个关键瓶颈是遗传数据本质上存在欠定问题([公式:见正文])。每个人的个体中都存在数百万个变体,而大型、同质队列的收集受到表型发生率、测序成本和批次效应的阻碍。

结果

我们证明,当我们提供足够的训练数据并控制非线性模型的复杂性时,神经网络在基于全外显子组测序的炎症性肠病病例对照预测中的表现优于加性方法。为此,我们提出了一种具有生物学意义的稀疏神经网络架构,为炎症性肠病发病机制中存在的正和负上位效应提供了经验证据。

结论

在本文中,我们表明,欠定问题很可能是当今临床遗传学中加性建模表现出明显最优性的主要驱动因素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7a1/10552306/0762a8506ba9/13059_2023_3064_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7a1/10552306/c76d318a5b90/13059_2023_3064_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7a1/10552306/37b555166b45/13059_2023_3064_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7a1/10552306/ff8edc75cfdf/13059_2023_3064_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7a1/10552306/6d5a0bc65a14/13059_2023_3064_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7a1/10552306/0762a8506ba9/13059_2023_3064_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7a1/10552306/c76d318a5b90/13059_2023_3064_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7a1/10552306/37b555166b45/13059_2023_3064_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7a1/10552306/ff8edc75cfdf/13059_2023_3064_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7a1/10552306/6d5a0bc65a14/13059_2023_3064_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b7a1/10552306/0762a8506ba9/13059_2023_3064_Fig5_HTML.jpg

相似文献

1
Large sample size and nonlinear sparse models outline epistatic effects in inflammatory bowel disease.大样本量和非线性稀疏模型概述了炎症性肠病中的上位效应。
Genome Biol. 2023 Oct 5;24(1):224. doi: 10.1186/s13059-023-03064-y.
2
Biologically meaningful genome interpretation models to address data underdetermination for the leaf and seed ionome prediction in Arabidopsis thaliana.用于解决拟南芥叶和种子离子组预测中数据不足问题的具有生物学意义的基因组解释模型。
Sci Rep. 2024 Jun 8;14(1):13188. doi: 10.1038/s41598-024-63855-6.
3
The deep arbitrary polynomial chaos neural network or how Deep Artificial Neural Networks could benefit from data-driven homogeneous chaos theory.深度任意多项式混沌神经网络或深度人工神经网络如何从数据驱动的均匀混沌理论中受益。
Neural Netw. 2023 Sep;166:85-104. doi: 10.1016/j.neunet.2023.06.036. Epub 2023 Jul 10.
4
Spontaneous preterm birth: advances toward the discovery of genetic predisposition.自发性早产:朝着发现遗传易感性的方向取得进展。
Am J Obstet Gynecol. 2018 Mar;218(3):294-314.e2. doi: 10.1016/j.ajog.2017.12.009. Epub 2017 Dec 14.
5
Comparative performances of machine learning methods for classifying Crohn Disease patients using genome-wide genotyping data.使用全基因组基因分型数据对克罗恩病患者进行分类的机器学习方法的比较性能。
Sci Rep. 2019 Jul 17;9(1):10351. doi: 10.1038/s41598-019-46649-z.
6
Phylogenetic convolutional neural networks in metagenomics.元基因组学中的系统发生卷积神经网络。
BMC Bioinformatics. 2018 Mar 8;19(Suppl 2):49. doi: 10.1186/s12859-018-2033-5.
7
Impact of exome sequencing in inflammatory bowel disease.外显子组测序在炎症性肠病中的影响。
World J Gastroenterol. 2013 Oct 28;19(40):6721-9. doi: 10.3748/wjg.v19.i40.6721.
8
Learn bifurcations of nonlinear parametric systems via equation-driven neural networks.通过方程驱动的神经网络学习非线性参数系统的分岔。
Chaos. 2022 Jan;32(1):011102. doi: 10.1063/5.0078306.
9
The genetic background of inflammatory bowel disease.炎症性肠病的遗传背景。
Hepatogastroenterology. 2000 Jan-Feb;47(31):5-14.
10
Transformer-based hand gesture recognition from instantaneous to fused neural decomposition of high-density EMG signals.基于Transformer 的手 gestures 识别,来自高密度 EMG 信号的即时融合神经分解。
Sci Rep. 2023 Jul 7;13(1):11000. doi: 10.1038/s41598-023-36490-w.

引用本文的文献

1
Explainable deep learning for stratified medicine in inflammatory bowel disease.用于炎症性肠病分层医学的可解释深度学习
Genome Biol. 2025 Jul 24;26(1):223. doi: 10.1186/s13059-025-03692-6.
2
Detecting genetic interactions with visible neural networks.检测与可见神经网络的基因相互作用。
Commun Biol. 2025 Jun 5;8(1):874. doi: 10.1038/s42003-025-08157-x.
3
Genomic prediction with kinship-based multiple kernel learning produces hypothesis on the underlying inheritance mechanisms of phenotypic traits.基于亲缘关系的多核学习进行基因组预测,能够对表型性状的潜在遗传机制提出假设。

本文引用的文献

1
Improving genetic risk prediction across diverse population by disentangling ancestry representations.通过解析祖先表示来改善不同人群的遗传风险预测。
Commun Biol. 2023 Sep 22;6(1):964. doi: 10.1038/s42003-023-05352-6.
2
The UCSC Genome Browser database: 2023 update.UCSC 基因组浏览器数据库:2023 年更新。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1188-D1195. doi: 10.1093/nar/gkac1072.
3
Single-sequence protein structure prediction using a language model and deep learning.基于语言模型和深度学习的单序列蛋白质结构预测。
Genome Biol. 2025 Apr 4;26(1):84. doi: 10.1186/s13059-025-03544-3.
4
A quantitative benchmark of neural network feature selection methods for detecting nonlinear signals.用于检测非线性信号的神经网络特征选择方法的定量基准。
Sci Rep. 2024 Dec 28;14(1):31180. doi: 10.1038/s41598-024-82583-5.
5
Linking genetic markers and crop model parameters using neural networks to enhance genomic prediction of integrative traits.利用神经网络将遗传标记与作物模型参数相联系,以增强综合性状的基因组预测。
Front Plant Sci. 2024 Jul 30;15:1393965. doi: 10.3389/fpls.2024.1393965. eCollection 2024.
6
A python library for the fast and scalable computation of biologically meaningful individual specific networks.一个用于快速和可扩展地计算具有生物学意义的个体特定网络的 Python 库。
Sci Rep. 2024 Aug 6;14(1):18243. doi: 10.1038/s41598-024-69067-2.
7
Biologically meaningful genome interpretation models to address data underdetermination for the leaf and seed ionome prediction in Arabidopsis thaliana.用于解决拟南芥叶和种子离子组预测中数据不足问题的具有生物学意义的基因组解释模型。
Sci Rep. 2024 Jun 8;14(1):13188. doi: 10.1038/s41598-024-63855-6.
Nat Biotechnol. 2022 Nov;40(11):1617-1623. doi: 10.1038/s41587-022-01432-w. Epub 2022 Oct 3.
4
An Extremes of Phenotype Approach Confirms Significant Genetic Heterogeneity in Patients with Ulcerative Colitis.极端表型法证实溃疡性结肠炎患者存在显著的遗传异质性。
J Crohns Colitis. 2023 Mar 18;17(2):277-288. doi: 10.1093/ecco-jcc/jjac121.
5
Machine learning models outperform deep learning models, provide interpretation and facilitate feature selection for soybean trait prediction.机器学习模型优于深度学习模型,可提供解释并有助于选择大豆性状预测的特征。
BMC Plant Biol. 2022 Apr 8;22(1):180. doi: 10.1186/s12870-022-03559-z.
6
Current progress and open challenges for applying deep learning across the biosciences.深度学习在整个生命科学中的应用现状及面临的开放性挑战。
Nat Commun. 2022 Apr 1;13(1):1728. doi: 10.1038/s41467-022-29268-7.
7
Significant sparse polygenic risk scores across 813 traits in UK Biobank.在英国生物库中,813 项特征存在显著稀疏多基因风险评分。
PLoS Genet. 2022 Mar 24;18(3):e1010105. doi: 10.1371/journal.pgen.1010105. eCollection 2022 Mar.
8
From genotype to phenotype in Arabidopsis thaliana: in-silico genome interpretation predicts 288 phenotypes from sequencing data.拟南芥从基因型到表型:基于测序数据的计算机基因组解读预测 288 种表型。
Nucleic Acids Res. 2022 Feb 22;50(3):e16. doi: 10.1093/nar/gkab1099.
9
Deep neural networks for genomic prediction do not estimate marker effects.深度神经网络在基因组预测中并不估计标记效应。
Plant Genome. 2021 Nov;14(3):e20147. doi: 10.1002/tpg2.20147. Epub 2021 Oct 1.
10
GenNet framework: interpretable deep learning for predicting phenotypes from genetic data.GenNet 框架:从遗传数据预测表型的可解释深度学习。
Commun Biol. 2021 Sep 17;4(1):1094. doi: 10.1038/s42003-021-02622-z.