• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

上位性的符号建模。

Symbolic modeling of epistasis.

作者信息

Moore Jason H, Barney Nate, Tsai Chia-Ti, Chiang Fu-Tien, Gui Jiang, White Bill C

机构信息

Computational Genetics Laboratory, Norris-Cotton Cancer Center, Dartmouth Medical School, Lebanon, NH 03756, USA.

出版信息

Hum Hered. 2007;63(2):120-33. doi: 10.1159/000099184. Epub 2007 Feb 2.

DOI:10.1159/000099184
PMID:17283441
Abstract

The workhorse of modern genetic analysis is the parametric linear model. The advantages of the linear modeling framework are many and include a mathematical understanding of the model fitting process and ease of interpretation. However, an important limitation is that linear models make assumptions about the nature of the data being modeled. This assumption may not be realistic for complex biological systems such as disease susceptibility where nonlinearities in the genotype to phenotype mapping relationship that result from epistasis, plastic reaction norms, locus heterogeneity, and phenocopy, for example, are the norm rather than the exception. We have previously developed a flexible modeling approach called symbolic discriminant analysis (SDA) that makes no assumptions about the patterns in the data. Rather, SDA lets the data dictate the size, shape, and complexity of a symbolic discriminant function that could include any set of mathematical functions from a list of candidates supplied by the user. Here, we outline a new five step process for symbolic model discovery that uses genetic programming (GP) for coarse-grained stochastic searching, experimental design for parameter optimization, graphical modeling for generating expert knowledge, and estimation of distribution algorithms for fine-grained stochastic searching. Finally, we introduce function mapping as a new method for interpreting symbolic discriminant functions. We show that function mapping when combined with measures of interaction information facilitates statistical interpretation by providing a graphical approach to decomposing complex models to highlight synergistic, redundant, and independent effects of polymorphisms and their composite functions. We illustrate this five step SDA modeling process with a real case-control dataset.

摘要

现代遗传分析的主力是参数线性模型。线性建模框架有诸多优点,包括对模型拟合过程的数学理解以及易于解释。然而,一个重要的局限性在于线性模型对所建模数据的性质做出了假设。对于诸如疾病易感性等复杂生物系统而言,这种假设可能并不现实,例如,由于上位性、塑性反应规范、基因座异质性和拟表型导致的基因型到表型映射关系中的非线性是常态而非例外。我们之前开发了一种灵活的建模方法,称为符号判别分析(SDA),它不对数据模式做任何假设。相反,SDA让数据决定符号判别函数的大小、形状和复杂性,该函数可以包括用户提供的候选列表中的任何一组数学函数。在此,我们概述了一个用于符号模型发现的新的五步过程,该过程使用遗传编程(GP)进行粗粒度随机搜索、实验设计进行参数优化、图形建模生成专家知识以及分布估计算法进行细粒度随机搜索。最后,我们引入函数映射作为解释符号判别函数的一种新方法。我们表明,函数映射与交互信息度量相结合时,通过提供一种将复杂模型分解以突出多态性及其复合函数的协同、冗余和独立效应的图形方法,有助于进行统计解释。我们用一个实际的病例对照数据集来说明这个五步SDA建模过程。

相似文献

1
Symbolic modeling of epistasis.上位性的符号建模。
Hum Hered. 2007;63(2):120-33. doi: 10.1159/000099184. Epub 2007 Feb 2.
2
A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility.一种灵活的计算框架,用于在人类疾病易感性的遗传研究中检测、表征和解释上位性的统计模式。
J Theor Biol. 2006 Jul 21;241(2):252-61. doi: 10.1016/j.jtbi.2005.11.036. Epub 2006 Feb 2.
3
Symbolic discriminant analysis of microarray data in autoimmune disease.自身免疫性疾病中微阵列数据的符号判别分析
Genet Epidemiol. 2002 Jun;23(1):57-69. doi: 10.1002/gepi.1117.
4
Modeling genetic architecture: a multilinear theory of gene interaction.遗传结构建模:基因相互作用的多线性理论
Theor Popul Biol. 2001 Feb;59(1):61-86. doi: 10.1006/tpbi.2000.1508.
5
An entropy-based approach for testing genetic epistasis underlying complex diseases.一种基于熵的方法,用于检测复杂疾病背后的基因上位性。
J Theor Biol. 2008 Jan 21;250(2):362-74. doi: 10.1016/j.jtbi.2007.10.001. Epub 2007 Oct 6.
6
Allelic penetrance approach as a tool to model two-locus interaction in complex binary traits.等位基因外显率方法作为一种对复杂二元性状中双基因座相互作用进行建模的工具。
Heredity (Edinb). 2007 Aug;99(2):173-84. doi: 10.1038/sj.hdy.6800979. Epub 2007 Jun 6.
7
ModelMage: a tool for automatic model generation, selection and management.
Genome Inform. 2008;20:52-63.
8
CoaSim: a flexible environment for simulating genetic data under coalescent models.CoaSim:一种用于在溯祖模型下模拟遗传数据的灵活环境。
BMC Bioinformatics. 2005 Oct 14;6:252. doi: 10.1186/1471-2105-6-252.
9
A comparison of analytical methods for genetic association studies.基因关联研究分析方法的比较
Genet Epidemiol. 2008 Dec;32(8):767-78. doi: 10.1002/gepi.20345.
10
An empirical Bayes method for estimating epistatic effects of quantitative trait loci.一种用于估计数量性状基因座上位性效应的经验贝叶斯方法。
Biometrics. 2007 Jun;63(2):513-21. doi: 10.1111/j.1541-0420.2006.00711.x.

引用本文的文献

1
Investigating the parameter space of evolutionary algorithms.研究进化算法的参数空间。
BioData Min. 2018 Feb 17;11:2. doi: 10.1186/s13040-018-0164-x. eCollection 2018.
2
Multi-class computational evolution: development, benchmark evaluation and application to RNA-Seq biomarker discovery.多类计算进化:发展、基准评估及在RNA测序生物标志物发现中的应用
BioData Min. 2017 Apr 24;10:13. doi: 10.1186/s13040-017-0134-8. eCollection 2017.
3
Genome-wide genetic interaction analysis of glaucoma using expert knowledge derived from human phenotype networks.
利用从人类表型网络中获取的专家知识对青光眼进行全基因组遗传相互作用分析。
Pac Symp Biocomput. 2015;20:207-18.
4
Practical aspects of genome-wide association interaction analysis.全基因组关联相互作用分析的实践方面
Hum Genet. 2014 Nov;133(11):1343-58. doi: 10.1007/s00439-014-1480-y. Epub 2014 Aug 28.
5
Genome-wide association interaction analysis for Alzheimer's disease.阿尔茨海默病的全基因组关联相互作用分析
Neurobiol Aging. 2014 Nov;35(11):2436-2443. doi: 10.1016/j.neurobiolaging.2014.05.014. Epub 2014 May 28.
6
Machine learning techniques accurately classify microbial communities by bacterial vaginosis characteristics.机器学习技术可通过细菌性阴道病特征准确分类微生物群落。
PLoS One. 2014 Feb 3;9(2):e87830. doi: 10.1371/journal.pone.0087830. eCollection 2014.
7
ATHENA: the analysis tool for heritable and environmental network associations.ATHENA:遗传性和环境网络关联的分析工具。
Bioinformatics. 2014 Mar 1;30(5):698-705. doi: 10.1093/bioinformatics/btt572. Epub 2013 Oct 21.
8
GWIS--model-free, fast and exhaustive search for epistatic interactions in case-control GWAS.GWIS--无模型、快速且全面搜索病例对照 GWAS 中的上位相互作用。
BMC Genomics. 2013;14 Suppl 3(Suppl 3):S10. doi: 10.1186/1471-2164-14-S3-S10. Epub 2013 May 28.
9
Systems genetics of alcoholism.酒精中毒的系统遗传学
Alcohol Res Health. 2008;31(1):14-25.
10
Statistical epistasis networks reduce the computational complexity of searching three-locus genetic models.统计上位性网络降低了搜索三位点遗传模型的计算复杂性。
Pac Symp Biocomput. 2013:397-408.