Schmidt Mike, Hauser Elizabeth R, Martin Eden R, Schmidt Silke
Center for Human Genetics, Duke University Medical Center.
Stat Appl Genet Mol Biol. 2005;4:Article15. doi: 10.2202/1544-6115.1133. Epub 2005 Jun 6.
We have previously distributed a software package, SIMLA (SIMulation of Linkage and Association), which can be used to generate disease phenotype and marker genotype data in three-generational pedigrees of user-specified structure. To our knowledge, SIMLA is the only publicly available program that can simulate variable levels of both linkage (recombination) and linkage disequilibrium (LD) between marker and disease loci in general pedigrees. While the previous SIMLA version provided flexibility in choosing many parameters relevant for linkage and association mapping of complex human diseases, it did not allow for the segregation of more than one disease locus in a given pedigree and did not incorporate environmental covariates possibly interacting with disease susceptibility genes. Here, we present an extension of the simulation algorithm characterized by a much more general penetrance function, which allows for the joint action of up to two genes and up to two environmental covariates in the simulated pedigrees, with all possible multiplicative interaction effects between them. This makes the program even more useful for comparing the performance of different linkage and association analysis methods applied to complex human phenotypes. SIMLA can assist investigators in planning and designing a variety of linkage and association studies, and can help interpret results of real data analyses by comparing them to results obtained under a user-controlled data generation mechanism. A free download of the SIMLA package is available at http://wwwchg.duhs.duke.edu/software.
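To make the idea of a penetrance function with multiplicative interaction terms concrete, the following is a minimal illustrative sketch and not SIMLA's actual implementation. It assumes a logistic form for the penetrance, which the abstract does not specify. The function name `penetrance`, its parameters, and the coding of genotypes (for example, risk-allele counts) and covariates are hypothetical choices made for illustration.

```python
import math


def penetrance(g1, g2, e1, e2, intercept, main_effects, interaction_effects=None):
    """Probability of disease given two genotypes and two covariates.

    Assumption (not taken from the abstract): a logistic model in which
    each interaction enters as a coefficient times the product of the
    participating variables.

    main_effects:        dict such as {"g1": 0.8, "e1": 0.3}
    interaction_effects: dict keyed by tuples of variable names,
                         such as {("g1", "e1"): 0.5}
    """
    values = {"g1": g1, "g2": g2, "e1": e1, "e2": e2}

    # Linear predictor: baseline plus main effects.
    eta = intercept
    for name, beta in main_effects.items():
        eta += beta * values[name]

    # Multiplicative interactions: coefficient times the product of the
    # variables named in the key (two-way, three-way, or four-way).
    for names, beta in (interaction_effects or {}).items():
        eta += beta * math.prod(values[n] for n in names)

    # Logistic link maps the linear predictor to a probability in (0, 1).
    return 1.0 / (1.0 + math.exp(-eta))
```

For example, `penetrance(1, 0, 1, 0, intercept=math.log(0.05 / 0.95), main_effects={"g1": 0.5}, interaction_effects={("g1", "e1"): 1.0})` gives the disease probability for a carrier of one risk allele at the first locus who is exposed to the first covariate, under a hypothetical 5% baseline risk. With two genes and two covariates there are at most eleven interaction terms (six two-way, four three-way, one four-way), all expressible through the tuple keys.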