Department of Epidemiology and Biostatistics, School of Medicine, Case Western Reserve University, Cleveland, Ohio, USA.
Cancer Epidemiol Biomarkers Prev. 2012 Dec;21(12):2242-51. doi: 10.1158/1055-9965.EPI-12-0703. Epub 2012 Sep 7.
We propose a 2-step model-based approach, with correction for ascertainment, to linkage analysis of a binary trait with variable age of onset and apply it to a set of multiplex pedigrees segregating for adult glioma.
First, we fit segregation models by formulating the likelihood for a person to have a bivariate phenotype, affection status and age of onset, along with other covariates, and from these we estimate population trait allele frequencies and penetrance parameters as a function of age (N = 281 multiplex glioma pedigrees). Second, the best fitting models are used as trait models in multipoint linkage analysis (N = 74 informative multiplex glioma pedigrees). To correct for ascertainment, a prevalence constraint is used in the likelihood of the segregation models for all 281 pedigrees. Then the trait allele frequencies are reestimated for the pedigree founders of the subset of 74 pedigrees chosen for linkage analysis.
Using the best-fitting segregation models in model-based multipoint linkage analysis, we identified 2 separate peaks on chromosome 17; the first agreed with a region identified by Shete and colleagues who used model-free affected-only linkage analysis, but with a narrowed peak: and the second agreed with a second region they found but had a larger maximum log of the odds (LOD).
Our approach was able to narrow the linkage peak previously published for glioma.
We provide a practical solution to model-based linkage analysis for disease affection status with variable age of onset for the kinds of pedigree data often collected for linkage analysis.
我们提出了一种两步基于模型的方法,对具有可变发病年龄的二分类性状进行连锁分析,并对一组常染色体显性遗传的成年型胶质细胞瘤家系进行了校正。
首先,我们通过构建一个人具有双变量表型、发病状态和发病年龄的似然函数来拟合分离模型,并结合其他协变量,我们估计了人群特征等位基因频率和发病年龄的穿透参数(N = 281 个多发性胶质细胞瘤家系)。其次,最佳拟合模型作为性状模型用于多点连锁分析(N = 74 个有信息的多发性胶质细胞瘤家系)。为了校正鉴定,我们在 281 个家系的分离模型似然中使用了流行率约束。然后,对选择用于连锁分析的 74 个家系子集的家系创始人重新估计特征等位基因频率。
使用基于模型的多点连锁分析中的最佳拟合分离模型,我们在染色体 17 上确定了 2 个独立的峰;第一个与 Shete 等人使用无模型受影响的连锁分析所确定的区域一致,但峰变窄:第二个与他们发现的第二个区域一致,但最大对数优势(LOD)更大。
我们的方法能够缩小先前发表的胶质细胞瘤连锁峰。
我们为具有可变发病年龄的疾病发病状态的基于模型的连锁分析提供了一种实用的解决方案,适用于常染色体显性遗传的连锁分析中经常收集的那种家系数据。