Hsu L, Zhao L P
Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98104, USA.
Am J Hum Genet. 1996 May;58(5):1057-71.
In genetic research of chronic diseases, age-at-onset outcomes within families are often correlated. The nature of correlation of age-at-onset outcomes is indicative of common genetic and/or shared environmental risk factors among family members. Understanding patterns of such correlation may shed light on the disease etiology and, hence, is an important step to take prior to further searching for the responsible genes via segregation and linkage studies. Age-at-onset outcomes are different from those familiar quantitative or qualitative traits for which many statistical methods have been developed. In comparison with the quantitative traits, age-at-onset outcomes are often censored, i.e., instead of actual age-at-onset outcomes, only the current ages or ages at death are observed. They are also different from qualitative traits because of their continuity. Because of the complexity of correlated censored outcomes, few methods have yet been developed. A traditional approach is to impose a parametric joint distribution for the correlated age-at-onset outcomes, which has been criticized for requiring a stringent assumption about the entire distribution of age at onset. The purpose of this paper is to describe a method for assessing familial aggregation of correlated age-at-onset outcomes semiparametrically, by use of estimating equations. This method does not require any parametric assumption for modeling the age at onset. The estimates of parameters, including those quantifying the correlation within families, are consistent and have an asymptotic normal distribution that can be used to make inferences. To illustrate this new method, we analyzed two age-at-onset data sets that were obtained from studies conducted in the States of Washington and Hawaii, with the objective of quantifying the familial aggregations of age at onset of breast cancer.
在慢性病的基因研究中,家庭内发病年龄结果往往具有相关性。发病年龄结果的相关性质表明家庭成员之间存在共同的遗传和/或共享的环境风险因素。了解这种相关性模式可能有助于揭示疾病病因,因此,这是在通过分离和连锁研究进一步寻找致病基因之前要采取的重要步骤。发病年龄结果不同于许多已开发出统计方法的常见定量或定性性状。与定量性状相比,发病年龄结果往往是截尾的,即观察到的不是实际发病年龄结果,而只是当前年龄或死亡年龄。由于其连续性,它们也与定性性状不同。由于相关截尾结果的复杂性,目前开发的方法很少。一种传统方法是为相关的发病年龄结果强加一个参数化联合分布,这种方法因需要对发病年龄的整个分布做出严格假设而受到批评。本文的目的是描述一种使用估计方程半参数评估相关发病年龄结果家族聚集性的方法。该方法在对发病年龄建模时不需要任何参数假设。参数估计,包括那些量化家庭内相关性的参数估计,是一致的,并且具有渐近正态分布,可用于进行推断。为了说明这种新方法,我们分析了从华盛顿州和夏威夷州进行的研究中获得的两个发病年龄数据集,目的是量化乳腺癌发病年龄的家族聚集性。