Liu Ying, Huang Chien Hsun, Hu Inchi, Lo Shaw-Hwa, Zheng Tian
Department of Statistics, Columbia University, New York, NY 10027, USA.
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S106. doi: 10.1186/1753-6561-5-S9-S106.
Both common variants and rare variants are involved in the etiology of most complex diseases in humans. Developments in sequencing technology have led to the identification of a high density of rare variant single-nucleotide polymorphisms (SNPs) on the genome, each of which affects only at most 1% of the population. Genotypes derived from these SNPs allow one to study the involvement of rare variants in common human disorders. Here, we propose an association screening approach that treats genes as units of analysis. SNPs within a gene are used to create partitions of individuals, and inverse-probability weighting is used to overweight genotypic differences observed on rare variants. Association between a phenotype trait and the constructed partition is then evaluated. We consider three association tests (one-way ANOVA, chi-square test, and the partition retention method) and compare these strategies using the simulated data from the Genetic Analysis Workshop 17. Several genes that contain causal SNPs were identified by the proposed method as top genes.
常见变异和罕见变异都与人类大多数复杂疾病的病因有关。测序技术的发展使得在基因组上鉴定出高密度的罕见变异单核苷酸多态性(SNP)成为可能,其中每个SNP最多只影响1%的人群。从这些SNP得到的基因型使人们能够研究罕见变异在常见人类疾病中的作用。在此,我们提出一种将基因作为分析单位的关联筛选方法。基因内的SNP用于创建个体分区,并使用逆概率加权法对在罕见变异上观察到的基因型差异进行加权。然后评估表型性状与构建的分区之间的关联。我们考虑了三种关联检验(单向方差分析、卡方检验和分区保留法),并使用遗传分析研讨会17的模拟数据比较了这些策略。所提出的方法将几个包含因果SNP的基因鉴定为顶级基因。