Cooper Nicholas J, Shtir Corina J, Smyth Deborah J, Guo Hui, Swafford Austin D, Zanda Manuela, Hurles Matthew E, Walker Neil M, Plagnol Vincent, Cooper Jason D, Howson Joanna M M, Burren Oliver S, Onengut-Gumuscu Suna, Rich Stephen S, Todd John A
JDRF/Wellcome Trust Diabetes and Inflammation Laboratory, Department of Medical Genetics, NIHR Cambridge Biomedical Research Centre, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Cambridge Biomedical Campus, Cambridge CB2 0XY, UK.
Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK, University College London, Darwin Building, London WC1E 6BT, UK.
Hum Mol Genet. 2015 Mar 15;24(6):1774-90. doi: 10.1093/hmg/ddu581. Epub 2014 Nov 25.
Copy number variants (CNVs) have been proposed as a possible source of 'missing heritability' in complex human diseases. Two studies of type 1 diabetes (T1D) found null associations with common copy number polymorphisms, but CNVs of low frequency and high penetrance could still play a role. We used the Log-R-ratio intensity data from a dense single nucleotide polymorphism (SNP) array, ImmunoChip, to detect rare CNV deletions (rDELs) and duplications (rDUPs) in 6808 T1D cases, 9954 controls and 2206 families with T1D-affected offspring. Initial analyses detected CNV associations. However, these were shown to be false-positive findings, failing replication with polymerase chain reaction. We developed a pipeline of quality control (QC) tests that were calibrated using systematic testing of sensitivity and specificity. The case-control odds ratios (OR) of CNV burden on T1D risk resulting from this QC pipeline converged on unity, suggesting no global frequency difference in rDELs or rDUPs. There was evidence that deletions could impact T1D risk for a small minority of cases, with enrichment for rDELs longer than 400 kb (OR = 1.57, P = 0.005). There were also 18 de novo rDELs detected in affected offspring but none for unaffected siblings (P = 0.03). No specific CNV regions showed robust evidence for association with T1D, although frequencies were lower than expected (most less than 0.1%), substantially reducing statistical power, which was examined in detail. We present an R-package, plumbCNV, which provides an automated approach for QC and detection of rare CNVs that can facilitate equivalent analyses of large-scale SNP array datasets.
拷贝数变异(CNV)已被认为是复杂人类疾病中“缺失遗传力”的一个可能来源。两项关于1型糖尿病(T1D)的研究发现,常见拷贝数多态性与之无关联,但低频和高外显率的CNV仍可能起作用。我们使用来自高密度单核苷酸多态性(SNP)芯片ImmunoChip的Log-R-ratio强度数据,在6808例T1D患者、9954例对照以及2206个有T1D患病后代的家庭中检测罕见CNV缺失(rDEL)和重复(rDUP)。初步分析检测到CNV关联。然而,这些被证明是假阳性结果,无法通过聚合酶链反应进行重复验证。我们开发了一套质量控制(QC)测试流程,通过对灵敏度和特异性的系统测试进行校准。由此QC流程得出的CNV负担对T1D风险的病例对照比值比(OR)趋近于1,表明rDEL或rDUP在总体频率上无差异。有证据表明,缺失可能对一小部分病例的T1D风险有影响,400 kb以上的rDEL存在富集现象(OR = 1.57,P = 0.005)。在患病后代中还检测到18个新生rDEL,但未患病的同胞中未检测到(P = 0.03)。尽管频率低于预期(大多低于0.1%),大幅降低了统计效力(对此进行了详细研究),但没有特定的CNV区域显示出与T1D相关的有力证据。我们展示了一个R包plumbCNV,它提供了一种用于QC和检测罕见CNV的自动化方法,有助于对大规模SNP芯片数据集进行等效分析。