Institute of Genetics, Vetsuisse Faculty, University of Bern, Bern, Switzerland.
Anim Genet. 2019 Dec;50(6):695-704. doi: 10.1111/age.12834. Epub 2019 Sep 5.
The domestic dog serves as an excellent model to investigate the genetic basis of disease. More than 400 heritable traits analogous to human diseases have been described in dogs. To further canine medical genetics research, we established the Dog Biomedical Variant Database Consortium (DBVDC) and present a comprehensive list of functionally annotated genome variants that were identified with whole genome sequencing of 582 dogs from 126 breeds and eight wolves. The genomes used in the study have a minimum coverage of 10× and an average coverage of ~24×. In total, we identified 23 133 692 single-nucleotide variants (SNVs) and 10 048 038 short indels, including 93% undescribed variants. On average, each individual dog genome carried ∼4.1 million single-nucleotide and ~1.4 million short-indel variants with respect to the reference genome assembly. About 2% of the variants were located in coding regions of annotated genes and loci. Variant effect classification showed 247 141 SNVs and 99 562 short indels having moderate or high impact on 11 267 protein-coding genes. On average, each genome contained heterozygous loss-of-function variants in 30 potentially embryonic lethal genes and 97 genes associated with developmental disorders. More than 50 inherited disorders and traits have been unravelled using the DBVDC variant catalogue, enabling genetic testing for breeding and diagnostics. This resource of annotated variants and their corresponding genotype frequencies constitutes a highly useful tool for the identification of potential variants causative for rare inherited disorders in dogs.
家犬是研究疾病遗传基础的极佳模型。已经在犬类中描述了 400 多种与人类疾病类似的可遗传性状。为了进一步开展犬类医学遗传学研究,我们成立了犬生物医学变异数据库联盟(DBVDC),并提供了一份全面的功能注释基因组变异列表,这些变异是通过对 126 个品种的 582 只犬和 8 只狼进行全基因组测序而确定的。研究中使用的基因组最小覆盖率为 10×,平均覆盖率约为 24×。总共,我们确定了 23133692 个单核苷酸变异(SNV)和 10048038 个短插入缺失,其中包括 93%未描述的变异。平均而言,每个个体犬基因组相对于参考基因组组装携带约 410 万个单核苷酸和~140 万个短插入缺失。大约 2%的变异位于注释基因和基因座的编码区。变异效应分类显示,247141 个 SNV 和 99562 个短插入缺失对 11267 个蛋白质编码基因具有中度或高度影响。平均而言,每个基因组包含 30 个潜在胚胎致死基因和 97 个与发育障碍相关基因的杂合失活变异。使用 DBVDC 变异目录已经揭示了 50 多种遗传性疾病和性状,从而实现了用于繁殖和诊断的遗传测试。该注释变异及其相应基因型频率的资源构成了识别犬类罕见遗传性疾病潜在变异的非常有用的工具。