Montemayor-Garcia Celina, Karagianni Panagiota, Stiles David A, Reese Erika M, Smellie Danielle A, Loy Debrean A, Levy Kimberly Y, Nwokocha Magdalene, Bueno Marina U, Miller Jeffery L, Klein Harvey G
Department of Transfusion Medicine, NIH Clinical Center, National Institutes of Health, Bethesda, Maryland.
B.S.R.C. Alexander Fleming, Vari, Greece.
Transfusion. 2018 Nov;58(11):2693-2704. doi: 10.1111/trf.14953. Epub 2018 Oct 12.
The 1000 Genomes Project provides a database of genomic variants from whole genome sequencing of 2504 individuals across five continental superpopulations. This database can enrich our background knowledge of worldwide blood group variant geographic distribution and identify novel variants of potential clinical significance.
The 1000 Genomes database was analyzed to 1) expand knowledge about continental distributions of known blood group variants, 2) identify novel variants with antigenic potential and their geographic association, and 3) establish a baseline scaffold of chromosomal coordinates to translate next-generation sequencing output files into a predicted red blood cell (RBC) phenotype.
Forty-two genes were investigated. A total of 604 known variants were mapped to the GRCh37 assembly; 120 of these were reported by 1000 Genomes in at least one superpopulation. All queried variants, including the ACKR1 promoter silencing mutation, are located within exon pull-down boundaries. The analysis yielded 41 novel population distributions for 34 known variants, as well as 12 novel blood group variants that warrant further validation and study. Four prediction algorithms collectively flagged 79 of 109 (72%) known antigenic or enzymatically detrimental blood group variants, while 4 of 12 variants that do not result in an altered RBC phenotype were flagged as deleterious.
Next-generation sequencing has known potential for high-throughput and extended RBC phenotype prediction; a database of GRCh37 and GRCh38 chromosomal coordinates for 120 worldwide blood group variants is provided as a basis for this clinical application.
千人基因组计划提供了一个数据库,该数据库包含来自五个大陆超级群体中2504个人的全基因组测序的基因组变异信息。这个数据库可以丰富我们对全球血型变异地理分布的背景知识,并识别具有潜在临床意义的新变异。
对千人基因组数据库进行分析,以:1)扩展关于已知血型变异的大陆分布的知识;2)识别具有抗原潜力的新变异及其地理关联;3)建立染色体坐标的基线支架,以便将下一代测序输出文件转化为预测的红细胞(RBC)表型。
研究了42个基因。总共604个已知变异被定位到GRCh37组装上;其中120个变异在至少一个超级群体中被千人基因组计划报道。所有查询的变异,包括ACKR1启动子沉默突变,都位于外显子下拉边界内。分析得出34个已知变异的41种新的群体分布,以及12个需要进一步验证和研究的新血型变异。四种预测算法共同标记了109个已知抗原性或酶促有害血型变异中的79个(72%),而12个未导致RBC表型改变的变异中有4个被标记为有害。
下一代测序在高通量和扩展RBC表型预测方面具有已知潜力;提供了120个全球血型变异的GRCh37和GRCh38染色体坐标数据库,作为这种临床应用的基础。