Fichou Yann, Berlivet Isabelle, Richard Gaëlle, Tournamille Christophe, Castilho Lilian, Férec Claude
EFS, Inserm, Univ Brest, UMR 1078, GGB, Brest, France.
Laboratoire d'Excellence GR-Ex, Paris, France.
Transfus Med Hemother. 2020 Feb;47(1):23-32. doi: 10.1159/000504584. Epub 2019 Dec 11.
In the novel era of blood group genomics, (re-)defining reference gene/allele sequences of blood group genes has become an important goal to achieve, both for diagnostic and research purposes. As novel potent sequencing technologies are available, we thought to investigate the variability encountered in the three most common alleles of , the gene encoding the clinically relevant Duffy antigens, at the haplotype level by a long-read sequencing approach.
After long-range PCR amplification spanning the whole gene locus (∼2.5 kilobases), amplicons generated from 81 samples with known genotypes were sequenced in a single read by using the Pacific Biosciences (PacBio) single molecule, real-time (SMRT) sequencing technology.
High-quality sequencing reads were obtained for the 162 alleles (accuracy >0.999). Twenty-two nucleotide variations reported in databases were identified, defining 19 haplotypes: four, eight, and seven haplotypes in 46 *, 63 *, and 53 * alleles, respectively.
Overall, we have defined a subset of reference alleles by third-generation (long-read) sequencing. This technology, which provides a "longitudinal" overview of the loci of interest (several thousand base pairs) and is complementary to the second-generation (short-read) next-generation sequencing technology, is of critical interest for resolving novel, rare, and null alleles.
在血型基因组学的新时代,为了诊断和研究目的,重新定义血型基因的参考基因/等位基因序列已成为一个重要目标。由于有了新型高效的测序技术,我们想用长读长测序方法在单倍型水平上研究编码临床相关达菲抗原的基因的三个最常见等位基因中的变异情况。
在对整个基因座(约2.5千碱基)进行长距离PCR扩增后,使用太平洋生物科学公司(PacBio)的单分子实时(SMRT)测序技术对来自81个已知基因型样本的扩增子进行单读长测序。
获得了162个等位基因的高质量测序读数(准确率>0.999)。鉴定出数据库中报告的22个核苷酸变异,定义了19种单倍型:分别在46个*、63个和53个等位基因中有4种、8种和7种单倍型。
总体而言,我们通过第三代(长读长)测序定义了一个参考等位基因子集。这项技术能够对感兴趣的基因座(几千个碱基对)进行“纵向”概述,并且是第二代(短读长)下一代测序技术的补充,对于解析新的、罕见的和无效等位基因至关重要。