Zhang Suhua, Bian Yingnan, Zhang Zheren, Zheng Hancheng, Wang Zheng, Zha Lagabaiyila, Cai Jifeng, Gao Yuzhen, Ji Chaoneng, Hou Yiping, Li Chengtao
Shanghai Key Laboratory of Forensic Medicine, Institute of Forensic Sciences, Ministry of Justice, P.R. China, Shanghai 200063, P.R. China.
State Key Laboratory of Genetic Engineering, Institute of Genetics, School of Life Sciences, Fudan University, Shanghai 200433, P.R. China.
Sci Rep. 2015 Dec 22;5:18683. doi: 10.1038/srep18683.
SNPs, abundant in human genome with lower mutation rate, are attractive to genetic application like forensic, anthropological and evolutionary studies. Universal SNPs showing little allelic frequency variation among populations while remaining highly informative for human identification were obtained from previous studies. However, genotyping tools target only dozens of markers simultaneously, limiting their applications. Here, 124 SNPs were simultaneous tested using Ampliseq technology with Ion Torrent PGM platform. Concordance study was performed with 2 reference samples of 9947A and 9948 between NGS and Sanger sequencing. Full concordance were obtained except genotype of rs576261 with 9947A. Parameter of FMAR (%) was introduced for NGS data analysis for the first time, evaluating allelic performance, sensitivity testing and mixture testing. FMAR values for accurate heterozygotes should be range from 50% to 60%, for homozygotes or Y-SNP should be above 90%. SNPs of rs7520386, rs4530059, rs214955, rs1523537, rs2342747, rs576261 and rs12997453 were recognized as poorly performing loci, either with allelic imbalance or with lower coverage. Sensitivity testing demonstrated that with DNA range from 10 ng-0.5 ng, all correct genotypes were obtained. For mixture testing, a clear linear correlation (R(2) = 0.9429) between the excepted FMAR and observed FMAR values of mixtures was observed.
单核苷酸多态性(SNPs)在人类基因组中大量存在且突变率较低,在法医、人类学和进化研究等遗传应用领域颇具吸引力。以往研究已获得在人群中等位基因频率变化较小但对人类识别仍具有高度信息价值的通用SNPs。然而,基因分型工具同时只能针对几十种标记物,限制了它们的应用。在此,利用Ion Torrent PGM平台的扩增子测序技术同时对124个SNPs进行了检测。使用9947A和9948这两个参考样本在二代测序(NGS)和桑格测序之间进行了一致性研究。除了9947A样本中rs576261的基因型外,其余均获得了完全一致的结果。首次引入了错误等位基因率(FMAR,%)参数用于NGS数据分析,以评估等位基因性能、灵敏度测试和混合样本测试。准确杂合子的FMAR值应在50%至60%之间,纯合子或Y染色体SNPs的FMAR值应高于90%。rs7520386、rs4530059、rs214955、rs1523537、rs2342747、rs576261和rs12997453这些SNPs被认为是性能不佳的位点,存在等位基因不平衡或覆盖率较低的情况。灵敏度测试表明,当DNA浓度范围为10 ng - 0.5 ng时,所有基因型均能正确获得。对于混合样本测试,观察到混合样本预期FMAR值与观察到的FMAR值之间存在明显的线性相关性(R(2) = 0.9429)。