Sebastian Alvaro, Herdegen Magdalena, Migalska Magdalena, Radwan Jacek
Evolutionary Biology Group, Faculty of Biology, Adam Mickiewicz University, ul. Umultowska 89, 61-614, Poznan, Poland.
Mol Ecol Resour. 2016 Mar;16(2):498-510. doi: 10.1111/1755-0998.12453. Epub 2015 Aug 26.
Next-generation sequencing (NGS) technologies are revolutionizing the fields of biology and medicine as powerful tools for amplicon sequencing (AS). Using combinations of primers and barcodes, it is possible to sequence targeted genomic regions with deep coverage for hundreds, even thousands, of individuals in a single experiment. This is extremely valuable for the genotyping of gene families in which locus-specific primers are often difficult to design, such as the major histocompatibility complex (MHC). The utility of AS is, however, limited by the high intrinsic sequencing error rates of NGS technologies and other sources of error such as polymerase amplification or chimera formation. Correcting these errors requires extensive bioinformatic post-processing of NGS data. Amplicon Sequence Assignment (AMPLISAS) is a tool that performs analysis of AS results in a simple and efficient way, while offering customization options for advanced users. AMPLISAS is designed as a three-step pipeline consisting of (i) read demultiplexing, (ii) unique sequence clustering and (iii) erroneous sequence filtering. Allele sequences and frequencies are retrieved in excel spreadsheet format, making them easy to interpret. AMPLISAS performance has been successfully benchmarked against previously published genotyped MHC data sets obtained with various NGS technologies.
下一代测序(NGS)技术作为扩增子测序(AS)的强大工具,正在彻底改变生物学和医学领域。通过引物和条形码的组合,可以在单个实验中对数百甚至数千个个体的靶向基因组区域进行深度覆盖测序。这对于基因家族的基因分型极为有价值,因为在这些基因家族中,位点特异性引物往往难以设计,例如主要组织相容性复合体(MHC)。然而,AS的效用受到NGS技术的高固有测序错误率以及其他错误来源(如聚合酶扩增或嵌合体形成)的限制。纠正这些错误需要对NGS数据进行广泛的生物信息学后处理。扩增子序列分配(AMPLISAS)是一种工具,它以简单高效的方式对AS结果进行分析,同时为高级用户提供定制选项。AMPLISAS被设计为一个三步流程,包括(i)读取解复用、(ii)独特序列聚类和(iii)错误序列过滤。等位基因序列和频率以Excel电子表格格式获取,便于解释。AMPLISAS的性能已成功地与之前使用各种NGS技术获得的已发表的基因分型MHC数据集进行了基准测试。