Wingo Thomas S, Kotlar Alex, Cutler David J
Division of Neurology, Atlanta VA Medical Center, Decatur, 30033, GA, USA.
Department of Neurology, Emory University School of Medicine, Atlanta, 30322, GA, USA.
BMC Bioinformatics. 2017 Jan 5;18(1):14. doi: 10.1186/s12859-016-1453-3.
Targeted resequencing offers a cost-effective alternative to whole-genome and whole-exome sequencing when investigating regions known to be associated with a trait or disease. There are a number of approaches to targeted resequencing, including microfluidic PCR amplification, which may be enhanced by multiplex PCR. Currently, there is no open-source software that can design next-generation multiplex PCR experiments that ensures primers are unique at a genome-level and efficiently pools compatible primers.
We present MPD, a software package that automates the design of multiplex PCR primers for next-generation sequencing. The core of MPD is implemented in C for speed and uses a hashed genome to ensure primer uniqueness, avoids placing primers over sites of known variation, and efficiently pools compatible primers. A JavaScript web application ( http://multiplexprimer.io ) utilizing the MPD Perl package provides a convenient platform for users to make designs. Using a realistic set of genes identified by genome-wide association studies (GWAS), we achieve 90% coverage of all exonic regions using stringent design criteria. Using the first 47 primer pools for wet-lab validation, we sequenced ~25Kb at 99.7% completeness with a mean coverage of 300X among 313 samples simultaneously and identified 224 variants. The number and nature of variants we observe are consistent with high quality sequencing.
MPD can successfully design multiplex PCR experiments suitable for next-generation sequencing, and simplifies retooling targeted resequencing pipelines to focus on new targets as new genetic evidence emerges.
在研究已知与某一性状或疾病相关的区域时,靶向重测序为全基因组测序和全外显子组测序提供了一种经济高效的替代方法。靶向重测序有多种方法,包括微流控PCR扩增,多重PCR可对其进行增强。目前,尚无能够设计确保引物在基因组水平上具有唯一性并能有效汇集兼容引物的新一代多重PCR实验的开源软件。
我们展示了MPD,这是一个用于为新一代测序自动设计多重PCR引物的软件包。MPD的核心用C语言实现以提高速度,并使用哈希基因组来确保引物的唯一性,避免在已知变异位点上放置引物,并有效汇集兼容引物。利用MPD Perl包的JavaScript网络应用程序(http://multiplexprimer.io)为用户提供了一个方便的设计平台。使用通过全基因组关联研究(GWAS)确定的一组实际基因,我们采用严格的设计标准实现了对所有外显子区域90%的覆盖。使用前47个引物池进行湿实验室验证,我们在313个样本中同时对约25Kb进行了测序,完整性达到99.7%,平均覆盖度为300X,并鉴定出224个变异。我们观察到的变异数量和性质与高质量测序结果一致。
MPD能够成功设计适用于新一代测序的多重PCR实验,并在新的遗传证据出现时简化重新调整靶向重测序流程以专注于新目标的过程。