Gould Genevieve M, Grauman Peter V, Theilmann Mark R, Spurka Lindsay, Wang Irving E, Melroy Laura M, Chin Robert G, Hite Dustin H, Chu Clement S, Maguire Jared R, Hogan Gregory J, Muzzey Dale
Counsyl, 180 Kimball Way, South San Francisco, CA, 94080, USA.
BMC Med Genet. 2018 Sep 29;19(1):176. doi: 10.1186/s12881-018-0691-9.
Hereditary cancer screening (HCS) for germline variants in the 3' exons of PMS2, a mismatch repair gene implicated in Lynch syndrome, is technically challenging due to homology with its pseudogene PMS2CL. Sequences of PMS2 and PMS2CL are so similar that next-generation sequencing (NGS) of short fragments-common practice in multigene HCS panels-may identify the presence of a variant but fail to disambiguate whether its origin is the gene or the pseudogene. Molecular approaches utilizing longer DNA fragments, such as long-range PCR (LR-PCR), can definitively localize variants in PMS2, yet applying such testing to all samples can have logistical and economic drawbacks.
To address these drawbacks, we propose and characterize a reflex workflow for variant discovery in the 3' exons of PMS2. We cataloged the natural variation in PMS2 and PMS2CL in 707 samples and designed hybrid-capture probes to enrich the gene and pseudogene with equal efficiency. For PMS2 exon 11, NGS reads were aligned, filtered using gene-specific variants, and subject to standard diploid variant calling. For PMS2 exons 12-15, the NGS reads were permissively aligned to PMS2, and variant calling was performed with the expectation of observing four alleles (i.e., tetraploid calling). In this reflex workflow, short-read NGS identifies potentially reportable variants that are then subject to disambiguation via LR-PCR-based testing.
Applying short-read NGS screening to 299 HCS samples and cell lines demonstrated >99% analytical sensitivity and >99% analytical specificity for single-nucleotide variants (SNVs) and short insertions and deletions (indels), as well as >96% analytical sensitivity and >99% analytical specificity for copy-number variants. Importantly, 92% of samples had resolved genotypes from short-read NGS alone, with the remaining 8% requiring LR-PCR reflex.
Our reflex workflow mitigates the challenges of screening in PMS2 and serves as a guide for clinical laboratories performing multigene HCS. To facilitate future exploration and testing of PMS2 variants, we share the raw and processed LR-PCR data from commercially available cell lines, as well as variant frequencies from a diverse patient cohort.
错配修复基因PMS2的3'外显子中的种系变异的遗传性癌症筛查(HCS)在技术上具有挑战性,因为它与其假基因PMS2CL具有同源性。PMS2和PMS2CL的序列非常相似,以至于多基因HCS面板中的常见做法——短片段的下一代测序(NGS)——可能会识别出变异的存在,但无法确定其起源是基因还是假基因。利用较长DNA片段的分子方法,如长距离PCR(LR-PCR),可以明确地将变异定位在PMS2中,但将这种检测应用于所有样本可能存在后勤和经济方面的缺点。
为了解决这些缺点,我们提出并描述了一种用于发现PMS2 3'外显子中变异的反射式工作流程。我们对707个样本中PMS2和PMS2CL的自然变异进行了编目,并设计了杂交捕获探针以同等效率富集基因和假基因。对于PMS2外显子11,将NGS读数进行比对,使用基因特异性变异进行过滤,并进行标准的二倍体变异调用。对于PMS2外显子12 - 15,将NGS读数宽松地比对到PMS2上,并在期望观察到四个等位基因的情况下进行变异调用(即四倍体调用)。在这个反射式工作流程中,短读长NGS识别出潜在的可报告变异,然后通过基于LR-PCR的检测进行区分。
将短读长NGS筛查应用于299个HCS样本和细胞系,对单核苷酸变异(SNV)、短插入和缺失(indel)显示出>99%的分析灵敏度和>99%的分析特异性,对拷贝数变异显示出>96%的分析灵敏度和>99%的分析特异性。重要的是,92%的样本仅通过短读长NGS就解析出了基因型,其余8%需要LR-PCR反射检测。
我们的反射式工作流程减轻了PMS2筛查的挑战,并为进行多基因HCS的临床实验室提供了指导。为了便于未来对PMS2变异的探索和检测,我们分享了来自市售细胞系的原始和处理后的LR-PCR数据,以及来自不同患者队列的变异频率。