Yorkshire Regional Genetics Service, St. James's University Hospital, Leeds, United Kingdom; MRC Medical Bioinformatics Centre, Leeds Institute for Data Analytics, St. James's University Hospital, Leeds, United Kingdom; MRC Single Cell Functional Genomics Centre, University of Leeds, St. James's University Hospital, Leeds, United Kingdom.
Yorkshire Regional Genetics Service, St. James's University Hospital, Leeds, United Kingdom.
J Mol Diagn. 2017 Nov;19(6):933-940. doi: 10.1016/j.jmoldx.2017.08.002. Epub 2017 Sep 1.
Like many clinical diagnostic laboratories, the Yorkshire Regional Genetics Service undertakes routine investigation of cancer-predisposed individuals by high-throughput sequencing of patient DNA that has been target-enriched for genes associated with hereditary cancer. Accurate diagnosis using such reagents requires alertness regarding rare nonpathogenic variants that may interfere with variant calling. In a cohort of 2042 such cases, we identified 5 that initially appeared to be carriers of a 95-bp deletion of SMAD4 intron 6. More detailed analysis indicated that these individuals all carried one copy of a SMAD4 processed gene. Because of its interference with diagnostic analysis, we characterized this processed gene in detail. Whole-genome sequencing and confirmatory Sanger sequencing of junction PCR products were used to show that in each of the 5 cases, the SMAD4 processed gene was integrated at the same position on chromosome 9, located within the last intron of the SCAI gene. This rare polymorphic processed gene therefore reflects the occurrence of a single ancestral retrotransposition event. Compared to the reference SMAD4 mRNA sequence NM_005359.5 (https://www.ncbi.nlm.nih.gov/nucleotide), the 5' and 3' untranslated regions of the processed gene are both truncated, but its open reading frame is unaltered. Our experience leads us to advocate the use of an RNA-seq aligner as part of diagnostic assay quality assurance, since this allows recognition of processed pseudogenes in a comparatively facile automated fashion.
与许多临床诊断实验室一样,约克郡地区遗传学服务机构通过对目标富集与遗传性癌症相关基因的患者 DNA 进行高通量测序,对易患癌症的个体进行常规检测。使用此类试剂进行准确诊断需要警惕可能干扰变异体检测的罕见非致病性变异体。在 2042 例此类病例的队列中,我们发现了 5 例最初似乎携带 SMAD4 内含子 6 处 95bp 缺失的携带者。更详细的分析表明,这些个体均携带一个 SMAD4 处理基因的拷贝。由于其对诊断分析的干扰,我们详细描述了该处理基因。我们使用全基因组测序和连接 PCR 产物的确认性 Sanger 测序,表明在 5 例中的每一例中,SMAD4 处理基因均位于 9 号染色体上的相同位置,位于 SCAI 基因的最后一个内含子内。因此,这种罕见的多态性处理基因反映了单个祖先逆转座事件的发生。与参考 SMAD4 mRNA 序列 NM_005359.5(https://www.ncbi.nlm.nih.gov/nucleotide)相比,处理基因的 5' 和 3' 非翻译区均被截断,但开放阅读框未改变。我们的经验使我们主张在诊断检测质量保证中使用 RNA-seq 比对器,因为这可以以相对简单的自动化方式识别处理后的假基因。