Nishio Shin-Ya, Moteki Hideaki, Usami Shin-Ichi
Department of Otorhinolaryngology, Shinshu University School of Medicine, Matsumoto City, Japan.
Mol Genet Genomic Med. 2018 Apr 6;6(4):678-86. doi: 10.1002/mgg3.399.
Recent advances in molecular genetic analysis using next-generation sequencing (NGS) have drastically accelerated the identification of disease-causing gene mutations. Most next-generation sequencing analyses of inherited diseases have mainly focused on single-nucleotide variants and short indels, although, recently, structure variations including copy number variations have come to be considered an important cause of many different diseases. However, only a limited number of tools are available for multiplex PCR-based target genome enrichment.
In this paper, we reported a simple and efficient copy number variation visualization method for Ion AmpliSeq™ target resequencing data. Unlike the hybridization capture-based target genome enrichment system, Ion AmpliSeq™ reads are multiplex PCR products, and each read generated by the same amplicon is quite uniform in length and position. Based on this feature, the depth of coverage information for each amplicon included in the barcode/amplicon coverage matrix file was used for copy number detection analysis. We also performed copy number analysis to investigate the utility of this method through the use of positive controls and a large Japanese hearing loss cohort.
Using this method, we successfully confirmed previously reported copy number loss cases involving the STRC gene and copy number gain in trisomy 21 cases. We also performed copy number analysis of a large Japanese hearing loss cohort (2,475 patients) and identified many gene copy number variants. The most prevalent copy number variation was STRC gene copy number loss, with 129 patients carrying this copy number variation.
Our copy number visualization method for Ion AmpliSeq™ data can be utilized in efficient copy number analysis for the comparison of a large number of samples. This method is simple and requires only easy calculations using standard spread sheet software.
利用新一代测序(NGS)进行的分子遗传分析的最新进展极大地加速了致病基因突变的鉴定。大多数遗传性疾病的新一代测序分析主要集中在单核苷酸变异和短插入缺失上,不过,最近包括拷贝数变异在内的结构变异已被视为许多不同疾病的重要病因。然而,基于多重PCR的目标基因组富集可用的工具数量有限。
在本文中,我们报告了一种用于Ion AmpliSeq™目标重测序数据的简单高效的拷贝数变异可视化方法。与基于杂交捕获的目标基因组富集系统不同,Ion AmpliSeq™读取的是多重PCR产物,并且由同一扩增子产生的每个读取在长度和位置上都相当均匀。基于这一特征,条形码/扩增子覆盖矩阵文件中包含的每个扩增子的覆盖深度信息被用于拷贝数检测分析。我们还通过使用阳性对照和一个大型日本听力损失队列进行拷贝数分析,以研究该方法的实用性。
使用该方法,我们成功确认了先前报道的涉及STRC基因的拷贝数缺失病例以及21三体病例中的拷贝数增加。我们还对一个大型日本听力损失队列(2475名患者)进行了拷贝数分析,并鉴定出许多基因拷贝数变异。最常见的拷贝数变异是STRC基因拷贝数缺失,有129名患者携带这种拷贝数变异。
我们针对Ion AmpliSeq™数据的拷贝数可视化方法可用于对大量样本进行比较的高效拷贝数分析。该方法简单,只需要使用标准电子表格软件进行简单计算。