Center for Clinical Genomics, Kanazawa Medical University Hospital, 1-1 Daigaku, Uchinada, Kahoku, Ishikawa, 920-0923, Japan.
Division of Genomic Medicine, Department of Advanced Medicine, Medical Research Institute, Kanazawa Medical University, 1-1 Daigaku, Uchinada, Kahoku, Ishikawa, 920-0923, Japan.
Sci Rep. 2022 Jun 22;12(1):10599. doi: 10.1038/s41598-022-14902-7.
The full-length double-strand cDNA sequencing, one of the RNA-Seq methods, is a powerful method used to investigate the transcriptome status of a gene of interest, such as its transcription level and alternative splicing variants. Furthermore, full-length double-strand cDNA sequencing has the advantage that it can create a library from a small amount of sample and the library can be applied to long-read sequencers in addition to short-read sequencers. Nevertheless, one of our previous studies indicated that the full-length double-strand cDNA sequencing yields non-specific genomic DNA amplification, affecting transcriptome analysis, such as transcript quantification and alternative splicing analysis. In this study, it was confirmed that it is possible to produce the RNA-Seq library from only genomic DNA and that the full-length double-strand cDNA sequencing of genomic DNA yielded non-specific genomic DNA amplification. To avoid non-specific genomic DNA amplification, two methods were examined, which are the DNase I-treated full-length double-strand cDNA sequencing and poly(A) capture full-length double-strand cDNA sequencing. Contrary to expectations, the non-specific genomic DNA amplification was increased and the number of the detected expressing genes was reduced in DNase I-treated full-length double-strand cDNA sequencing. On the other hand, in the poly(A) capture full-length double-strand cDNA sequencing, the non-specific genomic DNA amplification was significantly reduced, accordingly the accuracy and the number of detected expressing genes and splicing events were increased. The expression pattern and percentage spliced in index of splicing events were highly correlated. Our results indicate that the poly(A) capture full-length double-strand cDNA sequencing improves transcript quantification accuracy and the detection ability of alternative splicing events. It is also expected to contribute to the determination of the significance of DNA variants to splicing events.
全长双链 cDNA 测序是 RNA-Seq 方法之一,是一种强大的方法,用于研究感兴趣基因的转录组状态,例如转录水平和选择性剪接变体。此外,全长双链 cDNA 测序的优点是可以从小量样本中创建文库,并且该文库除了短读测序器之外还可以应用于长读测序器。然而,我们之前的一项研究表明,全长双链 cDNA 测序会产生非特异性基因组 DNA 扩增,从而影响转录组分析,例如转录定量和选择性剪接分析。在这项研究中,证实了仅从基因组 DNA 就可以产生 RNA-Seq 文库,并且全长双链 cDNA 测序的基因组 DNA 会产生非特异性基因组 DNA 扩增。为了避免非特异性基因组 DNA 扩增,检查了两种方法,即 DNase I 处理的全长双链 cDNA 测序和 poly(A) 捕获全长双链 cDNA 测序。出乎意料的是,在 DNase I 处理的全长双链 cDNA 测序中,非特异性基因组 DNA 扩增增加,并且检测到的表达基因数量减少。另一方面,在 poly(A) 捕获全长双链 cDNA 测序中,非特异性基因组 DNA 扩增显著减少,因此,检测到的表达基因和剪接事件的数量和准确性增加。表达模式和剪接事件的拼接索引百分比高度相关。我们的结果表明,poly(A) 捕获全长双链 cDNA 测序提高了转录定量准确性和选择性剪接事件的检测能力。它还有望有助于确定 DNA 变体对剪接事件的意义。