Pyatnitskiy Mikhail A, Arzumanian Viktoriia A, Radko Sergey P, Ptitsyn Konstantin G, Vakhrushev Igor V, Poverennaya Ekaterina V, Ponomarenko Elena A
Institute of Biomedical Chemistry, 119121 Moscow, Russia.
Federal Research and Clinical Center of Physical-Chemical Medicine, 119435 Moscow, Russia.
Biology (Basel). 2021 Nov 4;10(11):1131. doi: 10.3390/biology10111131.
Long-read direct RNA sequencing developed by Oxford Nanopore Technologies (ONT) is quickly gaining popularity for transcriptome studies, while fast turnaround time and low cost make it an attractive instrument for clinical applications. There is a growing interest to utilize transcriptome data to unravel activated biological processes responsible for disease progression and response to therapies. This trend is of particular interest for precision medicine which aims at single-patient analysis. Here we evaluated whether gene abundances measured by MinION direct RNA sequencing are suited to produce robust estimates of pathway activation for single sample scoring methods. We performed multiple RNA-seq analyses for a single sample that originated from the HepG2 cell line, namely five ONT replicates, and three replicates using Illumina NovaSeq. Two pathway scoring methods were employed-ssGSEA and singscore. We estimated the ONT performance in terms of detected protein-coding genes and average pairwise correlation between pathway activation scores using an exhaustive computational scheme for all combinations of replicates. In brief, we found that at least two ONT replicates are required to obtain reproducible pathway scores for both algorithms. We hope that our findings may be of interest to researchers planning their ONT direct RNA-seq experiments.
由牛津纳米孔技术公司(ONT)开发的长读长直接RNA测序在转录组研究中迅速受到欢迎,其快速的周转时间和低成本使其成为临床应用中颇具吸引力的工具。利用转录组数据来揭示导致疾病进展和对治疗反应的激活生物过程的兴趣日益浓厚。这一趋势对于旨在进行单患者分析的精准医学尤为重要。在这里,我们评估了通过MinION直接RNA测序测量的基因丰度是否适合为单样本评分方法生成可靠的通路激活估计值。我们对源自HepG2细胞系的单个样本进行了多次RNA测序分析,即五个ONT重复样本,以及使用Illumina NovaSeq的三个重复样本。采用了两种通路评分方法——单样本基因集富集分析(ssGSEA)和单样本评分(singscore)。我们使用针对所有重复样本组合的详尽计算方案,从检测到的蛋白质编码基因以及通路激活分数之间的平均成对相关性方面评估了ONT的性能。简而言之,我们发现至少需要两个ONT重复样本才能为这两种算法获得可重复的通路分数。我们希望我们的发现可能会引起计划进行ONT直接RNA测序实验的研究人员的兴趣。