Oketch Daisy J A, Giulietti Matteo, Piva Francesco
Department of Specialistic Clinical and Odontostomatological Sciences, Polytechnic University of Marche, 60131 Ancona, Italy.
Biomedicines. 2024 Aug 5;12(8):1759. doi: 10.3390/biomedicines12081759.
Single-cell RNA sequencing (scRNA-seq) technique has enabled detailed analysis of gene expression at the single cell level, enhancing the understanding of subtle mechanisms that underly pathologies and drug resistance. To derive such biological meaning from sequencing data in oncology, some critical processing must be performed, including identification of the tumor cells by markers and algorithms that infer copy number variations (CNVs). We compared the performance of sciCNV, InferCNV, CopyKAT and SCEVAN tools that identify tumor cells by inferring CNVs from scRNA-seq data. Sequencing data from Pancreatic Ductal Adenocarcinoma (PDAC) patients, adjacent and healthy tissues were analyzed, and the predicted tumor cells were compared to those identified by well-assessed PDAC markers. Results from InferCNV, CopyKAT and SCEVAN overlapped by less than 30% with InferCNV showing the highest sensitivity (0.72) and SCEVAN the highest specificity (0.75). We show that the predictions are highly dependent on the sample and the software used, and that they return so many false positives hence are of little use in verifying or filtering predictions made via tumor biomarkers. We highlight how critical this processing can be, warn against the blind use of these software and point out the great need for more reliable algorithms.
单细胞RNA测序(scRNA-seq)技术能够在单细胞水平上对基因表达进行详细分析,加深了我们对疾病病理和耐药性潜在微妙机制的理解。为了从肿瘤学测序数据中获取此类生物学意义,必须进行一些关键处理,包括通过标记物和推断拷贝数变异(CNV)的算法来识别肿瘤细胞。我们比较了sciCNV、InferCNV、CopyKAT和SCEVAN工具从scRNA-seq数据推断CNV以识别肿瘤细胞的性能。分析了来自胰腺导管腺癌(PDAC)患者、癌旁组织和健康组织的测序数据,并将预测的肿瘤细胞与通过评估良好的PDAC标记物识别的肿瘤细胞进行比较。InferCNV、CopyKAT和SCEVAN的结果重叠率不到30%,其中InferCNV的灵敏度最高(0.72),SCEVAN的特异性最高(0.75)。我们表明,预测高度依赖于所使用的样本和软件,而且它们返回了大量假阳性结果,因此在验证或筛选通过肿瘤生物标志物做出的预测方面几乎没有用处。我们强调了这一处理过程的关键程度,警告不要盲目使用这些软件,并指出迫切需要更可靠的算法。