Department of Breast and Endocrine Surgery, Graduate School of Comprehensive Human Sciences, University of Tsukuba, Ibaraki, Japan.
Department of Medical Genetics, Faculty of Medicine, University of Tsukuba, Ibaraki, Japan.
Cancer Sci. 2021 Jan;112(1):465-475. doi: 10.1111/cas.14720. Epub 2020 Nov 29.
Neoantigens have attracted attention as biomarkers or therapeutic targets. However, accurate prediction of neoantigens is still challenging, especially in terms of its accuracy and cost. Variant detection using RNA sequencing (RNA-seq) data has been reported to be a low-accuracy but cost-effective tool, but the feasibility of RNA-seq data for neoantigen prediction has not been fully examined. In the present study, we used whole-exome sequencing (WES) and RNA-seq data of tumor and matched normal samples from six breast cancer patients to evaluate the utility of RNA-seq data instead of WES data in variant calling to detect neoantigen candidates. Somatic variants were called in three protocols using: (i) tumor and normal WES data (DNA method, Dm); (ii) tumor and normal RNA-seq data (RNA method, Rm); and (iii) combination of tumor RNA-seq and normal WES data (Combination method, Cm). We found that the Rm had both high false-positive and high false-negative rates because this method depended greatly on the expression status of normal transcripts. When we compared the results of Dm with those of Cm, only 14% of the neoantigen candidates detected in Dm were identified in Cm, but the majority of the missed candidates lacked coverage or variant allele reads in the tumor RNA. In contrast, about 70% of the neoepitope candidates with higher expression and rich mutant transcripts could be detected in Cm. Our results showed that Cm could be an efficient and a cost-effective approach to predict highly expressed neoantigens in tumor samples.
新抗原作为生物标志物或治疗靶点引起了关注。然而,新抗原的准确预测仍然具有挑战性,尤其是在准确性和成本方面。据报道,使用 RNA 测序 (RNA-seq) 数据进行变体检测是一种低准确率但具有成本效益的工具,但尚未充分检查 RNA-seq 数据在新抗原预测方面的可行性。在本研究中,我们使用来自六名乳腺癌患者的肿瘤和匹配正常样本的全外显子测序 (WES) 和 RNA-seq 数据,评估 RNA-seq 数据代替 WES 数据在变体调用中检测新抗原候选物的效用。使用三种方案在三个方案中调用体细胞变体:(i)肿瘤和正常 WES 数据(DNA 方法,Dm);(ii)肿瘤和正常 RNA-seq 数据(RNA 方法,Rm);和(iii)肿瘤 RNA-seq 和正常 WES 数据的组合(组合方法,Cm)。我们发现 Rm 具有高假阳性率和高假阴性率,因为该方法严重依赖于正常转录本的表达状态。当我们将 Dm 的结果与 Cm 的结果进行比较时,在 Dm 中检测到的新抗原候选物只有 14%在 Cm 中被识别,但大多数错过的候选物在肿瘤 RNA 中缺乏覆盖或变异等位基因读数。相比之下,在 Cm 中可以检测到约 70%表达较高和富含突变转录本的新表位候选物。我们的结果表明,Cm 可能是一种高效且具有成本效益的方法,可以预测肿瘤样本中高表达的新抗原。