Healthcare Business Department, PFDeNA, Tokyo, Japan.
Preferred Networks, Tokyo, Japan.
Cancer Sci. 2022 Jun;113(6):2144-2166. doi: 10.1111/cas.15309. Epub 2022 Mar 14.
Liquid biopsy is expected to be a promising cancer screening method because of its low invasiveness and the possibility of detecting multiple types in a single test. In the last decade, many studies on cancer detection using small RNAs in blood have been reported. To put small RNA tests into practical use as a multiple cancer type screening test, it is necessary to develop a method that can be applied to multiple facilities. We collected samples of eight cancer types and healthy controls from 20 facilities to evaluate the performance of cancer type classification. A total of 2,475 cancer samples and 496 healthy control samples were collected using a standardized protocol. After obtaining a small RNA expression profile, we constructed a classification model and evaluated its performance. First, we investigated the classification performance using samples from five single facilities. Each model showed areas under the receiver curve (AUC) ranging from 0.67 to 0.89. Second, we performed principal component analysis (PCA) to examine the characteristics of the facilities. The degree of hemolysis and the data acquisition period affected the expression profiles. Finally, we constructed the classification model by reducing the influence of these factors, and its performance had an AUC of 0.76. The results reveal that small RNA can be used for the classification of cancer types in samples from a single facility. However, interfacility biases will affect the classification of samples from multiple facilities. These findings will provide important insights to improve the performance of multiple cancer type classifications using small RNA expression profiles acquired from multiple facilities.
液体活检因其微创性和单次检测中检测多种类型的可能性,有望成为一种很有前途的癌症筛查方法。在过去的十年中,已经有许多关于利用血液中小 RNA 进行癌症检测的研究。为了将小 RNA 检测作为一种多癌症类型筛查试验实际应用,有必要开发一种可应用于多种设备的方法。我们从 20 个机构收集了 8 种癌症类型和健康对照的样本,以评估癌症类型分类的性能。使用标准化方案共收集了 2475 个癌症样本和 496 个健康对照样本。在获得小 RNA 表达谱后,我们构建了一个分类模型并评估了其性能。首先,我们使用来自五个单一机构的样本调查了分类性能。每个模型的接收曲线下面积(AUC)均在 0.67 到 0.89 之间。其次,我们进行了主成分分析(PCA)以检查机构的特征。溶血程度和数据采集周期影响了表达谱。最后,我们通过减少这些因素的影响构建了分类模型,其 AUC 为 0.76。结果表明,小 RNA 可用于单个机构样本中癌症类型的分类。然而,机构间的偏差会影响来自多个机构的样本的分类。这些发现将为提高从小 RNA 表达谱中获取的多个机构的多癌症类型分类性能提供重要的见解。