Jopek Maksym A, Sieczczyński Michał, Pastuszak Krzysztof, Łapińska-Szumczyk Sylwia, Jassem Jacek, Żaczek Anna J, Rondina Matthew T, Supernat Anna
Laboratory of Translational Oncology, Intercollegiate Faculty of Biotechnology of the University of Gdańsk and the Medical University of Gdańsk, Gdańsk, Poland.
Centre of Biostatistics and Bioinformatics, Medical University of Gdańsk, Gdańsk, Poland.
Blood Adv. 2025 Mar 11;9(5):979-989. doi: 10.1182/bloodadvances.2024014008.
Ovarian cancer (OC) presents a diagnostic challenge, often resulting in poor patient outcomes. Platelet RNA sequencing, which reflects host response to disease, shows promise for earlier OC detection. This study examines the impact of sex, age, platelet count, and the training on cancer types other than OC on classification accuracy achieved in the previous platelet-alone training data set. A total of 339 samples from healthy donors and 1396 samples from patients with cancer, spanning 18 cancer types (including 135 OC cases) were analyzed. Logistic regression was applied to verify our classifiers' performance and interpretability. Models were tested at 100% specificity and 100% sensitivity levels. Incorporating patient age as an additional feature along with gene expression increased sensitivity from 68.6% to 72.6%. Models trained on data from both sexes and on female-only data achieved a sensitivity of 68.6% and 74.5%, respectively. Training solely on OC data reduced late-stage sensitivity from 69.1% to 44.1% but increased early-stage sensitivity from 66.7% to 69.7%. This study highlights the potential of platelet RNA profiling for OC detection and the importance of clinical variables in refining classification accuracy. Incorporating age with gene expression data may enhance OC diagnostic accuracy. The inclusion of male samples deteriorates classifier performance. Data from diverse cancer types improves advanced cancer detection but negatively affects early-stage diagnosis.
卵巢癌(OC)的诊断具有挑战性,常常导致患者预后不佳。血小板RNA测序能够反映宿主对疾病的反应,在早期卵巢癌检测方面显示出前景。本研究考察了性别、年龄、血小板计数以及针对非OC癌症类型的训练对先前仅使用血小板的训练数据集中分类准确性的影响。共分析了来自健康供体的339个样本以及来自癌症患者的1396个样本,涵盖18种癌症类型(包括135例OC病例)。应用逻辑回归来验证我们分类器的性能和可解释性。模型在100%特异性和100%敏感性水平下进行测试。将患者年龄作为附加特征与基因表达一起纳入,可使敏感性从68.6%提高到72.6%。在来自两性的数据和仅来自女性的数据上训练的模型,敏感性分别达到68.6%和74.5%。仅在OC数据上进行训练会使晚期敏感性从69.1%降至44.1%,但会使早期敏感性从66.7%提高到69.7%。本研究突出了血小板RNA谱分析在OC检测中的潜力以及临床变量在提高分类准确性方面的重要性。将年龄与基因表达数据相结合可能会提高OC诊断准确性。纳入男性样本会降低分类器性能。来自多种癌症类型的数据可改善晚期癌症检测,但对早期诊断有负面影响。