Improving platelet-RNA-based diagnostics: a comparative analysis of machine learning models for cancer detection and multiclass classification.

Affiliations

Laboratory of Translational Oncology, Intercollegiate Faculty of Biotechnology of the University of Gdańsk and the Medical University of Gdańsk, Poland.

Centre of Biostatistics and Bioinformatics, Medical University of Gdańsk, Poland.

Publication information

Mol Oncol. 2024 Nov;18(11):2743-2754. doi: 10.1002/1878-0261.13689. Epub 2024 Jun 17.

DOI:10.1002/1878-0261.13689
PMID:38887841
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11547247/
Abstract

Liquid biopsy demonstrates excellent potential in patient management by providing a minimally invasive and cost-effective approach to detecting and monitoring cancer, even at its early stages. Due to the complexity of liquid biopsy data, machine-learning techniques are increasingly gaining attention in sample analysis, especially for multidimensional data such as RNA expression profiles. Yet, there is no agreement in the community on which methods are the most effective or how to process the data. To circumvent this, we performed a large-scale study using various machine-learning techniques. First, we took a closer look at existing datasets and filtered out some patients to assert data collection quality. The final data collection included platelet RNA samples acquired from 1397 cancer patients (17 types of cancer) and 354 asymptomatic, presumed healthy, donors. Then, we assessed an array of different machine-learning models and techniques (e.g., feature selection of RNA transcripts) in pan-cancer detection and multiclass classification. Our results show that simple logistic regression performs the best, reaching a 68% cancer detection rate at a 99% specificity level, and multiclass classification accuracy of 79.38% when distinguishing between five cancer types. In summary, by revisiting classical machine-learning models, we have exceeded the previously used method by 5% and 9.65% in cancer detection and multiclass classification, respectively. To ease further research, we open-source our code and data processing pipelines (https://gitlab.com/jopekmaksym/improving-platelet-rna-based-diagnostics), which we hope will serve the community as a strong baseline.
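The headline metric in the abstract, the cancer detection rate at a 99% specificity level, can be illustrated with a minimal sketch. This is not the authors' pipeline (their code is in the GitLab repository linked above); the function name `sensitivity_at_specificity`, the thresholding rule, and the toy scores are assumptions made purely for illustration.

```python
import math


def sensitivity_at_specificity(scores, labels, target_specificity=0.99):
    """Return (threshold, sensitivity, specificity) for the lowest threshold
    whose specificity on the negatives is at least `target_specificity`.

    scores: predicted probability of cancer per sample (higher = more suspicious)
    labels: 1 for cancer, 0 for presumed-healthy donor
    A sample is called positive when its score is strictly above the threshold.
    """
    negatives = sorted(s for s, y in zip(scores, labels) if y == 0)
    positives = [s for s, y in zip(scores, labels) if y == 1]
    if not negatives or not positives:
        raise ValueError("need at least one sample of each class")

    # Number of negatives that must score at or below the threshold.
    needed = max(math.ceil(target_specificity * len(negatives)), 1)
    threshold = negatives[needed - 1]

    true_negatives = sum(1 for s in negatives if s <= threshold)
    true_positives = sum(1 for s in positives if s > threshold)
    return (threshold,
            true_positives / len(positives),
            true_negatives / len(negatives))


# Toy data (hypothetical): four healthy donors followed by four cancer patients.
toy_scores = [0.1, 0.2, 0.3, 0.9, 0.25, 0.6, 0.8, 0.95]
toy_labels = [0, 0, 0, 0, 1, 1, 1, 1]
print(sensitivity_at_specificity(toy_scores, toy_labels, target_specificity=0.75))
# prints (0.3, 0.75, 0.75)
```

Fixing the specificity first and then reading off the sensitivity reflects the screening setting described in the abstract, where false positives among presumed-healthy donors must stay rare. With only a few hundred negative samples, the threshold at 99% specificity is determined by a handful of the highest-scoring healthy donors, so the resulting detection rate can vary noticeably between data splits.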

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e033/11547247/9eea5cdc8413/MOL2-18-2743-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e033/11547247/7f21f2135e02/MOL2-18-2743-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e033/11547247/4a19df0dbb48/MOL2-18-2743-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e033/11547247/93c989f5b952/MOL2-18-2743-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e033/11547247/dd3059df66a0/MOL2-18-2743-g003.jpg

Cited by

1
Impact of clinical factors on accuracy of ovarian cancer detection via platelet RNA profiling.
Blood Adv. 2025 Mar 11;9(5):979-989. doi: 10.1182/bloodadvances.2024014008.
