• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

OncoOrigin的验证:一种用于原发性癌症部位预测的集成人工智能工具,带有图形用户界面以促进临床应用。

Validation of OncoOrigin: An Integrative AI Tool for Primary Cancer Site Prediction with Graphical User Interface to Facilitate Clinical Application.

作者信息

Brlek Petar, Bulić Luka, Shah Nidhi, Shah Parth, Primorac Dragan

机构信息

St. Catherine Specialty Hospital, 10000 Zagreb, Croatia.

School of Medicine, Josip Juraj Strossmayer University of Osijek, 31000 Osijek, Croatia.

出版信息

Int J Mol Sci. 2025 Mar 13;26(6):2568. doi: 10.3390/ijms26062568.

DOI:10.3390/ijms26062568
PMID:40141210
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11942019/
Abstract

Cancers of unknown primary (CUPs) represent a significant diagnostic and therapeutic challenge in the field of oncology. Due to the limitations of current diagnostic tools in these cases, novel approaches must be brought forward to improve treatment outcomes for these patients. The objective of this study was to develop a machine-learning-based software for primary cancer site prediction (OncoOrigin), based on genetic data acquired from tumor DNA sequencing. By design, this was an diagnostic study, conducted using data from the cBioPortal database (accessed on 21 September 2024) and several data processing and machine learning Python libraries. The study involved over 20,000 tumor samples with information on patient age, sex, and the presence of genetic variants in over 600 genes. The main outcome of interest was machine-learning-based discrimination between cancer site classes. Model quality was assessed by training set cross-validation and evaluation on a segregated test set. Finally, the optimal model was incorporated with a graphical user interface into the OncoOrigin software. Feature importance for class discrimination was also determined on the optimal model. Out of the four tested machine learning estimators, the XGBoostClassifier-based model proved superior in test set evaluation, with a top-2 accuracy of 0.91 and ROC-AUC of 0.97. Unlike other machine learning models published in the literature, OncoOrigin stands out as the only one integrated with a graphical user interface, which is crucial for facilitating its use by oncology specialists in everyday clinical practice, where its application and implementation will have the greatest value in the future.

摘要

原发灶不明的癌症(CUPs)是肿瘤学领域一项重大的诊断和治疗挑战。由于当前诊断工具在这些病例中的局限性,必须提出新的方法来改善这些患者的治疗效果。本研究的目的是基于从肿瘤DNA测序获得的基因数据,开发一种基于机器学习的原发性癌症部位预测软件(OncoOrigin)。从设计上来说,这是一项诊断性研究,使用了来自cBioPortal数据库(于2024年9月21日访问)的数据以及几个数据处理和机器学习的Python库。该研究涉及超过20000个肿瘤样本,包含患者年龄、性别以及600多个基因中基因变异的存在情况等信息。主要关注的结果是基于机器学习对癌症部位类别进行区分。通过训练集交叉验证和在隔离测试集上的评估来评估模型质量。最后,将最优模型与图形用户界面整合到OncoOrigin软件中。还在最优模型上确定了类别区分的特征重要性。在四个测试的机器学习估计器中,基于XGBoostClassifier的模型在测试集评估中表现出色,前两名准确率为0.91,ROC-AUC为0.97。与文献中发表的其他机器学习模型不同,OncoOrigin是唯一集成了图形用户界面的模型,这对于肿瘤学专家在日常临床实践中使用它至关重要,其应用和实施在未来将具有最大价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/44b164b64060/ijms-26-02568-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/06d99f0f3fc8/ijms-26-02568-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/d4bedd4fae68/ijms-26-02568-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/6a22a6155769/ijms-26-02568-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/66c5e92905b0/ijms-26-02568-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/947acb34bde7/ijms-26-02568-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/efb3a3b67675/ijms-26-02568-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/44b164b64060/ijms-26-02568-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/06d99f0f3fc8/ijms-26-02568-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/d4bedd4fae68/ijms-26-02568-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/6a22a6155769/ijms-26-02568-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/66c5e92905b0/ijms-26-02568-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/947acb34bde7/ijms-26-02568-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/efb3a3b67675/ijms-26-02568-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65e2/11942019/44b164b64060/ijms-26-02568-g007.jpg

相似文献

1
Validation of OncoOrigin: An Integrative AI Tool for Primary Cancer Site Prediction with Graphical User Interface to Facilitate Clinical Application.OncoOrigin的验证:一种用于原发性癌症部位预测的集成人工智能工具,带有图形用户界面以促进临床应用。
Int J Mol Sci. 2025 Mar 13;26(6):2568. doi: 10.3390/ijms26062568.
2
CUP-AI-Dx: A tool for inferring cancer tissue of origin and molecular subtype using RNA gene-expression data and artificial intelligence.CUP-AI-Dx:一种使用 RNA 基因表达数据和人工智能推断癌症组织来源和分子亚型的工具。
EBioMedicine. 2020 Nov;61:103030. doi: 10.1016/j.ebiom.2020.103030. Epub 2020 Oct 9.
3
Machine learning random forest for predicting oncosomatic variant NGS analysis.机器学习随机森林预测肿瘤体细胞变异 NGS 分析。
Sci Rep. 2021 Nov 8;11(1):21820. doi: 10.1038/s41598-021-01253-y.
4
An Advanced Machine Learning Model for a Web-Based Artificial Intelligence-Based Clinical Decision Support System Application: Model Development and Validation Study.基于人工智能的临床决策支持系统的基于网络的人工智能临床决策支持系统应用的高级机器学习模型:模型开发和验证研究。
J Med Internet Res. 2024 Sep 4;26:e56022. doi: 10.2196/56022.
5
An Application of Machine-Learning-Oriented Radiomics Model in Clear Cell Renal Cell Carcinoma (ccRCC) Early Diagnosis.面向机器学习的影像组学模型在透明细胞肾细胞癌(ccRCC)早期诊断中的应用
Br J Hosp Med (Lond). 2024 Nov 30;85(11):1-19. doi: 10.12968/hmed.2024.0238. Epub 2024 Nov 25.
6
Application of a Neural Network Whole Transcriptome-Based Pan-Cancer Method for Diagnosis of Primary and Metastatic Cancers.基于神经网络全转录组的泛癌方法在原发性和转移性癌症诊断中的应用。
JAMA Netw Open. 2019 Apr 5;2(4):e192597. doi: 10.1001/jamanetworkopen.2019.2597.
7
An artificial intelligence-assisted diagnosis modeling software (AIMS) platform based on medical images and machine learning: a development and validation study.基于医学图像和机器学习的人工智能辅助诊断建模软件(AIMS)平台:一项开发与验证研究。
Quant Imaging Med Surg. 2023 Nov 1;13(11):7504-7522. doi: 10.21037/qims-23-20. Epub 2023 Jul 7.
8
Development and validation of a web-based artificial intelligence prediction model to assess massive intraoperative blood loss for metastatic spinal disease using machine learning techniques.开发和验证一种基于网络的人工智能预测模型,使用机器学习技术评估转移性脊柱疾病的术中大量失血。
Spine J. 2024 Jan;24(1):146-160. doi: 10.1016/j.spinee.2023.09.001. Epub 2023 Sep 11.
9
The Impact of Information Relevancy and Interactivity on Intensivists' Trust in a Machine Learning-Based Bacteremia Prediction System: Simulation Study.基于机器学习的菌血症预测系统对重症监护医生信任的影响:模拟研究
JMIR Hum Factors. 2024 Aug 1;11:e56924. doi: 10.2196/56924.
10
Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.将机器学习中的手工特征与潜在变量相结合,以预测放射性肺损伤。
Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.

本文引用的文献

1
Precision Oncology: 2024 in Review.精准肿瘤学:2024年回顾
Cancer Discov. 2024 Dec 2;14(12):2332-2345. doi: 10.1158/2159-8290.CD-24-1476.
2
Validation and Implementation of a Somatic-Only Tumor Exome for Routine Clinical Application.仅针对肿瘤外显子的体细胞验证和实施用于常规临床应用。
J Mol Diagn. 2024 Sep;26(9):815-824. doi: 10.1016/j.jmoldx.2024.05.013. Epub 2024 Jul 6.
3
Considerations for multidisciplinary management of synchronous primary breast cancer and primary lung cancer - Analysis of thirty-one patients.
同步原发性乳腺癌和原发性肺癌的多学科管理考虑因素 - 31 例患者分析。
Thorac Cancer. 2024 May;15(14):1132-1137. doi: 10.1111/1759-7714.15284. Epub 2024 Apr 4.
4
Implementing Whole Genome Sequencing (WGS) in Clinical Practice: Advantages, Challenges, and Future Perspectives.在临床实践中实施全基因组测序(WGS):优势、挑战和未来展望。
Cells. 2024 Mar 13;13(6):504. doi: 10.3390/cells13060504.
5
Cancer of unknown primary and the «seed and soil» hypothesis.原发灶不明癌与“种子与土壤”假说。
Crit Rev Oncol Hematol. 2024 Apr;196:104297. doi: 10.1016/j.critrevonc.2024.104297. Epub 2024 Feb 11.
6
Wnt/β-catenin-driven EMT regulation in human cancers.Wnt/β-catenin 驱动的人类癌症中的 EMT 调控。
Cell Mol Life Sci. 2024 Feb 9;81(1):79. doi: 10.1007/s00018-023-05099-7.
7
Single-cell and spatial profiling identify three response trajectories to pembrolizumab and radiation therapy in triple negative breast cancer.单细胞和空间分析确定了三阴性乳腺癌对帕博利珠单抗和放射治疗的三种反应轨迹。
Cancer Cell. 2024 Jan 8;42(1):70-84.e8. doi: 10.1016/j.ccell.2023.12.012.
8
Glioblastoma vaccines: past, present, and opportunities.胶质母细胞瘤疫苗:过去、现在和未来的机遇。
EBioMedicine. 2024 Feb;100:104963. doi: 10.1016/j.ebiom.2023.104963. Epub 2024 Jan 5.
9
Targeting Tumor Heterogeneity with Neoantigen-Based Cancer Vaccines.利用基于新抗原的癌症疫苗靶向肿瘤异质性。
Cancer Res. 2024 Feb 1;84(3):353-363. doi: 10.1158/0008-5472.CAN-23-2042.
10
Analysis and Visualization of Longitudinal Genomic and Clinical Data from the AACR Project GENIE Biopharma Collaborative in cBioPortal.在 cBioPortal 中分析和可视化 AACR 项目 GENIE 生物制药协作的纵向基因组和临床数据。
Cancer Res. 2023 Dec 1;83(23):3861-3867. doi: 10.1158/0008-5472.CAN-23-0816.