Suppr超能文献

统计活检:一种用于癌症早期检测的新兴筛查方法。

Statistical biopsy: An emerging screening approach for early detection of cancers.

作者信息

Hart Gregory R, Yan Vanessa, Nartowt Bradley J, Roffman David A, Stark Gigi, Muhammad Wazir, Deng Jun

机构信息

Institute for Disease Modeling, Global Health Division, Bill and Melinda Gates Foundation, Seattle, WA, United States.

Department of Therapeutic Radiology, Yale University, New Haven, CT, United States.

出版信息

Front Artif Intell. 2023 Jan 20;5:1059093. doi: 10.3389/frai.2022.1059093. eCollection 2022.

Abstract

Despite large investment cancer continues to be a major source of mortality and morbidity throughout the world. Traditional methods of detection and diagnosis such as biopsy and imaging, tend to be expensive and have risks of complications. As data becomes more abundant and machine learning continues advancing, it is natural to ask how they can help solve some of these problems. In this paper we show that using a person's personal health data it is possible to predict their risk for a wide variety of cancers. We dub this process a "statistical biopsy." Specifically, we train two neural networks, one predicting risk for 16 different cancer types in females and the other predicting risk for 15 different cancer types in males. The networks were trained as binary classifiers identifying individuals that were diagnosed with the different cancer types within 5 years of joining the PLOC trial. However, rather than use the binary output of the classifiers we show that the continuous output can instead be used as a cancer risk allowing a holistic look at an individual's cancer risks. We tested our multi-cancer model on the UK Biobank dataset showing that for most cancers the predictions generalized well and that looking at multiple cancer risks at once from personal health data is a possibility. While the statistical biopsy will not be able to replace traditional biopsies for diagnosing cancers, we hope there can be a shift of paradigm in how statistical models are used in cancer detection moving to something more powerful and more personalized than general population screening guidelines.

摘要

尽管投入巨大,但癌症仍是全球主要的死亡和发病原因。传统的检测和诊断方法,如活检和成像,往往成本高昂且存在并发症风险。随着数据日益丰富以及机器学习不断发展,自然而然会思考它们如何能帮助解决其中一些问题。在本文中,我们表明利用个人健康数据能够预测其患多种癌症的风险。我们将这一过程称为“统计活检”。具体而言,我们训练了两个神经网络,一个用于预测女性16种不同癌症类型的风险,另一个用于预测男性15种不同癌症类型的风险。这些网络被训练为二元分类器,以识别在加入PLOC试验后5年内被诊断出患有不同癌症类型的个体。然而,我们并非使用分类器的二元输出,而是表明连续输出可被用作癌症风险,从而全面审视个体的癌症风险。我们在英国生物银行数据集上测试了我们的多癌症模型,结果表明对于大多数癌症,预测具有良好的泛化性,并且从个人健康数据中一次性查看多种癌症风险是可行的。虽然统计活检无法取代用于诊断癌症的传统活检,但我们希望在癌症检测中使用统计模型的范式能够发生转变,转向比一般人群筛查指南更强大、更个性化的方向。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0ea/9895959/18440775d9f7/frai-05-1059093-g0001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验