一个基于网络的用于分析液体活检数据的自动化机器学习平台。

A web-based automated machine learning platform to analyze liquid biopsy data.

作者信息

Shen Hanfei, Liu Tony, Cui Jesse, Borole Piyush, Benjamin Ari, Kording Konrad, Issadore David

机构信息

Department of Bioengineering, University of Pennsylvania, Philadelphia, PA 19104, USA.

出版信息

Lab Chip. 2020 Jun 21;20(12):2166-2174. doi: 10.1039/d0lc00096e. Epub 2020 May 18.

DOI:10.1039/d0lc00096e

PMID:32420563

Abstract

Liquid biopsy (LB) technologies continue to improve in sensitivity, specificity, and multiplexing and can measure an ever growing library of disease biomarkers. However, clinical interpretation of the increasingly large sets of data these technologies generate remains a challenge. Machine learning is a popular approach to discover and detect signatures of disease. However, limited machine learning expertise in the LB field has kept the discipline from fully leveraging these tools and risks improper analyses and irreproducible results. In this paper, we develop a web-based automated machine learning tool tailored specifically for LB, where machine learning models can be built without the user's input. We also incorporate a differential privacy algorithm, designed to limit the effects of overfitting that can arise from users iteratively developing a panel with feedback from our platform. We validate our approach by performing a meta-analysis on 11 published LB datasets, and found that we had similar or better performance compared to those reported in the literature. Moreover, we show that our platform's performance improved when incorporating information from prior LB datasets, suggesting that this approach can continue to improve with increased access to LB data. Finally, we show that by using our platform the results achieved in the literature can be matched using 40% of the number of subjects in the training set, potentially reducing study cost and time. This self-improving and overfitting-resistant automatic machine learning platform provides a new standard that can be used to validate machine learning works in the LB field.

摘要

液体活检（LB）技术在灵敏度、特异性和多重检测方面不断改进，能够检测越来越多的疾病生物标志物库。然而，对这些技术所产生的日益庞大的数据集进行临床解读仍然是一项挑战。机器学习是发现和检测疾病特征的常用方法。然而，LB领域有限的机器学习专业知识阻碍了该学科充分利用这些工具，存在分析不当和结果不可重复的风险。在本文中，我们开发了一种专门为LB量身定制的基于网络的自动化机器学习工具，无需用户输入即可构建机器学习模型。我们还纳入了一种差分隐私算法，旨在限制因用户根据我们平台的反馈迭代开发一个检测组而可能出现的过拟合影响。我们通过对11个已发表的LB数据集进行荟萃分析来验证我们的方法，发现我们的性能与文献报道的相似或更好。此外，我们表明，纳入先前LB数据集的信息时，我们平台的性能有所提高，这表明随着获取LB数据的增加，这种方法可以持续改进。最后，我们表明，使用我们的平台，在训练集中使用40%的受试者数量就能达到文献中的结果，这有可能降低研究成本和时间。这个自我改进且抗过拟合的自动机器学习平台提供了一个可用于验证LB领域机器学习工作的新标准。

相似文献

A web-based automated machine learning platform to analyze liquid biopsy data.一个基于网络的用于分析液体活检数据的自动化机器学习平台。

Lab Chip. 2020 Jun 21;20(12):2166-2174. doi: 10.1039/d0lc00096e. Epub 2020 May 18.

Machine learning to detect signatures of disease in liquid biopsies - a user's guide.机器学习在液体活检中检测疾病特征——使用指南。

Lab Chip. 2018 Jan 30;18(3):395-405. doi: 10.1039/c7lc00955k.

Is There a Role for Machine Learning in Liquid Biopsy for Brain Tumors? A Systematic Review.机器学习在脑肿瘤液体活检中的作用？系统综述。

Int J Mol Sci. 2023 Jun 3;24(11):9723. doi: 10.3390/ijms24119723.

KETOS: Clinical decision support and machine learning as a service - A training and deployment platform based on Docker, OMOP-CDM, and FHIR Web Services.KETOS：临床决策支持和机器学习即服务 - 基于 Docker、OMOP-CDM 和 FHIR Web Services 的培训和部署平台。

PLoS One. 2019 Oct 3;14(10):e0223010. doi: 10.1371/journal.pone.0223010. eCollection 2019.

Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测：机器学习在 1 型糖尿病中的应用。

Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.

Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning.老年人日常对话中的社会怀旧：使用自然语言处理和机器学习的自动检测。

J Med Internet Res. 2020 Sep 15;22(9):e19133. doi: 10.2196/19133.

BioSeq-Analysis: a platform for DNA, RNA and protein sequence analysis based on machine learning approaches.生物序列分析：一个基于机器学习方法的 DNA、RNA 和蛋白质序列分析平台。

Brief Bioinform. 2019 Jul 19;20(4):1280-1294. doi: 10.1093/bib/bbx165.

Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques.使用机器学习和深度学习技术评估白内障手术视频中的相位自动识别。

JAMA Netw Open. 2019 Apr 5;2(4):e191860. doi: 10.1001/jamanetworkopen.2019.1860.

Beyond multidrug resistance: Leveraging rare variants with machine and statistical learning models in Mycobacterium tuberculosis resistance prediction.超越多药耐药性：利用机器和统计学习模型在结核分枝杆菌耐药性预测中的罕见变异。

EBioMedicine. 2019 May;43:356-369. doi: 10.1016/j.ebiom.2019.04.016. Epub 2019 Apr 29.

Systematic Poisoning Attacks on and Defenses for Machine Learning in Healthcare.系统中毒攻击与机器学习在医疗保健中的防御

IEEE J Biomed Health Inform. 2015 Nov;19(6):1893-905. doi: 10.1109/JBHI.2014.2344095. Epub 2014 Jul 30.

引用本文的文献

miRNA panel from HER2+ and CD24+ plasma extracellular vesicle subpopulations as biomarkers of early-stage breast cancer.来自HER2+和CD24+血浆细胞外囊泡亚群的miRNA检测作为早期乳腺癌的生物标志物

Breast Cancer Res. 2025 May 22;27(1):90. doi: 10.1186/s13058-025-02029-2.

Role of AI in empowering and redefining the oncology care landscape: perspective from a developing nation.人工智能在赋能和重新定义肿瘤护理格局中的作用：来自一个发展中国家的视角。

Front Digit Health. 2025 Mar 4;7:1550407. doi: 10.3389/fdgth.2025.1550407. eCollection 2025.

Liquid biopsy into the clinics: Current evidence and future perspectives.液体活检进入临床：当前证据与未来展望。

J Liq Biopsy. 2024 Feb 11;4:100146. doi: 10.1016/j.jlb.2024.100146. eCollection 2024 Jun.

Does circulating tumor DNA apply as a reliable biomarker for the diagnosis and prognosis of head and neck squamous cell carcinoma?循环肿瘤DNA能否作为头颈部鳞状细胞癌诊断和预后的可靠生物标志物？

Discov Oncol. 2024 Sep 11;15(1):427. doi: 10.1007/s12672-024-01308-2.

Artificial Intelligence in Cancer Diagnosis: A Game-Changer in Healthcare.癌症诊断中的人工智能：医疗保健领域的变革者。

Curr Pharm Biotechnol. 2024 Jun 6. doi: 10.2174/0113892010298852240528123911.

Innovative Drug Modalities for the Treatment of Advanced Prostate Cancer.用于治疗晚期前列腺癌的创新药物模式

Diseases. 2024 May 2;12(5):87. doi: 10.3390/diseases12050087.

Computational kinematics of dance: distinguishing hip hop genres.舞蹈的计算运动学：区分嘻哈舞种

Front Robot AI. 2024 May 2;11:1295308. doi: 10.3389/frobt.2024.1295308. eCollection 2024.

Emerging Immunotherapy Approaches for Treating Prostate Cancer.新兴的免疫疗法在前列腺癌治疗中的应用。

Int J Mol Sci. 2023 Sep 20;24(18):14347. doi: 10.3390/ijms241814347.

Advances in novel strategies for isolation, characterization, and analysis of CTCs and ctDNA.循环肿瘤细胞（CTCs）和循环肿瘤DNA（ctDNA）的分离、表征及分析新策略的进展

Ther Adv Med Oncol. 2023 Sep 7;15:17588359231192401. doi: 10.1177/17588359231192401. eCollection 2023.

Brain-derived extracellular vesicles as serologic markers of brain injury following cardiac arrest: A pilot feasibility study.脑源性细胞外囊泡作为心搏骤停后脑损伤的血清标志物：一项初步可行性研究。

Resuscitation. 2023 Oct;191:109937. doi: 10.1016/j.resuscitation.2023.109937. Epub 2023 Aug 15.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一个基于网络的用于分析液体活检数据的自动化机器学习平台。

A web-based automated machine learning platform to analyze liquid biopsy data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献