Suppr超能文献

建立基于病理特征的机器学习模型以预测结肠癌中CD276(B7-H3)的表达。

Establishment of a pathomic-based machine learning model to predict CD276 (B7-H3) expression in colon cancer.

作者信息

Li Jia, Wang Dongxu, Zhang Chenxin

机构信息

Department of Gastroenterology, The 983rd Hospital of Joint Logistic Support Force of PLA, Tianjin, China.

Department of General Surgery, The 983rd Hospital of Joint Logistic Support Force of PLA, Tianjin, China.

出版信息

Front Oncol. 2024 Jan 8;13:1232192. doi: 10.3389/fonc.2023.1232192. eCollection 2023.

Abstract

CD276 is a promising prognostic indicator and an attractive therapeutic target in various malignancies. However, current methods for CD276 detection are time-consuming and expensive, limiting extensive studies and applications of CD276. We aimed to develop a pathomic model for CD276 prediction from H&E-stained pathological images, and explore the underlying mechanism of the pathomic features by associating the pathomic model with transcription profiles. A dataset of colon adenocarcinoma (COAD) patients was retrieved from the Cancer Genome Atlas (TCGA) database. The dataset was divided into the training and validation sets according to the ratio of 8:2 by a stratified sampling method. Using the gradient boosting machine (GBM) algorithm, we established a pathomic model to predict CD276 expression in COAD. Univariate and multivariate Cox regression analyses were conducted to assess the predictive performance of the pathomic model for overall survival in COAD. Gene Set Enrichment Analysis (GESA) was performed to explore the underlying biological mechanisms of the pathomic model. The pathomic model formed by three pathomic features for CD276 prediction showed an area under the curve (AUC) of 0.833 (95%CI: 0.784-0.882) in the training set and 0.758 (95%CI: 0.637-0.878) in the validation set, respectively. The calibration curves and Hosmer-Lemeshow goodness of fit test showed that the prediction probability of high/low expression of CD276 was in favorable agreement with the real situation in both the training and validation sets (=0.176 and 0.255, respectively). The DCA curves suggested that the pathomic model acquired high clinical benefit. All the subjects were categorized into high pathomic score (PS) (PS-H) and low PS (PS-L) groups according to the cutoff value of PS. Univariate and multivariate Cox regression analysis indicated that PS was a risk factor for overall survival in COAD. Furthermore, through GESA analysis, we found several immune and inflammatory-related pathways and genes were associated with the pathomic model. We constructed a pathomics-based machine learning model for CD276 prediction directly from H&E-stained images in COAD. Through integrated analysis of the pathomic model and transcriptomics, the interpretability of the pathomic model provide a theoretical basis for further hypothesis and experimental research.

摘要

CD276是一种很有前景的预后指标,也是各种恶性肿瘤中一个有吸引力的治疗靶点。然而,目前检测CD276的方法既耗时又昂贵,限制了对CD276的广泛研究和应用。我们旨在从苏木精-伊红(H&E)染色的病理图像中开发一种用于预测CD276的病理组学模型,并通过将病理组学模型与转录谱相关联来探索病理组学特征的潜在机制。从癌症基因组图谱(TCGA)数据库中检索了结肠腺癌(COAD)患者的数据集。采用分层抽样方法,按照8:2的比例将数据集分为训练集和验证集。使用梯度提升机(GBM)算法,我们建立了一个病理组学模型来预测COAD中CD276的表达。进行单变量和多变量Cox回归分析,以评估病理组学模型对COAD总生存期的预测性能。进行基因集富集分析(GESA)以探索病理组学模型的潜在生物学机制。由三个用于预测CD276的病理组学特征构成的病理组学模型在训练集中的曲线下面积(AUC)为0.833(95%CI:0.784-0.882),在验证集中为0.758(95%CI:0.637-0.878)。校准曲线和Hosmer-Lemeshow拟合优度检验表明,CD276高/低表达的预测概率与训练集和验证集中的实际情况均高度吻合(分别为=0.17 和0.255)。决策曲线分析(DCA)表明该病理组学模型具有较高的临床效益。根据病理组学评分(PS)的临界值,将所有受试者分为高病理组学评分(PS-H)和低PS(PS-L)组。单变量和多变量Cox回归分析表明,PS是COAD总生存期的一个危险因素。此外,通过GESA分析,我们发现一些免疫和炎症相关通路及基因与病理组学模型相关。我们直接从COAD的H&E染色图像构建了一个基于病理组学的用于预测CD276的机器学习模型。通过对病理组学模型和转录组学的综合分析,病理组学模型的可解释性为进一步的假设和实验研究提供了理论依据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a790/10802857/7adf59bf88fa/fonc-13-1232192-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验