Molecular Biology Division, Bhabha Atomic Research Centre, Mumbai 400085, India.
Homi Bhabha National Institute, Mumbai 400094, India.
Curr Oncol. 2023 Oct 19;30(10):9244-9261. doi: 10.3390/curroncol30100668.
Gastroenteropancreatic neuroendocrine tumors (GEP-NETs) account for 80% of gastroenteropancreatic neuroendocrine neoplasms (GEP-NENs). GEP-NETs are well-differentiated tumors, highly heterogeneous in biology and origin, and are often diagnosed at the metastatic stage. Diagnosis is commonly through clinical symptoms, histopathology, and PET-CT imaging, while molecular markers for metastasis and the primary site are unknown. Here, we report the identification of multi-gene signatures for hepatic metastasis and primary sites through analyses on RNA-SEQ datasets of pancreatic and small intestinal NETs tissue samples. Relevant gene features, identified from the normalized RNA-SEQ data using the mRMRe algorithm, were used to develop seven Machine Learning models (LDA, RF, CART, k-NN, SVM, XGBOOST, GBM). Two multi-gene random forest (RF) models classified primary and metastatic samples with 100% accuracy in training and test cohorts and >90% accuracy in an independent validation cohort. Similarly, three multi-gene RF models identified the pancreas or small intestine as the primary site with 100% accuracy in training and test cohorts, and >95% accuracy in an independent cohort. Multi-label models for concurrent prediction of hepatic metastasis and primary site returned >98.42% and >87.42% accuracies on training and test cohorts, respectively. A robust molecular signature to predict liver metastasis or the primary site for GEP-NETs is reported for the first time and could complement the clinical management of GEP-NETs.
胃肠胰神经内分泌肿瘤(GEP-NETs)占胃肠胰神经内分泌肿瘤(GEP-NENs)的 80%。GEP-NETs 是分化良好的肿瘤,在生物学和起源上高度异质,并且通常在转移阶段被诊断。诊断通常通过临床症状、组织病理学和 PET-CT 成像进行,而转移和原发部位的分子标志物尚不清楚。在这里,我们通过对胰腺和小肠 NET 组织样本的 RNA-SEQ 数据集进行分析,报告了鉴定肝转移和原发部位的多基因特征。使用 mRMRe 算法从归一化 RNA-SEQ 数据中识别出的相关基因特征,用于开发七种机器学习模型(LDA、RF、CART、k-NN、SVM、XGBOOST、GBM)。两个多基因随机森林(RF)模型在训练和测试队列中以 100%的准确率对原发和转移样本进行分类,在独立验证队列中的准确率>90%。同样,三个多基因 RF 模型在训练和测试队列中以 100%的准确率识别胰腺或小肠为原发部位,在独立队列中的准确率>95%。用于同时预测肝转移和原发部位的多标签模型在训练和测试队列上的准确率分别>98.42%和>87.42%。首次报道了用于预测 GEP-NET 肝转移或原发部位的稳健分子特征,可补充 GEP-NET 的临床管理。