EMT network-based feature selection improves prognosis prediction in lung adenocarcinoma.

Affiliations

Zuse Institute Berlin, Berlin, Germany.

Dept of mathematics and computer science, Freie Universität Berlin, Berlin, Germany.

Publication information

PLoS One. 2019 Jan 31;14(1):e0204186. doi: 10.1371/journal.pone.0204186. eCollection 2019.

DOI:10.1371/journal.pone.0204186
PMID:30703089
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6354965/

Abstract

Various feature selection algorithms have been proposed to identify cancer prognostic biomarkers. In recent years, however, their reproducibility has been criticized. The performance of feature selection algorithms has been shown to depend on the datasets, the underlying networks, and the evaluation metrics. One of the causes is the curse of dimensionality, which makes it hard to select features that generalize well to independent data. Even the integration of biological networks does not mitigate this issue, because the networks are large and many of their components are not relevant to the phenotype of interest. With the availability of multi-omics data, integrative approaches are being developed to build more robust predictive models. In this scenario, the higher data dimensions create greater challenges. We proposed a phenotype-relevant network-based feature selection (PRNFS) framework and demonstrated its advantages in lung cancer prognosis prediction. We constructed cancer-prognosis-relevant networks based on epithelial-mesenchymal transition (EMT) and integrated them with different types of omics data for feature selection. With less than 2.5% of the total dimensionality, we obtained EMT prognostic signatures that achieved remarkable prediction performance (average AUC values >0.8), very significant sample stratifications, and meaningful biological interpretations. In addition to finding EMT signatures at different omics data levels, we combined these single-omics signatures into multi-omics signatures, which significantly improved sample stratification. Both single- and multi-omics EMT signatures were tested on independent multi-omics lung cancer datasets, and significant sample stratifications were obtained.
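
The core idea in the abstract, restricting candidate features to a phenotype-relevant network before selecting among them, can be illustrated with a minimal sketch. This is not the authors' PRNFS implementation. It assumes a binary outcome label instead of survival times, a plain gene set instead of a network with edges, and a univariate AUC ranking as the selection step. The function names, the toy expression values, and the choice of example genes are all hypothetical.

```python
def auc(scores, labels):
    """Rank-based AUC: fraction of (positive, negative) pairs in which the
    positive sample has the higher score; ties count as half a win."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def select_network_features(X, labels, genes, network_genes, k):
    """Keep only genes that belong to the phenotype-relevant gene set, rank
    them by how far their univariate AUC is from 0.5, and return the top k."""
    candidates = [j for j, g in enumerate(genes) if g in network_genes]

    def strength(j):
        column = [row[j] for row in X]
        return abs(auc(column, labels) - 0.5)

    ranked = sorted(candidates, key=strength, reverse=True)
    return [genes[j] for j in ranked[:k]]


# Toy data (hypothetical values): rows are samples, columns are genes.
genes = ["CDH1", "VIM", "SNAI1", "GAPDH", "ACTB"]
emt_gene_set = {"CDH1", "VIM", "SNAI1"}  # illustrative stand-in for an EMT network
labels = [1, 1, 1, 0, 0, 0]              # 1 = poor outcome, 0 = good outcome
X = [
    [1.0, 5.0, 3.0, 9.0, 2.0],
    [2.0, 6.0, 1.0, 8.0, 2.5],
    [1.5, 5.5, 2.0, 9.5, 2.2],
    [4.0, 2.0, 2.5, 1.0, 2.1],
    [5.0, 1.0, 1.5, 2.0, 2.4],
    [4.5, 1.5, 3.5, 1.5, 2.3],
]

selected = select_network_features(X, labels, genes, emt_gene_set, k=2)
print(selected)
```

In this toy example GAPDH separates the two groups perfectly but is never considered, because it lies outside the gene set. That is the trade-off the sketch is meant to show: the search space shrinks to a small, interpretable subset at the cost of ignoring signal outside the chosen network.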

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5398/6354965/0cb68c6ea07c/pone.0204186.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5398/6354965/866f8162b00c/pone.0204186.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5398/6354965/2ff777a69230/pone.0204186.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5398/6354965/3e1bcb3a56e1/pone.0204186.g004.jpg
