Suppr超能文献

基于组特异性网络的特定特征识别(SFR-GSN):一种癌症分期的生物标志物识别模型。

Specific feature recognition on group specific networks (SFR-GSN): a biomarker identification model for cancer stages.

作者信息

Chen Bolin, Wang Yuxin, Zhang Jinlei, Han Yourui, Benhammouda Hamza, Bian Jun, Kang Ruiming, Shang Xuequn

机构信息

School of Computer Science, Northwestern Polytechnical University, Xi'an, Shaanxi, China.

Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, Shaanxi, China.

出版信息

Front Genet. 2024 May 23;15:1407072. doi: 10.3389/fgene.2024.1407072. eCollection 2024.

Abstract

BACKGROUND AND OBJECTIVE

Accurate identification of cancer stages is challenging due to the complexity and heterogeneity of the disease. Current clinical diagnosis methods primarily rely on phenotypic observations, which may not capture early molecular-level changes accurately.

METHODS

In this study, a novel biomarker recognition method was proposed tailored for cancer stages by considering the change of gene expression relationships. Utilizing the sample-specific information and protein-protein interaction networks, the group specific networks were constructed to address the limited specificity of potential biomarkers. Then, a specific feature recognition method was proposed based on these group specific networks, which employed the random forest algorithm for initial screening followed by a recursive feature elimination process to identify the optimal biomarker subset. During exploring optimal results, a strategy termed the Cost-Benefit Ratio, was devised to facilitate the identification of stage-specific biomarkers.

RESULTS

Comparative experiments were conducted on lung adenocarcinoma and breast cancer datasets to validate the method's efficacy and generalizability. The results showed that the identified biomarkers were highly stage-specific, and the F1 scores for predicting cancer stages were significantly improved. For the lung adenocarcinoma dataset, the F1 score reached 97.68%, and for the breast cancer dataset, it achieved 96.87%. These results significantly surpassed those of three conventional methods in terms of F1 scores. Moreover, from the perspective of biological functions, the biomarkers were proved playing an important role in cancer stage-evolution.

CONCLUSION

The proposed method demonstrated its effectiveness in identifying stage-related biomarkers. By using these biomarkers as features, accurate prediction of cancer stages was achieved. Furthermore, the method exhibited potential for biomarker identification in subtype analyses, offering novel perspectives for cancer prognosis.

摘要

背景与目的

由于癌症疾病的复杂性和异质性,准确识别癌症阶段具有挑战性。当前的临床诊断方法主要依赖于表型观察,这可能无法准确捕捉早期分子水平的变化。

方法

在本研究中,通过考虑基因表达关系的变化,提出了一种针对癌症阶段量身定制的新型生物标志物识别方法。利用样本特异性信息和蛋白质-蛋白质相互作用网络,构建组特异性网络以解决潜在生物标志物特异性有限的问题。然后,基于这些组特异性网络提出了一种特定特征识别方法,该方法采用随机森林算法进行初始筛选,随后进行递归特征消除过程以识别最佳生物标志物子集。在探索最佳结果的过程中,设计了一种称为成本效益比的策略,以促进阶段特异性生物标志物的识别。

结果

在肺腺癌和乳腺癌数据集上进行了对比实验,以验证该方法的有效性和通用性。结果表明,所识别的生物标志物具有高度的阶段特异性,预测癌症阶段的F1分数显著提高。对于肺腺癌数据集,F1分数达到97.68%,对于乳腺癌数据集,达到96.87%。这些结果在F1分数方面显著超过了三种传统方法。此外,从生物学功能的角度来看,这些生物标志物被证明在癌症阶段演变中发挥着重要作用。

结论

所提出的方法在识别与阶段相关的生物标志物方面证明了其有效性。通过将这些生物标志物用作特征,实现了对癌症阶段的准确预测。此外,该方法在亚型分析中具有生物标志物识别的潜力,为癌症预后提供了新的视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/adf9/11153737/60b8511dfa6a/fgene-15-1407072-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验