• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于权重的特征选择(WBFS)算法利用蛋白质组学数据对肺癌亚型进行分类。

The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data.

作者信息

Wang Yangyang, Gao Xiaoguang, Ru Xinxin, Sun Pengzhan, Wang Jihan

机构信息

School of Electronics and Information, Northwestern Polytechnical University, Xi'an 710129, China.

Xi'an Key Laboratory of Stem Cell and Regenerative Medicine, Institute of Medical Research, Northwestern Polytechnical University, Xi'an 710072, China.

出版信息

Entropy (Basel). 2023 Jun 29;25(7):1003. doi: 10.3390/e25071003.

DOI:10.3390/e25071003
PMID:37509950
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10378569/
Abstract

Feature selection plays an important role in improving the performance of classification or reducing the dimensionality of high-dimensional datasets, such as high-throughput genomics/proteomics data in bioinformatics. As a popular approach with computational efficiency and scalability, information theory has been widely incorporated into feature selection. In this study, we propose a unique weight-based feature selection (WBFS) algorithm that assesses selected features and candidate features to identify the key protein biomarkers for classifying lung cancer subtypes from The Cancer Proteome Atlas (TCPA) database and we further explored the survival analysis between selected biomarkers and subtypes of lung cancer. Results show good performance of the combination of our WBFS method and Bayesian network for mining potential biomarkers. These candidate signatures have valuable biological significance in tumor classification and patient survival analysis. Taken together, this study proposes the WBFS method that helps to explore candidate biomarkers from biomedical datasets and provides useful information for tumor diagnosis or therapy strategies.

摘要

特征选择在提高分类性能或降低高维数据集(如生物信息学中的高通量基因组学/蛋白质组学数据)的维度方面发挥着重要作用。作为一种具有计算效率和可扩展性的流行方法,信息论已被广泛应用于特征选择。在本研究中,我们提出了一种独特的基于权重的特征选择(WBFS)算法,该算法评估已选特征和候选特征,以从癌症蛋白质组图谱(TCPA)数据库中识别用于分类肺癌亚型的关键蛋白质生物标志物,并且我们进一步探索了所选生物标志物与肺癌亚型之间的生存分析。结果表明,我们的WBFS方法与贝叶斯网络相结合在挖掘潜在生物标志物方面表现良好。这些候选特征在肿瘤分类和患者生存分析中具有重要的生物学意义。综上所述,本研究提出了WBFS方法,有助于从生物医学数据集中探索候选生物标志物,并为肿瘤诊断或治疗策略提供有用信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e5/10378569/dd64e6dd8c1c/entropy-25-01003-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e5/10378569/a30c579de41c/entropy-25-01003-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e5/10378569/2d32dd42dacd/entropy-25-01003-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e5/10378569/f7ee998e0250/entropy-25-01003-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e5/10378569/51793600d252/entropy-25-01003-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e5/10378569/dd64e6dd8c1c/entropy-25-01003-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e5/10378569/a30c579de41c/entropy-25-01003-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e5/10378569/2d32dd42dacd/entropy-25-01003-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e5/10378569/f7ee998e0250/entropy-25-01003-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e5/10378569/51793600d252/entropy-25-01003-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21e5/10378569/dd64e6dd8c1c/entropy-25-01003-g005.jpg

相似文献

1
The Weight-Based Feature Selection (WBFS) Algorithm Classifies Lung Cancer Subtypes Using Proteomic Data.基于权重的特征选择(WBFS)算法利用蛋白质组学数据对肺癌亚型进行分类。
Entropy (Basel). 2023 Jun 29;25(7):1003. doi: 10.3390/e25071003.
2
Using feature selection and Bayesian network identify cancer subtypes based on proteomic data.基于蛋白质组学数据,使用特征选择和贝叶斯网络识别癌症亚型。
J Proteomics. 2023 May 30;280:104895. doi: 10.1016/j.jprot.2023.104895. Epub 2023 Apr 5.
3
Identification of protein signatures for lung cancer subtypes based on BPSO method.基于 BPSO 方法的肺癌亚型蛋白质特征识别。
PLoS One. 2023 Dec 7;18(12):e0294243. doi: 10.1371/journal.pone.0294243. eCollection 2023.
4
Functional Proteomic Profiling Analysis in Four Major Types of Gastrointestinal Cancers.四种主要胃肠道癌的功能蛋白质组学分析。
Biomolecules. 2023 Apr 20;13(4):701. doi: 10.3390/biom13040701.
5
A self-organizing deep neuro-fuzzy system approach for classification of kidney cancer subtypes using miRNA genomics data.一种使用miRNA基因组数据对肾癌亚型进行分类的自组织深度神经模糊系统方法。
Comput Methods Programs Biomed. 2021 Jul;206:106132. doi: 10.1016/j.cmpb.2021.106132. Epub 2021 Apr 27.
6
High-throughput proteomics of breast cancer subtypes: Biological characterization and multiple candidate biomarker panels to patients' stratification.乳腺癌亚型的高通量蛋白质组学:生物学特征及用于患者分层的多个候选生物标志物组
J Proteomics. 2023 Aug 15;285:104955. doi: 10.1016/j.jprot.2023.104955. Epub 2023 Jun 28.
7
Feature screening for survival trait with application to TCGA high-dimensional genomic data.基于 TCGA 高维基因组数据的生存特征筛选。
PeerJ. 2022 Mar 10;10:e13098. doi: 10.7717/peerj.13098. eCollection 2022.
8
Functional genomics and proteomics in the clinical neurosciences: data mining and bioinformatics.临床神经科学中的功能基因组学和蛋白质组学:数据挖掘与生物信息学
Prog Brain Res. 2006;158:83-108. doi: 10.1016/S0079-6123(06)58004-5.
9
A hybrid feature selection algorithm and its application in bioinformatics.一种混合特征选择算法及其在生物信息学中的应用。
PeerJ Comput Sci. 2022 Mar 22;8:e933. doi: 10.7717/peerj-cs.933. eCollection 2022.
10
Data mining techniques for cancer detection using serum proteomic profiling.利用血清蛋白质组分析进行癌症检测的数据挖掘技术
Artif Intell Med. 2004 Oct;32(2):71-83. doi: 10.1016/j.artmed.2004.03.006.

引用本文的文献

1
Utilizing Feature Selection Techniques for AI-Driven Tumor Subtype Classification: Enhancing Precision in Cancer Diagnostics.利用特征选择技术进行人工智能驱动的肿瘤亚型分类:提高癌症诊断的精度。
Biomolecules. 2025 Jan 8;15(1):81. doi: 10.3390/biom15010081.

本文引用的文献

1
Feature selection approaches identify potential plasma metabolites in postmenopausal osteoporosis patients.特征选择方法鉴定绝经后骨质疏松症患者潜在的血浆代谢物。
Metabolomics. 2022 Nov 1;18(11):86. doi: 10.1007/s11306-022-01937-0.
2
Identification of therapeutically potential targets and their ligands for the treatment of OSCC.鉴定用于治疗口腔鳞状细胞癌的具有治疗潜力的靶点及其配体。
Front Oncol. 2022 Sep 20;12:910494. doi: 10.3389/fonc.2022.910494. eCollection 2022.
3
A hybrid feature selection algorithm and its application in bioinformatics.
一种混合特征选择算法及其在生物信息学中的应用。
PeerJ Comput Sci. 2022 Mar 22;8:e933. doi: 10.7717/peerj-cs.933. eCollection 2022.
4
Predictive models for stage and risk classification in head and neck squamous cell carcinoma (HNSCC).头颈部鳞状细胞癌(HNSCC)分期及风险分类的预测模型
PeerJ. 2020 Sep 22;8:e9656. doi: 10.7717/peerj.9656. eCollection 2020.
5
Mining TCGA database for tumor mutation burden and their clinical significance in bladder cancer.从 TCGA 数据库挖掘肿瘤突变负荷及其在膀胱癌中的临床意义。
Biosci Rep. 2020 Apr 30;40(4). doi: 10.1042/BSR20194337.
6
Identification of prognostic genes in the acute myeloid leukemia immune microenvironment based on TCGA data analysis.基于 TCGA 数据分析的急性髓系白血病免疫微环境中的预后基因鉴定。
Cancer Immunol Immunother. 2019 Dec;68(12):1971-1978. doi: 10.1007/s00262-019-02408-7. Epub 2019 Oct 24.
7
Research Techniques Made Simple: Feature Selection for Biomarker Discovery.研究技巧解析:生物标志物发现中的特征选择
J Invest Dermatol. 2019 Oct;139(10):2068-2074.e1. doi: 10.1016/j.jid.2019.07.682.
8
Review of Causal Discovery Methods Based on Graphical Models.基于图形模型的因果发现方法综述
Front Genet. 2019 Jun 4;10:524. doi: 10.3389/fgene.2019.00524. eCollection 2019.
9
Abandoning the Notion of Non-Small Cell Lung Cancer.摒弃非小细胞肺癌的概念。
Trends Mol Med. 2019 Jul;25(7):585-594. doi: 10.1016/j.molmed.2019.04.012. Epub 2019 May 30.
10
A hybrid gene selection algorithm based on interaction information for microarray-based cancer classification.基于互信息的混合基因选择算法在基于微阵列的癌症分类中的应用。
PLoS One. 2019 Feb 15;14(2):e0212333. doi: 10.1371/journal.pone.0212333. eCollection 2019.