动物园：通过集成受动物启发的群体智能特征选择算法来选择转录组学和甲基组学生物标志物。

Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms.

机构信息

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China.

出版信息

Genes (Basel). 2021 Nov 18;12(11):1814. doi: 10.3390/genes12111814.

DOI:10.3390/genes12111814

PMID:34828418

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8621246/

Abstract

Biological omics data such as transcriptomes and methylomes have the inherent "large p small n" paradigm, i.e., the number of features is much larger than that of the samples. A feature selection (FS) algorithm selects a subset of the transcriptomic or methylomic biomarkers in order to build a better prediction model. The hidden patterns in the FS solution space make it challenging to achieve a feature subset with satisfying prediction performances. Swarm intelligence (SI) algorithms mimic the target searching behaviors of various animals and have demonstrated promising capabilities in selecting features with good machine learning performances. Our study revealed that different SI-based feature selection algorithms contributed complementary searching capabilities in the FS solution space, and their collaboration generated a better feature subset than the individual SI feature selection algorithms. Nine SI-based feature selection algorithms were integrated to vote for the selected features, which were further refined by the dynamic recursive feature elimination framework. In most cases, the proposed Zoo algorithm outperformed the existing feature selection algorithms on transcriptomics and methylomics datasets.

摘要

生物组学数据（如转录组和甲基组）具有固有的“大 p 小 n”范式，即特征数量远远大于样本数量。特征选择（FS）算法选择转录组或甲基组生物标志物的子集，以构建更好的预测模型。FS 解决方案空间中的隐藏模式使得很难获得具有令人满意的预测性能的特征子集。群体智能（SI）算法模拟了各种动物的目标搜索行为，在选择具有良好机器学习性能的特征方面表现出了有前景的能力。我们的研究表明，基于不同 SI 的特征选择算法在 FS 解决方案空间中贡献了互补的搜索能力，它们的协作生成了比单个 SI 特征选择算法更好的特征子集。九个基于 SI 的特征选择算法被整合起来为选定的特征投票，这些特征进一步通过动态递归特征消除框架进行细化。在大多数情况下，与现有的特征选择算法相比，所提出的 Zoo 算法在转录组和甲基组数据集上表现更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afdf/8621246/b79dbccffe77/genes-12-01814-g001.jpg

相似文献

Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms.动物园：通过集成受动物启发的群体智能特征选择算法来选择转录组学和甲基组学生物标志物。

Genes (Basel). 2021 Nov 18;12(11):1814. doi: 10.3390/genes12111814.

Compact cancer biomarkers discovery using a swarm intelligence feature selection algorithm.利用群体智能特征选择算法发现紧凑型癌症生物标志物。

Comput Biol Chem. 2010 Aug;34(4):244-50. doi: 10.1016/j.compbiolchem.2010.08.003. Epub 2010 Sep 9.

Novel Improved Salp Swarm Algorithm: An Application for Feature Selection.新型改进沙蚕群算法：在特征选择中的应用。

Sensors (Basel). 2022 Feb 22;22(5):1711. doi: 10.3390/s22051711.

An OMIC biomarker detection algorithm TriVote and its application in methylomic biomarker detection.一种 OMIC 生物标志物检测算法 TriVote 及其在甲基化组生物标志物检测中的应用。

Epigenomics. 2018 Apr;10(4):335-347. doi: 10.2217/epi-2017-0097. Epub 2018 Jan 19.

RIFS: a randomly restarted incremental feature selection algorithm.RIFS：一种随机重启的增量特征选择算法。

Sci Rep. 2017 Oct 12;7(1):13013. doi: 10.1038/s41598-017-13259-6.

Feature Selection by Hybrid Brain Storm Optimization Algorithm for COVID-19 Classification.基于混合脑暴优化算法的 COVID-19 分类特征选择。

J Comput Biol. 2022 Jun;29(6):515-529. doi: 10.1089/cmb.2021.0256. Epub 2022 Apr 19.

RHSOFS: Feature Selection Using the Rock Hyrax Swarm Optimization Algorithm for Credit Card Fraud Detection System.RHSOFS：使用岩蹄兔群优化算法进行信用卡欺诈检测系统的特征选择。

Sensors (Basel). 2022 Nov 30;22(23):9321. doi: 10.3390/s22239321.

A Novel Rank Aggregation-Based Hybrid Multifilter Wrapper Feature Selection Method in Software Defect Prediction.一种新颖的基于排序聚合的混合多过滤器包装特征选择方法在软件缺陷预测中。

Comput Intell Neurosci. 2021 Nov 24;2021:5069016. doi: 10.1155/2021/5069016. eCollection 2021.

Medical data mining in sentiment analysis based on optimized swarm search feature selection.基于优化群体搜索特征选择的情感分析中的医学数据挖掘

Australas Phys Eng Sci Med. 2018 Dec;41(4):1087-1100. doi: 10.1007/s13246-018-0674-3. Epub 2018 Sep 11.

A dynamic recursive feature elimination framework (dRFE) to further refine a set of OMIC biomarkers.一种用于进一步优化一组组学生物标志物的动态递归特征消除框架（dRFE）。

Bioinformatics. 2021 Aug 9;37(15):2183-2189. doi: 10.1093/bioinformatics/btab055.

引用本文的文献

Multiomics with Evolutionary Computation to Identify Molecular and Module Biomarkers for Early Diagnosis and Treatment of Complex Disease.结合多组学与进化计算以识别复杂疾病早期诊断和治疗的分子及模块生物标志物。

Genes (Basel). 2025 Feb 20;16(3):244. doi: 10.3390/genes16030244.

Machine Learning Methods for Survival Analysis with Clinical and Transcriptomics Data of Breast Cancer.机器学习方法在乳腺癌临床和转录组学数据中的生存分析。

Methods Mol Biol. 2023;2553:325-393. doi: 10.1007/978-1-0716-2617-7_16.

本文引用的文献

An efficient alpha seeding method for optimized extreme learning machine-based feature selection algorithm.一种用于优化基于极端学习机的特征选择算法的高效 alpha 种子生成方法。

Comput Biol Med. 2021 Jul;134:104505. doi: 10.1016/j.compbiomed.2021.104505. Epub 2021 May 23.

EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models.EnRank：一种基于特征选择和机器学习模型检测肺动脉高压生物标志物的集成方法。

Front Genet. 2021 Apr 27;12:636429. doi: 10.3389/fgene.2021.636429. eCollection 2021.

RIFS2D: A two-dimensional version of a randomly restarted incremental feature selection algorithm with an application for detecting low-ranked biomarkers.RIFS2D：一种随机重启增量特征选择算法的二维版本，应用于检测低阶生物标志物。

Comput Biol Med. 2021 Jun;133:104405. doi: 10.1016/j.compbiomed.2021.104405. Epub 2021 Apr 17.

Distant metastasis time to event analysis with CNNs in independent head and neck cancer cohorts.基于卷积神经网络的独立头颈部肿瘤队列远处转移时间事件分析。

Sci Rep. 2021 Mar 19;11(1):6418. doi: 10.1038/s41598-021-85671-y.

An efficient framework for automated screening of Clinically Significant Macular Edema.用于临床显著黄斑水肿自动筛查的高效框架。

Comput Biol Med. 2021 Mar;130:104128. doi: 10.1016/j.compbiomed.2020.104128. Epub 2020 Nov 24.

A dynamic recursive feature elimination framework (dRFE) to further refine a set of OMIC biomarkers.一种用于进一步优化一组组学生物标志物的动态递归特征消除框架（dRFE）。

Bioinformatics. 2021 Aug 9;37(15):2183-2189. doi: 10.1093/bioinformatics/btab055.

Region of Interest Selection for Functional Features.功能特征的感兴趣区域选择

Neurocomputing (Amst). 2021 Jan;422:235-244. doi: 10.1016/j.neucom.2020.10.009. Epub 2020 Oct 14.

FeSTwo, a two-step feature selection algorithm based on feature engineering and sampling for the chronological age regression problem.FeSTwo，一种基于特征工程和采样的两步特征选择算法，用于解决年龄回归问题。

Comput Biol Med. 2020 Oct;125:104008. doi: 10.1016/j.compbiomed.2020.104008. Epub 2020 Sep 26.

Clinical data classification using an enhanced SMOTE and chaotic evolutionary feature selection.使用增强型SMOTE和混沌进化特征选择的临床数据分类

Comput Biol Med. 2020 Nov;126:103991. doi: 10.1016/j.compbiomed.2020.103991. Epub 2020 Sep 18.

GeFeS: A generalized wrapper feature selection approach for optimizing classification performance.GeFeS：一种用于优化分类性能的广义包装特征选择方法。

Comput Biol Med. 2020 Oct;125:103974. doi: 10.1016/j.compbiomed.2020.103974. Epub 2020 Aug 20.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

动物园：通过集成受动物启发的群体智能特征选择算法来选择转录组学和甲基组学生物标志物。

Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献