Suppr超能文献

dRFEtools:组学的动态递归特征消除。

dRFEtools: dynamic recursive feature elimination for omics.

机构信息

Lieber Institute for Brain Development, Baltimore, MD 21205, United States.

Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, United States.

出版信息

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad513.

Abstract

MOTIVATION

Advances in technology have generated larger omics datasets with potential applications for machine learning. In many datasets, however, cost and limited sample availability result in an excessively higher number of features as compared to observations. Moreover, biological processes are associated with networks of core and peripheral genes, while traditional feature selection approaches capture only core genes.

RESULTS

To overcome these limitations, we present dRFEtools that implements dynamic recursive feature elimination (RFE), reducing computational time with high accuracy compared to standard RFE, expanding dynamic RFE to regression algorithms, and outputting the subsets of features that hold predictive power with and without peripheral features. dRFEtools integrates with scikit-learn (the popular Python machine learning platform) and thus provides new opportunities for dynamic RFE in large-scale omics data while enhancing its interpretability.

AVAILABILITY AND IMPLEMENTATION

dRFEtools is freely available on PyPI at https://pypi.org/project/drfetools/ or on GitHub https://github.com/LieberInstitute/dRFEtools, implemented in Python 3, and supported on Linux, Windows, and Mac OS.

摘要

动机

技术的进步产生了具有潜在机器学习应用的更大规模组学数据集。然而,在许多数据集中,成本和有限的样本可用性导致特征数量相对于观测值过高。此外,生物过程与核心和外围基因网络相关联,而传统的特征选择方法仅捕获核心基因。

结果

为了克服这些限制,我们提出了 dRFEtools,它实现了动态递归特征消除 (RFE),与标准 RFE 相比,提高了准确性并降低了计算时间,将动态 RFE 扩展到回归算法,并输出具有和不具有外围特征的具有预测能力的特征子集。dRFEtools 与 scikit-learn(流行的 Python 机器学习平台)集成,从而为大规模组学数据中的动态 RFE 提供了新的机会,同时提高了其可解释性。

可用性和实现

dRFEtools 可在 PyPI 上免费获得,网址为 https://pypi.org/project/drfetools/ 或在 GitHub 上获得,网址为 https://github.com/LieberInstitute/dRFEtools,它是用 Python 3 实现的,支持 Linux、Windows 和 Mac OS。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1b5/10471895/0b01bd035f9d/btad513f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验