项目反应理论作为机器学习中特征选择和解释的工具。

Item response theory as a feature selection and interpretation tool in the context of machine learning.

机构信息

Department of Biomedical Engineering, University of Calgary, Calgary, AB, Canada.

Undergraduate Medical Education, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.

出版信息

Med Biol Eng Comput. 2021 Feb;59(2):471-482. doi: 10.1007/s11517-020-02301-x. Epub 2021 Feb 3.

DOI:10.1007/s11517-020-02301-x

PMID:33534111

Abstract

Optimizing the number and utility of features to use in a classification analysis has been the subject of many research studies. Most current models use end-classifications as part of the feature reduction process, leading to circularity in the methodology. The approach demonstrated in the present research uses item response theory (IRT) to select features independent of the end-classification results without the biased accuracies that this circularity engenders. Dichotomous and polytomous IRT models were used to analyze 30 histological breast cancer features from 569 patients using the Wisconsin Diagnostic Breast Cancer data set. Based on their characteristics, three features were selected for use in a machine learning classifier. For comparison purposes, two machine learning-based feature selection protocols were run-recursive feature elimination (RFE) and ridge regression-and the three features selected from these analyses were also used in the subsequent learning classifier. Classification results demonstrated that all three selection processes performed comparably. The non-biased nature of the IRT protocol and information provided about the specific characteristics of the features as to why they are of use in classification help to shed light on understanding which attributes of features make them suitable for use in a machine learning context.

摘要

优化分类分析中使用的特征数量和效用一直是许多研究的主题。大多数现有模型将终端分类用作特征减少过程的一部分，导致方法学中的循环。本研究中展示的方法使用项目反应理论（IRT）在不产生这种循环的有偏差准确性的情况下，独立于终端分类结果选择特征。二项式和多项式 IRT 模型用于使用威斯康星州诊断乳腺癌数据集分析来自 569 名患者的 30 个乳腺癌组织学特征。基于其特征，选择了三个特征用于机器学习分类器。出于比较目的，运行了两种基于机器学习的特征选择协议——递归特征消除（RFE）和岭回归——并在后续学习分类器中使用了这些分析中选择的三个特征。分类结果表明，所有三个选择过程的性能相当。IRT 协议的无偏性质以及关于特征为何在分类中有用的特定特征的信息提供有助于阐明理解哪些特征属性使其适合在机器学习上下文中使用。

相似文献

Item response theory as a feature selection and interpretation tool in the context of machine learning.项目反应理论作为机器学习中特征选择和解释的工具。

Med Biol Eng Comput. 2021 Feb;59(2):471-482. doi: 10.1007/s11517-020-02301-x. Epub 2021 Feb 3.

Novel Feature Selection for Artificial Intelligence Using Item Response Theory for Mortality Prediction.基于项目反应理论的人工智能新型特征选择用于死亡率预测

Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:5729-5732. doi: 10.1109/EMBC44109.2020.9175403.

Classification of pulmonary lesion based on multiparametric MRI: utility of radiomics and comparison of machine learning methods.基于多参数 MRI 的肺部病变分类：放射组学的效用及机器学习方法的比较。

Eur Radiol. 2020 Aug;30(8):4595-4605. doi: 10.1007/s00330-020-06768-y. Epub 2020 Mar 28.

An Efficient Feature Selection Strategy Based on Multiple Support Vector Machine Technology with Gene Expression Data.基于基因表达数据的多支持向量机技术的高效特征选择策略。

Biomed Res Int. 2018 Aug 30;2018:7538204. doi: 10.1155/2018/7538204. eCollection 2018.

Multiclass Classification for the Differential Diagnosis on the ADHD Subtypes Using Recursive Feature Elimination and Hierarchical Extreme Learning Machine: Structural MRI Study.基于递归特征消除和分层极限学习机的多动症亚型鉴别诊断多分类：结构磁共振成像研究

PLoS One. 2016 Aug 8;11(8):e0160697. doi: 10.1371/journal.pone.0160697. eCollection 2016.

Absolute cosine-based SVM-RFE feature selection method for prostate histopathological grading.基于绝对余弦的 SVM-RFE 特征选择方法在前列腺组织病理分级中的应用。

Artif Intell Med. 2018 May;87:78-90. doi: 10.1016/j.artmed.2018.04.002. Epub 2018 Apr 19.

A Wrapper Feature Subset Selection Method Based on Randomized Search and Multilayer Structure.基于随机搜索和多层结构的包装特征子集选择方法。

Biomed Res Int. 2019 Nov 4;2019:9864213. doi: 10.1155/2019/9864213. eCollection 2019.

Analysis of structural brain MRI and multi-parameter classification for Alzheimer's disease.阿尔茨海默病的脑结构磁共振成像分析及多参数分类

Biomed Tech (Berl). 2018 Jul 26;63(4):427-437. doi: 10.1515/bmt-2016-0239.

A novel machine learning strategy for model selections - Stepwise Support Vector Machine (StepSVM).一种新的机器学习模型选择策略 - 逐步支持向量机（StepSVM）。

PLoS One. 2020 Aug 27;15(8):e0238384. doi: 10.1371/journal.pone.0238384. eCollection 2020.

Prediction of unenhanced lesion evolution in multiple sclerosis using radiomics-based models: a machine learning approach.基于放射组学模型预测多发性硬化症未增强病灶的演变：一种机器学习方法。

Mult Scler Relat Disord. 2021 Aug;53:102989. doi: 10.1016/j.msard.2021.102989. Epub 2021 May 4.

引用本文的文献

IRTCI: Item Response Theory for Categorical Imputation.IRTCI：用于分类插补的项目反应理论

Res Sq. 2024 Jul 2:rs.3.rs-4529519. doi: 10.21203/rs.3.rs-4529519/v1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

项目反应理论作为机器学习中特征选择和解释的工具。

Item response theory as a feature selection and interpretation tool in the context of machine learning.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献