Does feature selection improve classification accuracy? Impact of sample size and feature selection on classification using anatomical magnetic resonance images.

Affiliation

Section on Functional Imaging Methods, Laboratory of Brain and Cognition, NIMH, NIH, Bethesda, USA.

Publication information

Neuroimage. 2012 Mar;60(1):59-70. doi: 10.1016/j.neuroimage.2011.11.066. Epub 2011 Dec 1.

DOI:10.1016/j.neuroimage.2011.11.066
PMID:22166797
Abstract

A growing number of studies use machine learning approaches to characterize patterns of anatomical difference discernible from neuroimaging data. The high dimensionality of image data often raises the concern that feature selection is needed to obtain optimal accuracy. Among previous studies, most of which used fixed sample sizes, some show greater predictive accuracies with feature selection, whereas others do not. In this study, we compared four common feature selection methods: 1) pre-selected regions of interest (ROIs) based on prior knowledge, 2) univariate t-test filtering, 3) recursive feature elimination (RFE), and 4) t-test filtering constrained by ROIs. The predictive accuracies achieved from different sample sizes, with and without feature selection, were compared statistically. To demonstrate the effect, we used grey matter segmented from the T1-weighted anatomical scans collected by the Alzheimer's Disease Neuroimaging Initiative (ADNI) as the input features to a linear support vector machine classifier. The objective was to characterize the patterns of difference between Alzheimer's disease (AD) patients and cognitively normal subjects, and also between mild cognitive impairment (MCI) patients and normal subjects. In addition, we compared the classification accuracies between MCI patients who converted to AD and MCI patients who did not convert within a period of 12 months. Predictive accuracies from the two data-driven feature selection methods (t-test filtering and RFE) were no better than those achieved using whole-brain data. We showed that we could achieve the most accurate characterizations by using prior knowledge of where to expect neurodegeneration (hippocampus and parahippocampal gyrus). Feature selection can therefore improve classification accuracy, but whether it does depends on the method adopted. In general, larger sample sizes yielded higher accuracies, and the advantage gained from using knowledge from the existing literature became smaller.
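
The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: it assumes scikit-learn, replaces the ADNI grey-matter maps with synthetic Gaussian features, and treats the first `n_roi` columns as a hypothetical "ROI" that carries the group difference. The names `make_toy_data`, `take_columns`, `build_pipelines`, and `compare_strategies`, and all parameter values, are illustrative assumptions. The t-test filter is approximated with `SelectKBest(f_classif)`, since for two classes the ANOVA F statistic equals the squared pooled-variance t statistic and so gives the same feature ranking.

```python
import numpy as np
from sklearn.feature_selection import RFE, SelectKBest, f_classif
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import FunctionTransformer
from sklearn.svm import SVC


def make_toy_data(n_samples=80, n_features=300, n_roi=20, effect=1.0, seed=0):
    """Synthetic stand-in for voxel features; only the first n_roi columns carry signal."""
    rng = np.random.default_rng(seed)
    y = np.arange(n_samples) % 2               # balanced labels: 0 = control, 1 = patient
    X = rng.standard_normal((n_samples, n_features))
    X[:, :n_roi] -= effect * y[:, None]        # lower values in the hypothetical ROI for patients
    return X, y, np.arange(n_roi)


def take_columns(X, idx):
    """Keep only the columns listed in idx (a fixed, knowledge-based mask)."""
    return X[:, idx]


def build_pipelines(roi_idx, k=20):
    """One pipeline per strategy; every pipeline ends in a linear SVM."""
    def svm():
        return SVC(kernel="linear", C=1.0)

    def roi_mask():
        return FunctionTransformer(take_columns, kw_args={"idx": roi_idx})

    k_roi = max(1, min(k, len(roi_idx) // 2))  # keep half of the ROI features at most
    return {
        "whole_brain": make_pipeline(svm()),
        "roi": make_pipeline(roi_mask(), svm()),
        "ttest": make_pipeline(SelectKBest(f_classif, k=k), svm()),
        "rfe": make_pipeline(
            RFE(SVC(kernel="linear"), n_features_to_select=k, step=0.2), svm()
        ),
        "roi_ttest": make_pipeline(roi_mask(), SelectKBest(f_classif, k=k_roi), svm()),
    }


def compare_strategies(n_samples=80, seed=0):
    """Return mean 5-fold cross-validated accuracy for each strategy."""
    X, y, roi_idx = make_toy_data(n_samples=n_samples, seed=seed)
    cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=seed)
    return {
        name: float(cross_val_score(pipe, X, y, cv=cv).mean())
        for name, pipe in build_pipelines(roi_idx).items()
    }


if __name__ == "__main__":
    for name, acc in compare_strategies().items():
        print(f"{name:12s} accuracy = {acc:.2f}")
```

Because each data-driven selector sits inside the pipeline, it is refitted on the training folds only, which avoids leaking test labels into the feature ranking. Calling `compare_strategies` with different `n_samples` values gives a rough way to probe how the gap between strategies changes with sample size on this toy data; the numbers it produces say nothing about the ADNI results.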
