• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于振动光谱数据分析的集成变量选择方法。

An ensemble variable selection method for vibrational spectroscopic data analysis.

作者信息

Zhang Jixiong, Yan Hong, Xiong Yanmei, Li Qianqian, Min Shungeng

机构信息

College of Science, China Agricultural University No. 2, Yuanmingyuanxi Road, Haidian District Beijing 100193 P.R. China

School of Marine Science, China University of Geosciences in Beijing Beijing 100086 China.

出版信息

RSC Adv. 2019 Feb 26;9(12):6708-6716. doi: 10.1039/c8ra08754g. eCollection 2019 Feb 22.

DOI:10.1039/c8ra08754g
PMID:35548689
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9087301/
Abstract

Wavelength selection is a critical factor for pattern recognition of vibrational spectroscopic data. Not only does it alleviate the effect of dimensionality on an algorithm's generalization performance, but it also enhances the understanding and interpretability of multivariate classification models. In this study, a novel partial least squares discriminant analysis (PLSDA)-based wavelength selection algorithm, termed ensemble of bootstrapping space shrinkage (EBSS), has been devised for vibrational spectroscopic data analysis. In the algorithm, a set of subsets are generated from a data set using random sampling. For an individual subset, a feature space is determined by maximizing the expected 10-fold cross-validation accuracy with a weighted bootstrap sampling strategy. Then an ensemble strategy and a sequential forward selection method are applied to the feature spaces to select characteristic variables. Experimental results obtained from analysis of real vibrational spectroscopic data sets demonstrate that the ensemble wavelength selection algorithm can reserve stable and informative variables for the final modeling and improve predictive ability for multivariate classification models.

摘要

波长选择是振动光谱数据模式识别的关键因素。它不仅能减轻维度对算法泛化性能的影响,还能增强多元分类模型的可理解性和可解释性。在本研究中,一种基于偏最小二乘判别分析(PLSDA)的新型波长选择算法——自展空间收缩集成算法(EBSS)被设计用于振动光谱数据分析。在该算法中,通过随机抽样从数据集中生成一组子集。对于单个子集,采用加权自展抽样策略,通过最大化期望的10折交叉验证准确率来确定特征空间。然后将集成策略和顺序向前选择方法应用于这些特征空间以选择特征变量。对实际振动光谱数据集的分析结果表明,该集成波长选择算法能够为最终建模保留稳定且信息丰富的变量,并提高多元分类模型的预测能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/8fce0e27eaee/c8ra08754g-f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/8ec8f539e982/c8ra08754g-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/4a4597fe0f44/c8ra08754g-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/c26cdc78a4e9/c8ra08754g-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/a986234fc1d0/c8ra08754g-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/5edc37ac756d/c8ra08754g-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/53b27e39bbb7/c8ra08754g-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/8558b7b1c159/c8ra08754g-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/8fce0e27eaee/c8ra08754g-f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/8ec8f539e982/c8ra08754g-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/4a4597fe0f44/c8ra08754g-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/c26cdc78a4e9/c8ra08754g-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/a986234fc1d0/c8ra08754g-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/5edc37ac756d/c8ra08754g-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/53b27e39bbb7/c8ra08754g-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/8558b7b1c159/c8ra08754g-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0ce/9087301/8fce0e27eaee/c8ra08754g-f8.jpg

相似文献

1
An ensemble variable selection method for vibrational spectroscopic data analysis.一种用于振动光谱数据分析的集成变量选择方法。
RSC Adv. 2019 Feb 26;9(12):6708-6716. doi: 10.1039/c8ra08754g. eCollection 2019 Feb 22.
2
A new hybrid filter/wrapper algorithm for feature selection in classification.一种用于分类中特征选择的新型混合过滤/包装算法。
Anal Chim Acta. 2019 Nov 8;1080:43-54. doi: 10.1016/j.aca.2019.06.054. Epub 2019 Jun 28.
3
Coupling bootstrap with synergy self-organizing map-based orthogonal partial least squares discriminant analysis: Stable metabolic biomarker selection for inherited metabolic diseases.将自举法与基于协同自组织映射的正交偏最小二乘判别分析相结合:用于遗传性代谢疾病的稳定代谢生物标志物选择。
Talanta. 2020 Nov 1;219:121370. doi: 10.1016/j.talanta.2020.121370. Epub 2020 Jul 9.
4
A bootstrapping soft shrinkage approach for variable selection in chemical modeling.一种用于化学建模中变量选择的自举软收缩方法。
Anal Chim Acta. 2016 Feb 18;908:63-74. doi: 10.1016/j.aca.2016.01.001. Epub 2016 Jan 7.
5
Bootstrapping search margin-based nearest neighbor method for qualitative spectroscopic analysis.基于自举搜索边际的最近邻法用于定性光谱分析。
Spectrochim Acta A Mol Biomol Spectrosc. 2019 Dec 5;223:117110. doi: 10.1016/j.saa.2019.05.015. Epub 2019 May 11.
6
On Splitting Training and Validation Set: A Comparative Study of Cross-Validation, Bootstrap and Systematic Sampling for Estimating the Generalization Performance of Supervised Learning.关于划分训练集和验证集:交叉验证、自助法和系统抽样在估计监督学习泛化性能方面的比较研究
J Anal Test. 2018;2(3):249-262. doi: 10.1007/s41664-018-0068-2. Epub 2018 Oct 29.
7
Variable selection in discriminant partial least-squares analysis.判别式偏最小二乘分析中的变量选择
Anal Chem. 1998 Oct 1;70(19):4126-33. doi: 10.1021/ac980506o.
8
Three-step hybrid strategy towards efficiently selecting variables in multivariate calibration of near-infrared spectra.三步混合策略在近红外光谱多元校正中高效选择变量。
Spectrochim Acta A Mol Biomol Spectrosc. 2020 Jan 5;224:117376. doi: 10.1016/j.saa.2019.117376. Epub 2019 Jul 8.
9
A blocking strategy to improve gene selection for classification of gene expression data.一种用于改进基因选择以对基因表达数据进行分类的阻断策略。
IEEE/ACM Trans Comput Biol Bioinform. 2007 Apr-Jun;4(2):293-300. doi: 10.1109/TCBB.2007.1014.
10
Using variable combination population analysis for variable selection in multivariate calibration.在多元校准中使用可变组合总体分析进行变量选择。
Anal Chim Acta. 2015 Mar 3;862:14-23. doi: 10.1016/j.aca.2014.12.048. Epub 2014 Dec 30.

引用本文的文献

1
Solubility determination, dissolution properties and solid transformation of resmetirom (form A) in heptane and seven alcohols.瑞美替隆(晶型A)在庚烷和七种醇中的溶解度测定、溶解特性及固态转变
RSC Adv. 2023 Jul 27;13(32):22172-22184. doi: 10.1039/d3ra02521g. eCollection 2023 Jul 19.

本文引用的文献

1
Convolutional neural networks for vibrational spectroscopic data analysis.卷积神经网络在振动光谱数据分析中的应用。
Anal Chim Acta. 2017 Feb 15;954:22-31. doi: 10.1016/j.aca.2016.12.010. Epub 2016 Dec 27.
2
Rapid Classification of Turmeric Based on DNA Fingerprint by Near-Infrared Spectroscopy Combined with Moving Window Partial Least Squares-Discrimination Analysis.
Anal Sci. 2017;33(1):111-115. doi: 10.2116/analsci.33.111.
3
A novel algorithm for spectral interval combination optimization.一种用于光谱区间组合优化的新算法。
Anal Chim Acta. 2016 Dec 15;948:19-29. doi: 10.1016/j.aca.2016.10.041. Epub 2016 Nov 2.
4
Combining NMR and LC/MS Using Backward Variable Elimination: Metabolomics Analysis of Colorectal Cancer, Polyps, and Healthy Controls.采用反向变量消除法结合 NMR 和 LC/MS:结直肠癌、息肉和健康对照的代谢组学分析。
Anal Chem. 2016 Aug 16;88(16):7975-83. doi: 10.1021/acs.analchem.6b00885. Epub 2016 Aug 1.
5
A bootstrapping soft shrinkage approach for variable selection in chemical modeling.一种用于化学建模中变量选择的自举软收缩方法。
Anal Chim Acta. 2016 Feb 18;908:63-74. doi: 10.1016/j.aca.2016.01.001. Epub 2016 Jan 7.
6
Soil type recognition as improved by genetic algorithm-based variable selection using near infrared spectroscopy and partial least squares discriminant analysis.基于遗传算法变量选择的近红外光谱和偏最小二乘判别分析用于改进土壤类型识别。
Sci Rep. 2015 Jun 18;5:10930. doi: 10.1038/srep10930.
7
Data size reduction strategy for the classification of breath and air samples using multicapillary column-ion mobility spectrometry.基于多毛细管柱-离子迁移谱法的呼吸和空气样本分类数据量缩减策略
Anal Chem. 2015 Jan 20;87(2):869-75. doi: 10.1021/ac503857y. Epub 2015 Jan 8.
8
Evaluation of the effect of chance correlations on variable selection using Partial Least Squares-Discriminant Analysis.运用偏最小二乘判别分析评估机遇相关性对变量选择的影响。
Talanta. 2013 Nov 15;116:835-40. doi: 10.1016/j.talanta.2013.07.048. Epub 2013 Aug 9.
9
A Partial Least Squares based algorithm for parsimonious variable selection.一种基于偏最小二乘法的简约变量选择算法。
Algorithms Mol Biol. 2011 Dec 5;6(1):27. doi: 10.1186/1748-7188-6-27.
10
Detection and chemical profiling of medicine counterfeits by Raman spectroscopy and chemometrics.拉曼光谱和化学计量学检测和分析假药。
Anal Chim Acta. 2011 Oct 31;705(1-2):334-41. doi: 10.1016/j.aca.2011.07.043. Epub 2011 Aug 5.