• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于气相色谱指纹图谱的组合特征随机森林法在烟叶分类中的应用。

A random forest of combined features in the classification of cut tobacco based on gas chromatography fingerprinting.

机构信息

Department of Computer Science & Engineering, Dalian University of Technology, Dalian 116024, China.

出版信息

Talanta. 2010 Sep 15;82(4):1571-5. doi: 10.1016/j.talanta.2010.07.053. Epub 2010 Jul 30.

DOI:10.1016/j.talanta.2010.07.053
PMID:20801374
Abstract

We applied the random forest method to discriminate among different kinds of cut tobacco. To overcome the influence of the descending resolution caused by column pollution and the subsequent deterioration of column efficacy at different testing times, we constructed combined peaks by summing the peaks over a specific elution time interval Deltat. On constructing tree classifiers, both the original peaks and the combined peaks were considered. A data set of 75 samples from three grades of the same tobacco brand was used to evaluate our method. Two parameters of the random forest were optimized using out-of-bag error, and the relationship between Deltat and classification rate was investigated. Experiments show that partial least squares discriminant analysis was not suitable because of the overfitting, and the random forest with the combined features performed more accurately than Naïve Bayes, support vector machines, bootstrap aggregating and the random forest using only its original features.

摘要

我们应用随机森林方法来区分不同种类的烟丝。为了克服由于柱污染导致的分辨率下降以及在不同测试时间下柱效随后恶化的影响,我们通过对特定洗脱时间间隔 Deltat 进行求和来构建组合峰。在构建树分类器时,同时考虑了原始峰和组合峰。我们使用同一烟草品牌的三个等级的 75 个样本数据集来评估我们的方法。通过袋外误差优化了随机森林的两个参数,并研究了 Deltat 与分类率之间的关系。实验表明,由于过拟合,偏最小二乘判别分析不适用,而使用组合特征的随机森林比朴素贝叶斯、支持向量机、自举聚合和仅使用原始特征的随机森林更准确。

相似文献

1
A random forest of combined features in the classification of cut tobacco based on gas chromatography fingerprinting.基于气相色谱指纹图谱的组合特征随机森林法在烟叶分类中的应用。
Talanta. 2010 Sep 15;82(4):1571-5. doi: 10.1016/j.talanta.2010.07.053. Epub 2010 Jul 30.
2
Application of the random forest classification method to peaks detected from mass spectrometric proteomic profiles of cancer patients and controls.将随机森林分类方法应用于从癌症患者和对照的质谱蛋白质组学图谱中检测到的峰。
Stat Appl Genet Mol Biol. 2008;7(2):Article4. doi: 10.2202/1544-6115.1349. Epub 2008 Feb 8.
3
Accelerated solvent extraction for GC-based tobacco fingerprinting and its comparison with simultaneous distillation and extraction.基于气相色谱的烟草特征图谱的加速溶剂萃取及其与同时蒸馏萃取的比较。
Talanta. 2010 Apr 15;81(1-2):650-6. doi: 10.1016/j.talanta.2009.12.054. Epub 2010 Jan 11.
4
Comparative analysis of the volatile components in cut tobacco from different locations with gas chromatography-mass spectrometry (GC-MS) and combined chemometric methods.采用气相色谱-质谱联用(GC-MS)技术并结合化学计量学方法对不同产地卷烟烟丝中的挥发性成分进行比较分析。
Anal Chim Acta. 2006 Aug 11;575(2):236-45. doi: 10.1016/j.aca.2006.05.079. Epub 2006 Jun 2.
5
Random forest: a classification and regression tool for compound classification and QSAR modeling.随机森林:一种用于化合物分类和定量构效关系建模的分类与回归工具。
J Chem Inf Comput Sci. 2003 Nov-Dec;43(6):1947-58. doi: 10.1021/ci034160g.
6
Classification of gasoline data obtained by gas chromatography using a piecewise alignment algorithm combined with feature selection and principal component analysis.使用分段比对算法结合特征选择和主成分分析对气相色谱法获得的汽油数据进行分类。
J Chromatogr A. 2005 Nov 25;1096(1-2):101-10. doi: 10.1016/j.chroma.2005.04.078.
7
Bayesian framework for least-squares support vector machine classifiers, gaussian processes, and kernel Fisher discriminant analysis.用于最小二乘支持向量机分类器、高斯过程和核Fisher判别分析的贝叶斯框架。
Neural Comput. 2002 May;14(5):1115-47. doi: 10.1162/089976602753633411.
8
Comparison of random forest and Pipeline Pilot Naïve Bayes in prospective QSAR predictions.随机森林与 Pipeline Pilot Naïve Bayes 在前瞻性 QSAR 预测中的比较。
J Chem Inf Model. 2012 Mar 26;52(3):792-803. doi: 10.1021/ci200615h. Epub 2012 Mar 8.
9
[Meta-analysis of the Italian studies on short-term effects of air pollution].[意大利关于空气污染短期影响研究的荟萃分析]
Epidemiol Prev. 2001 Mar-Apr;25(2 Suppl):1-71.
10
A classification paradigm for distributed vertically partitioned data.一种用于分布式垂直分区数据的分类范式。
Neural Comput. 2004 Jul;16(7):1525-44. doi: 10.1162/089976604323057470.

引用本文的文献

1
Estimating high-density aboveground biomass within a complex tropical grassland using Worldview-3 imagery.利用 Worldview-3 影像估算复杂热带草原的高密度地上生物量。
Environ Monit Assess. 2024 Mar 15;196(4):370. doi: 10.1007/s10661-024-12476-7.
2
Prediction of Thermogravimetric Data in the Thermal Recycling of e-waste Using Machine Learning Techniques: A Data-driven Approach.使用机器学习技术预测电子垃圾热回收中的热重数据:一种数据驱动的方法
ACS Omega. 2023 Oct 30;8(45):43254-43270. doi: 10.1021/acsomega.3c07228. eCollection 2023 Nov 14.
3
Estimate earth fissure hazard based on machine learning in the Qa' Jahran Basin, Yemen.
基于机器学习在也门卡贾万盆地估算地裂缝灾害。
Sci Rep. 2022 Dec 19;12(1):21936. doi: 10.1038/s41598-022-26526-y.
4
Auto-classification of biomass through characterization of their pyrolysis behaviors using thermogravimetric analysis with support vector machine algorithm: case study for tobacco.使用支持向量机算法通过热重分析表征生物质热解行为实现生物质自动分类:烟草案例研究
Biotechnol Biofuels. 2021 Apr 27;14(1):106. doi: 10.1186/s13068-021-01942-w.
5
Modeling maize above-ground biomass based on machine learning approaches using UAV remote-sensing data.基于无人机遥感数据,运用机器学习方法对玉米地上生物量进行建模。
Plant Methods. 2019 Feb 4;15:10. doi: 10.1186/s13007-019-0394-z. eCollection 2019.
6
Land cover mapping based on random forest classification of multitemporal spectral and thermal images.基于多时相光谱和热图像随机森林分类的土地覆盖制图
Environ Monit Assess. 2015 May;187(5):291. doi: 10.1007/s10661-015-4489-3. Epub 2015 Apr 25.