
Multi-test decision tree and its application to microarray data classification.

Author information

Czajkowski Marcin, Grześ Marek, Kretowski Marek

Affiliations

Faculty of Computer Science, Bialystok University of Technology, Wiejska 45a, 15-351 Bialystok, Poland.

School of Computer Science, University of Waterloo, 200 University Avenue West, Waterloo, Ontario N2L 3G1, Canada.

Publication information

Artif Intell Med. 2014 May;61(1):35-44. doi: 10.1016/j.artmed.2014.01.005. Epub 2014 Feb 10.

DOI:10.1016/j.artmed.2014.01.005
PMID:24630712
Abstract

OBJECTIVE

A desirable property of tools used to investigate biological data is that their models and predictive decisions are easy to understand. Decision trees are particularly promising in this regard because their comprehensible structure resembles the hierarchical process of human decision making. However, existing algorithms for learning decision trees have a tendency to underfit gene expression data. The main aim of this work is to improve the performance and stability of decision trees with only a small increase in their complexity.

METHODS

We propose a multi-test decision tree (MTDT); our main contribution is the application of several univariate tests in each non-terminal node of the decision tree. We also search for alternative, lower-ranked features in order to obtain more stable and reliable predictions.
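As an illustration only, the idea of placing several univariate tests in one non-terminal node can be sketched as follows. The abstract does not say how the tests in a node are combined, so the majority vote used here is an assumption, and the names `UnivariateTest` and `MultiTestNode` are hypothetical rather than taken from the paper.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class UnivariateTest:
    """A single threshold test on one feature (e.g. one gene's expression)."""
    feature: int
    threshold: float

    def goes_left(self, x: Sequence[float]) -> bool:
        return x[self.feature] <= self.threshold


@dataclass
class MultiTestNode:
    """A non-terminal node holding several univariate tests.

    Assumption: the node routes a sample by strict majority vote of its
    tests. The abstract does not specify the combination rule.
    """
    tests: List[UnivariateTest]

    def goes_left(self, x: Sequence[float]) -> bool:
        votes = sum(t.goes_left(x) for t in self.tests)
        return 2 * votes > len(self.tests)


# Made-up thresholds and sample values, for illustration only.
node = MultiTestNode([
    UnivariateTest(feature=0, threshold=1.5),
    UnivariateTest(feature=2, threshold=0.3),
    UnivariateTest(feature=5, threshold=7.0),
])
sample = [1.0, 9.9, 0.9, 0.0, 0.0, 6.5]
print(node.goes_left(sample))  # True: features 0 and 5 vote left, feature 2 votes right
```

Under this assumed rule, a single noisy feature cannot flip the routing on its own, which is one plausible reading of how extra tests could make predictions more stable.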

RESULTS

Experimental validation was performed on several real-life gene expression datasets. Comparison with eight classifiers shows that MTDT has statistically significantly higher accuracy than popular decision tree classifiers and is highly competitive with ensemble learning algorithms. The proposed solution outperformed its baseline algorithm on 14 datasets by an average of 6%. A study performed on one of the datasets showed that the genes used in the MTDT classification model are supported by biological evidence in the literature.
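The abstract does not name the statistical test behind the accuracy comparison. As a hedged sketch of how a per-dataset comparison against a baseline might be summarized, the snippet below computes a mean accuracy gain and an exact two-sided sign test. The accuracy values are invented for illustration and are not results from the paper, and `sign_test_p_value` is a hypothetical helper.

```python
from math import comb


def sign_test_p_value(wins: int, losses: int) -> float:
    """Exact two-sided sign test p-value; ties are assumed to be dropped."""
    n = wins + losses
    k = min(wins, losses)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# Invented per-dataset accuracies (NOT from the paper).
mtdt = [0.91, 0.85, 0.78, 0.88, 0.95, 0.70]
baseline = [0.84, 0.80, 0.79, 0.81, 0.90, 0.62]

diffs = [a - b for a, b in zip(mtdt, baseline)]
wins = sum(d > 0 for d in diffs)
losses = sum(d < 0 for d in diffs)
mean_gain = sum(diffs) / len(diffs)
p_value = sign_test_p_value(wins, losses)

print(f"wins={wins}, losses={losses}, mean gain={mean_gain:.3f}, p={p_value:.4f}")
```

A sign test uses only the direction of each difference, so it is a conservative choice; rank-based tests over many classifiers are another common option for this kind of comparison.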

CONCLUSION

This paper introduces a new type of decision tree that is better suited to biological problems. MTDTs are relatively easy to analyze and much more powerful at modeling high-dimensional microarray data than their popular counterparts.
