发现癌症诊断数据分类的重要规则。

Discovery of significant rules for classifying cancer diagnosis data.

作者信息

Li Jinyan, Liu Huiqing, Ng See-Kiong, Wong Limsoon

机构信息

Institute for Infocomm Research, Heng Mui Keng Terrace, Singapore.

出版信息

Bioinformatics. 2003 Oct;19 Suppl 2:ii93-102. doi: 10.1093/bioinformatics/btg1066.

DOI:10.1093/bioinformatics/btg1066

PMID:14534178

Abstract

We introduce a new method to discover many diversified and significant rules from high dimensional profiling data. We also propose to aggregate the discriminating power of these rules for reliable predictions. The discovered rules are found to contain low-ranked features; these features are found to be sometimes necessary for classifiers to achieve perfect accuracy. The use of low-ranked but essential features in our method is in contrast to the prevailing use of an ad-hoc number of only top-ranked features. On a wide range of data sets, our method displayed highly competitive accuracy compared to the best performance of other kinds of classification models. In addition to accuracy, our method also provides comprehensible rules to help elucidate the translation between raw data and useful knowledge.

摘要

我们介绍了一种从高维剖析数据中发现许多多样化且重要规则的新方法。我们还提议汇总这些规则的判别力以进行可靠预测。发现的规则包含排名靠后的特征；这些特征有时被发现是分类器实现完美准确率所必需的。我们方法中使用排名靠后的但必不可少的特征，这与仅使用数量随意的顶级排名特征的普遍做法形成对比。在广泛的数据集上，与其他类型分类模型的最佳性能相比，我们的方法展现出极具竞争力的准确率。除了准确率，我们的方法还提供可理解的规则，以帮助阐明原始数据与有用知识之间的转化。

相似文献

Discovery of significant rules for classifying cancer diagnosis data.发现癌症诊断数据分类的重要规则。

Bioinformatics. 2003 Oct;19 Suppl 2:ii93-102. doi: 10.1093/bioinformatics/btg1066.

Simple decision rules for classifying human cancers from gene expression profiles.基于基因表达谱对人类癌症进行分类的简单决策规则。

Bioinformatics. 2005 Oct 15;21(20):3896-904. doi: 10.1093/bioinformatics/bti631. Epub 2005 Aug 16.

Toward a measure of classification complexity in gene expression signatures.迈向基因表达特征中分类复杂性的一种度量方法。

Annu Int Conf IEEE Eng Med Biol Soc. 2008;2008:5704-7. doi: 10.1109/IEMBS.2008.4650509.

Using fuzzy association rule mining in cancer classification.在癌症分类中使用模糊关联规则挖掘。

Australas Phys Eng Sci Med. 2011 Apr;34(1):41-54. doi: 10.1007/s13246-011-0054-8. Epub 2011 Feb 16.

Dimension reduction-based penalized logistic regression for cancer classification using microarray data.基于降维的惩罚逻辑回归用于利用微阵列数据进行癌症分类

IEEE/ACM Trans Comput Biol Bioinform. 2005 Apr-Jun;2(2):166-75. doi: 10.1109/TCBB.2005.22.

A robust meta-classification strategy for cancer diagnosis from gene expression data.一种基于基因表达数据进行癌症诊断的强大元分类策略。

Proc IEEE Comput Syst Bioinform Conf. 2005:322-5. doi: 10.1109/csb.2005.7.

Robust and accurate cancer classification with gene expression profiling.基于基因表达谱的稳健且准确的癌症分类

Proc IEEE Comput Syst Bioinform Conf. 2005:310-21. doi: 10.1109/csb.2005.49.

A classification framework applied to cancer gene expression profiles.一种应用于癌症基因表达谱的分类框架。

J Healthc Eng. 2013;4(2):255-83. doi: 10.1260/2040-2295.4.2.255.

Effects of replacing the unreliable cDNA microarray measurements on the disease classification based on gene expression profiles and functional modules.基于基因表达谱和功能模块，替换不可靠的cDNA微阵列测量值对疾病分类的影响。

Bioinformatics. 2006 Dec 1;22(23):2883-9. doi: 10.1093/bioinformatics/btl339. Epub 2006 Jun 29.

Cancer molecular pattern discovery by subspace consensus kernel classification.基于子空间共识核分类的癌症分子模式发现

Comput Syst Bioinformatics Conf. 2007;6:55-65.

引用本文的文献

A practical approach for colorectal cancer diagnosis based on machine learning.一种基于机器学习的结直肠癌诊断实用方法。

PLoS One. 2025 Apr 29;20(4):e0321009. doi: 10.1371/journal.pone.0321009. eCollection 2025.

Tracing the path from conservation to expansion evolutionary insights into NLR genes in oleaceae.追踪从保守到扩张的路径：木犀科NLR基因的进化见解

BMC Plant Biol. 2025 Feb 26;25(1):259. doi: 10.1186/s12870-025-06233-2.

ML-based Models as a Strategy to Discover Novel Antiepileptic Drugs Targeting Sodium Receptor Channel.基于机器学习的模型作为发现靶向钠受体通道的新型抗癫痫药物的策略。

Curr Top Med Chem. 2025;25(2):209-227. doi: 10.2174/0115680266331755241008061915.

Homogeneous Adaboost Ensemble Machine Learning Algorithms with Reduced Entropy on Balanced Data.基于平衡数据上具有降低熵的同质自适应提升集成机器学习算法

Entropy (Basel). 2023 Jan 29;25(2):245. doi: 10.3390/e25020245.

DNA computing for gastric cancer analysis and functional classification.用于胃癌分析和功能分类的DNA计算

Front Genet. 2022 Nov 24;13:1064715. doi: 10.3389/fgene.2022.1064715. eCollection 2022.

Engineering and screening of novel β-1,3-xylanases with desired hydrolysate type by optimized ancestor sequence reconstruction and data mining.通过优化祖先序列重建和数据挖掘对具有所需水解产物类型的新型β-1,3-木聚糖酶进行工程改造和筛选。

Comput Struct Biotechnol J. 2022 Jun 27;20:3313-3321. doi: 10.1016/j.csbj.2022.06.050. eCollection 2022.

A framework model using multifilter feature selection to enhance colon cancer classification.基于多滤波器特征选择的结肠癌分类增强框架模型。

PLoS One. 2021 Apr 16;16(4):e0249094. doi: 10.1371/journal.pone.0249094. eCollection 2021.

A DEEP LEARNING APPROACH FOR CANCER DETECTION AND RELEVANT GENE IDENTIFICATION.一种用于癌症检测和相关基因识别的深度学习方法。

Pac Symp Biocomput. 2017;22:219-229. doi: 10.1142/9789813207813_0022.

Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes.利用蛋白质-蛋白质相互作用网络中真实复合物与随机子图之间的对比模式来预测未知蛋白质复合物。

Sci Rep. 2016 Feb 12;6:21223. doi: 10.1038/srep21223.

The Use of Chemical-Chemical Interaction and Chemical Structure to Identify New Candidate Chemicals Related to Lung Cancer.利用化学-化学相互作用和化学结构来鉴定与肺癌相关的新候选化学物质。

PLoS One. 2015 Jun 5;10(6):e0128696. doi: 10.1371/journal.pone.0128696. eCollection 2015.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

发现癌症诊断数据分类的重要规则。

Discovery of significant rules for classifying cancer diagnosis data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献