• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于微阵列数据分析的具有准确且紧凑模糊规则库的可解释基因表达分类器。

Interpretable gene expression classifier with an accurate and compact fuzzy rule base for microarray data analysis.

作者信息

Ho Shinn-Ying, Hsieh Chih-Hung, Chen Hung-Ming, Huang Hui-Ling

机构信息

Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan.

出版信息

Biosystems. 2006 Sep;85(3):165-76. doi: 10.1016/j.biosystems.2006.01.002. Epub 2006 Feb 21.

DOI:10.1016/j.biosystems.2006.01.002
PMID:16490299
Abstract

An accurate classifier with linguistic interpretability using a small number of relevant genes is beneficial to microarray data analysis and development of inexpensive diagnostic tests. Several frequently used techniques for designing classifiers of microarray data, such as support vector machine, neural networks, k-nearest neighbor, and logistic regression model, suffer from low interpretabilities. This paper proposes an interpretable gene expression classifier (named iGEC) with an accurate and compact fuzzy rule base for microarray data analysis. The design of iGEC has three objectives to be simultaneously optimized: maximal classification accuracy, minimal number of rules, and minimal number of used genes. An "intelligent" genetic algorithm IGA is used to efficiently solve the design problem with a large number of tuning parameters. The performance of iGEC is evaluated using eight commonly-used data sets. It is shown that iGEC has an accurate, concise, and interpretable rule base (1.1 rules per class) on average in terms of test classification accuracy (87.9%), rule number (3.9), and used gene number (5.0). Moreover, iGEC not only has better performance than the existing fuzzy rule-based classifier in terms of the above-mentioned objectives, but also is more accurate than some existing non-rule-based classifiers.

摘要

使用少量相关基因且具有语言可解释性的精确分类器,有利于微阵列数据分析以及开发低成本诊断测试。一些常用于设计微阵列数据分类器的技术,如支持向量机、神经网络、k近邻和逻辑回归模型,都存在可解释性低的问题。本文提出了一种用于微阵列数据分析的具有精确且紧凑模糊规则库的可解释基因表达分类器(名为iGEC)。iGEC的设计有三个需要同时优化的目标:最大分类准确率、最少规则数和最少使用基因数。一种“智能”遗传算法IGA被用于有效解决具有大量调优参数的设计问题。使用八个常用数据集对iGEC的性能进行了评估。结果表明,就测试分类准确率(87.9%)、规则数(3.9)和使用基因数(5.0)而言,iGEC平均具有精确、简洁且可解释的规则库(每个类别1.1条规则)。此外,iGEC不仅在上述目标方面比现有的基于模糊规则的分类器性能更好,而且比一些现有的非基于规则的分类器更准确。

相似文献

1
Interpretable gene expression classifier with an accurate and compact fuzzy rule base for microarray data analysis.用于微阵列数据分析的具有准确且紧凑模糊规则库的可解释基因表达分类器。
Biosystems. 2006 Sep;85(3):165-76. doi: 10.1016/j.biosystems.2006.01.002. Epub 2006 Feb 21.
2
Data mining of gene expression data by fuzzy and hybrid fuzzy methods.利用模糊和混合模糊方法对基因表达数据进行数据挖掘。
IEEE Trans Inf Technol Biomed. 2010 Jan;14(1):23-9. doi: 10.1109/TITB.2009.2033590. Epub 2009 Oct 20.
3
Selecting a minimal number of relevant genes from microarray data to design accurate tissue classifiers.从微阵列数据中选择最少数量的相关基因以设计精确的组织分类器。
Biosystems. 2007 Jul-Aug;90(1):78-86. doi: 10.1016/j.biosystems.2006.07.002. Epub 2006 Jul 10.
4
GenSo-FDSS: a neural-fuzzy decision support system for pediatric ALL cancer subtype identification using gene expression data.GenSo-FDSS:一种基于基因表达数据的用于小儿急性淋巴细胞白血病癌症亚型识别的神经模糊决策支持系统。
Artif Intell Med. 2005 Jan;33(1):61-88. doi: 10.1016/j.artmed.2004.03.009.
5
Small, fuzzy and interpretable gene expression based classifiers.基于小的、模糊且可解释的基因表达的分类器。
Bioinformatics. 2005 May 1;21(9):1964-70. doi: 10.1093/bioinformatics/bti287. Epub 2005 Jan 20.
6
Fuzzy-rough supervised attribute clustering algorithm and classification of microarray data.模糊粗糙监督属性聚类算法与微阵列数据分类
IEEE Trans Syst Man Cybern B Cybern. 2011 Feb;41(1):222-33. doi: 10.1109/TSMCB.2010.2050684. Epub 2010 Jun 10.
7
ESVM: evolutionary support vector machine for automatic feature selection and classification of microarray data.ESVM:用于微阵列数据自动特征选择与分类的进化支持向量机
Biosystems. 2007 Sep-Oct;90(2):516-28. doi: 10.1016/j.biosystems.2006.12.003. Epub 2006 Dec 16.
8
Interval based fuzzy systems for identification of important genes from microarray gene expression data: Application to carcinogenic development.基于区间的模糊系统在微阵列基因表达数据中重要基因识别中的应用:在致癌发展中的应用。
J Biomed Inform. 2009 Dec;42(6):1022-8. doi: 10.1016/j.jbi.2009.06.003. Epub 2009 Jul 8.
9
Recursive fuzzy granulation for gene subsets extraction and cancer classification.用于基因子集提取和癌症分类的递归模糊粒度化
IEEE Trans Inf Technol Biomed. 2008 Nov;12(6):723-30. doi: 10.1109/TITB.2008.920787.
10
A novel generic hebbian ordering-based fuzzy rule base reduction approach to mamdani neuro-fuzzy system.一种基于新型通用赫布排序的模糊规则库约简方法在Mamdani神经模糊系统中的应用
Neural Comput. 2007 Jun;19(6):1656-80. doi: 10.1162/neco.2007.19.6.1656.

引用本文的文献

1
Automated Detection of Cancer Associated Genes Using a Combined Fuzzy-Rough-Set-Based F-Information and Water Swirl Algorithm of Human Gene Expression Data.基于模糊粗糙集的F信息与人类基因表达数据的水漩涡算法相结合自动检测癌症相关基因
PLoS One. 2016 Dec 9;11(12):e0167504. doi: 10.1371/journal.pone.0167504. eCollection 2016.
2
A genetic programming approach to oral cancer prognosis.一种用于口腔癌预后的遗传编程方法。
PeerJ. 2016 Sep 21;4:e2482. doi: 10.7717/peerj.2482. eCollection 2016.
3
Identification of Novel Pathways in Plant Lectin-Induced Cancer Cell Apoptosis.
鉴定植物凝集素诱导癌细胞凋亡的新途径。
Int J Mol Sci. 2016 Feb 8;17(2):228. doi: 10.3390/ijms17020228.
4
Fuzzy support vector machine: an efficient rule-based classification technique for microarrays.模糊支持向量机:一种用于微阵列的基于规则的高效分类技术。
BMC Bioinformatics. 2013;14 Suppl 13(Suppl 13):S4. doi: 10.1186/1471-2105-14-S13-S4. Epub 2013 Oct 1.
5
Predicting the survival time for diffuse large B-cell lymphoma using microarray data.利用微阵列数据预测弥漫性大B细胞淋巴瘤的生存时间。
J Mol Genet Med. 2012;6:287-92. doi: 10.4172/1747-0862.1000051. Epub 2012 May 23.
6
Predicting microRNA precursors with a generalized Gaussian components based density estimation algorithm.基于广义高斯成分密度估计算法预测 microRNA 前体。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S52. doi: 10.1186/1471-2105-11-S1-S52.
7
ProLoc-GO: utilizing informative Gene Ontology terms for sequence-based prediction of protein subcellular localization.ProLoc-GO:利用信息丰富的基因本体术语进行基于序列的蛋白质亚细胞定位预测。
BMC Bioinformatics. 2008 Feb 1;9:80. doi: 10.1186/1471-2105-9-80.
8
Feature selection and molecular classification of cancer using genetic programming.利用遗传编程进行癌症的特征选择与分子分类
Neoplasia. 2007 Apr;9(4):292-303. doi: 10.1593/neo.07121.