通过 Swirls 和 Ripples 实现基因表达微阵列的简单灵活分类。

Simple and flexible classification of gene expression microarrays via Swirls and Ripples.

机构信息

Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, EPN 3131, 6130 Executive Blvd MSC 7354, Bethesda, MD 20892-7354, USA.

出版信息

BMC Bioinformatics. 2010 Sep 8;11:452. doi: 10.1186/1471-2105-11-452.

DOI:10.1186/1471-2105-11-452

PMID:20825641

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2949887/

Abstract

BACKGROUND

A simple classification rule with few genes and parameters is desirable when applying a classification rule to new data. One popular simple classification rule, diagonal discriminant analysis, yields linear or curved classification boundaries, called Ripples, that are optimal when gene expression levels are normally distributed with the appropriate variance, but may yield poor classification in other situations.

RESULTS

A simple modification of diagonal discriminant analysis yields smooth highly nonlinear classification boundaries, called Swirls, that sometimes outperforms Ripples. In particular, if the data are normally distributed with different variances in each class, Swirls substantially outperforms Ripples when using a pooled variance to reduce the number of parameters. The proposed classification rule for two classes selects either Swirls or Ripples after parsimoniously selecting the number of genes and distance measures. Applications to five cancer microarray data sets identified predictive genes related to the tissue organization theory of carcinogenesis.

CONCLUSION

The parsimonious selection of classifiers coupled with the selection of either Swirls or Ripples provides a good basis for formulating a simple, yet flexible, classification rule. Open source software is available for download.

摘要

背景

当将分类规则应用于新数据时，需要使用具有少量基因和参数的简单分类规则。一种流行的简单分类规则，即对角判别分析，会产生线性或曲线分类边界，称为 Ripples，当基因表达水平呈正态分布且方差适当时，这种边界是最优的，但在其他情况下可能会导致较差的分类。

结果

对角判别分析的一个简单修改会产生平滑的高度非线性分类边界，称为 Swirls，它有时会优于 Ripples。特别是，如果数据在每个类别中呈正态分布但方差不同，当使用 pooled variance 来减少参数数量时，Swirls 会大大优于 Ripples。对于两类分类规则，在简洁地选择基因数量和距离度量之后，会选择 Swirls 或 Ripples。应用于五个癌症微阵列数据集，确定了与致癌发生的组织学理论相关的预测基因。

结论

分类器的简约选择加上对 Swirls 或 Ripples 的选择为制定简单而灵活的分类规则提供了良好的基础。可下载开源软件。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3685/2949887/abd2cda7ecf6/1471-2105-11-452-1.jpg

相似文献

Simple and flexible classification of gene expression microarrays via Swirls and Ripples.通过 Swirls 和 Ripples 实现基因表达微阵列的简单灵活分类。

BMC Bioinformatics. 2010 Sep 8;11:452. doi: 10.1186/1471-2105-11-452.

Eigengene-based linear discriminant model for tumor classification using gene expression microarray data.基于特征基因的线性判别模型用于利用基因表达微阵列数据进行肿瘤分类

Bioinformatics. 2006 Nov 1;22(21):2635-42. doi: 10.1093/bioinformatics/btl442. Epub 2006 Aug 22.

Regularized Least Squares Cancer classifiers from DNA microarray data.基于DNA微阵列数据的正则化最小二乘癌症分类器。

BMC Bioinformatics. 2005 Dec 1;6 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-6-S4-S2.

Regularized linear discriminant analysis and its application in microarrays.正则化线性判别分析及其在微阵列中的应用。

Biostatistics. 2007 Jan;8(1):86-100. doi: 10.1093/biostatistics/kxj035. Epub 2006 Apr 7.

M@CBETH: a microarray classification benchmarking tool.M@CBETH：一种微阵列分类基准测试工具。

Bioinformatics. 2005 Jul 15;21(14):3185-6. doi: 10.1093/bioinformatics/bti495. Epub 2005 May 12.

HykGene: a hybrid approach for selecting marker genes for phenotype classification using microarray gene expression data.HykGene：一种利用微阵列基因表达数据选择用于表型分类的标记基因的混合方法。

Bioinformatics. 2005 Apr 15;21(8):1530-7. doi: 10.1093/bioinformatics/bti192. Epub 2004 Dec 7.

PCP: a program for supervised classification of gene expression profiles.PCP：一个用于基因表达谱监督分类的程序。

Bioinformatics. 2006 Jan 15;22(2):245-7. doi: 10.1093/bioinformatics/bti760. Epub 2005 Nov 8.

Accurate molecular classification of cancer using simple rules.使用简单规则进行准确的癌症分子分类。

BMC Med Genomics. 2009 Oct 30;2:64. doi: 10.1186/1755-8794-2-64.

Comparison of linear discriminant analysis methods for the classification of cancer based on gene expression data.基于基因表达数据的癌症分类的线性判别分析方法比较。

J Exp Clin Cancer Res. 2009 Dec 10;28(1):149. doi: 10.1186/1756-9966-28-149.

Optimal number of features as a function of sample size for various classification rules.针对各种分类规则，作为样本大小函数的最优特征数量。

Bioinformatics. 2005 Apr 15;21(8):1509-15. doi: 10.1093/bioinformatics/bti171. Epub 2004 Nov 30.

引用本文的文献

Evaluating Markers for Guiding Treatment.评估用于指导治疗的标志物。

J Natl Cancer Inst. 2016 May 18;108(9). doi: 10.1093/jnci/djw101. Print 2016 Sep.

A cancer theory kerfuffle can lead to new lines of research.一场癌症理论的争论可能会引发新的研究方向。

J Natl Cancer Inst. 2014 Dec 20;107(2). doi: 10.1093/jnci/dju405. Print 2015 Feb.

Systems analysis of high-throughput data.高通量数据的系统分析

Adv Exp Med Biol. 2014;844:153-87. doi: 10.1007/978-1-4939-2095-2_8.

Evaluating surrogate endpoints, prognostic markers, and predictive markers: Some simple themes.评估替代终点、预后标志物和预测标志物：一些简单的主题。

Clin Trials. 2015 Aug;12(4):299-308. doi: 10.1177/1740774514557725. Epub 2014 Nov 10.

Bivariate marker measurements and ROC analysis.双变量标志物测量与ROC分析。

Biometrics. 2012 Dec;68(4):1207-18. doi: 10.1111/j.1541-0420.2012.01783.x. Epub 2012 Sep 24.

Gene signatures revisited.重新审视基因特征。

J Natl Cancer Inst. 2012 Feb 22;104(4):262-3. doi: 10.1093/jnci/djr557. Epub 2012 Jan 18.

Partition decoupling for multi-gene analysis of gene expression profiling data.分群解耦在基因表达谱数据分析中的多基因分析

BMC Bioinformatics. 2011 Dec 30;12:497. doi: 10.1186/1471-2105-12-497.

Robust two-gene classifiers for cancer prediction.用于癌症预测的稳健双基因分类器。

Genomics. 2012 Feb;99(2):90-5. doi: 10.1016/j.ygeno.2011.11.003. Epub 2011 Nov 27.

Microarray-based cancer prediction using single genes.基于微阵列的单基因癌症预测。

BMC Bioinformatics. 2011 Oct 7;12:391. doi: 10.1186/1471-2105-12-391.

Systems biology and cancer: promises and perils.系统生物学与癌症：前景与挑战。

Prog Biophys Mol Biol. 2011 Aug;106(2):410-3. doi: 10.1016/j.pbiomolbio.2011.03.002. Epub 2011 Mar 23.

本文引用的文献

Research on early-stage carcinogenesis: are we approaching paradigm instability?早期致癌作用的研究：我们是否正接近范式不稳定？

J Clin Oncol. 2010 Jul 10;28(20):3215-8. doi: 10.1200/JCO.2010.28.5460. Epub 2010 Jun 14.

Incorporating gene co-expression network in identification of cancer prognosis markers.将基因共表达网络纳入癌症预后标志物的鉴定中。

BMC Bioinformatics. 2010 May 20;11:271. doi: 10.1186/1471-2105-11-271.

Using relative utility curves to evaluate risk prediction.使用相对效用曲线评估风险预测。

J R Stat Soc Ser A Stat Soc. 2009 Oct 1;172(4):729-748. doi: 10.1111/j.1467-985X.2009.00592.x.

Putting risk prediction in perspective: relative utility curves.正确看待风险预测：相对效用曲线。

J Natl Cancer Inst. 2009 Nov 18;101(22):1538-42. doi: 10.1093/jnci/djp353. Epub 2009 Oct 20.

Zyxin mediates actin fiber reorganization in epithelial-mesenchymal transition and contributes to endocardial morphogenesis.斑联蛋白介导上皮-间质转化过程中的肌动蛋白纤维重组，并有助于心内膜形态发生。

Mol Biol Cell. 2009 Jul;20(13):3115-24. doi: 10.1091/mbc.e09-01-0046. Epub 2009 May 13.

Plausibility of stromal initiation of epithelial cancers without a mutation in the epithelium: a computer simulation of morphostats.上皮无突变情况下基质引发上皮癌的合理性：形态稳定态的计算机模拟

BMC Cancer. 2009 Mar 23;9:89. doi: 10.1186/1471-2407-9-89.

Theories of carcinogenesis: an emerging perspective.致癌理论：一种新出现的观点。

Semin Cancer Biol. 2008 Oct;18(5):372-7. doi: 10.1016/j.semcancer.2008.03.012. Epub 2008 Mar 26.

Paradoxes in carcinogenesis: new opportunities for research directions.癌症发生中的悖论：研究方向的新机遇

BMC Cancer. 2007 Aug 6;7:151. doi: 10.1186/1471-2407-7-151.

Identifying genes that contribute most to good classification in microarrays.识别在微阵列中对良好分类贡献最大的基因。

BMC Bioinformatics. 2006 Sep 7;7:407. doi: 10.1186/1471-2105-7-407.

Regularized linear discriminant analysis and its application in microarrays.正则化线性判别分析及其在微阵列中的应用。

Biostatistics. 2007 Jan;8(1):86-100. doi: 10.1093/biostatistics/kxj035. Epub 2006 Apr 7.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过 Swirls 和 Ripples 实现基因表达微阵列的简单灵活分类。

Simple and flexible classification of gene expression microarrays via Swirls and Ripples.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献