Simple and flexible classification of gene expression microarrays via Swirls and Ripples.

Affiliation

Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, EPN 3131, 6130 Executive Blvd MSC 7354, Bethesda, MD 20892-7354, USA.

Publication information

BMC Bioinformatics. 2010 Sep 8;11:452. doi: 10.1186/1471-2105-11-452.

Abstract

BACKGROUND

A simple classification rule with few genes and parameters is desirable when applying a classification rule to new data. One popular simple classification rule, diagonal discriminant analysis, yields linear or curved classification boundaries, called Ripples, that are optimal when gene expression levels are normally distributed with the appropriate variance, but may yield poor classification in other situations.
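As a rough illustration of the diagonal discriminant analysis described above, the following is a minimal sketch of the standard textbook rule for two classes with equal priors. It is not the authors' software, and it does not implement Swirls, which the abstract does not define. The function names `fit_dda` and `predict_dda` and the `pooled` flag are hypothetical names chosen for this example. With a pooled per-gene variance the boundary is linear; with class-specific variances it is curved.

```python
import numpy as np

def fit_dda(X, y, pooled=True):
    """Estimate per-gene means and variances for two classes labelled 0 and 1.

    X: (n_samples, n_genes) expression matrix; y: integer labels in {0, 1}.
    Hypothetical helper for illustration only.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=int)
    means = np.stack([X[y == k].mean(axis=0) for k in (0, 1)])
    if pooled:
        # One variance per gene shared by both classes (linear boundary).
        resid = X - means[y]
        var = (resid ** 2).sum(axis=0) / (len(y) - 2)
        variances = np.stack([var, var])
    else:
        # Separate variance per gene and class (curved boundary).
        variances = np.stack([X[y == k].var(axis=0, ddof=1) for k in (0, 1)])
    return means, variances

def predict_dda(X, means, variances):
    """Assign each sample to the class with the smaller diagonal Gaussian score."""
    X = np.asarray(X, dtype=float)
    scores = np.stack([
        (((X - means[k]) ** 2) / variances[k] + np.log(variances[k])).sum(axis=1)
        for k in (0, 1)
    ])
    return scores.argmin(axis=0)

# Usage on simulated data (assumed, not from the paper).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, size=(50, 5)),
               rng.normal(4.0, 1.0, size=(50, 5))])
y = np.array([0] * 50 + [1] * 50)
means, variances = fit_dda(X, y, pooled=True)
predicted = predict_dda(X, means, variances)
```

The score is the negative log-likelihood of independent normal genes up to a constant, so the log-variance term cancels between classes when the variance is pooled.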

RESULTS

A simple modification of diagonal discriminant analysis yields smooth, highly nonlinear classification boundaries, called Swirls, that sometimes outperform Ripples. In particular, if the data are normally distributed with different variances in each class, Swirls substantially outperforms Ripples when using a pooled variance to reduce the number of parameters. The proposed classification rule for two classes selects either Swirls or Ripples after parsimoniously selecting the number of genes and distance measures. Applications to five cancer microarray data sets identified predictive genes related to the tissue organization theory of carcinogenesis.

CONCLUSION

The parsimonious selection of classifiers coupled with the selection of either Swirls or Ripples provides a good basis for formulating a simple, yet flexible, classification rule. Open source software is available for download.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3685/2949887/abd2cda7ecf6/1471-2105-11-452-1.jpg
