• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于支持向量机的含噪子窗口排列分析在信息基因选择中的应用。

Noise incorporated subwindow permutation analysis for informative gene selection using support vector machines.

机构信息

Research Center of Modernization of Traditional Chinese Medicines, College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China.

出版信息

Analyst. 2011 Apr 7;136(7):1456-63. doi: 10.1039/c0an00667j. Epub 2011 Feb 14.

DOI:10.1039/c0an00667j
PMID:21321685
Abstract

Selecting a small subset of informative genes plays an important role in accurate prediction of clinical tumor samples. Based on model population analysis, a novel variable selection method, called noise incorporated subwindow permutation analysis (NISPA), is proposed in this study to work with support vector machines (SVMs). The essence of NISPA lies in the point that one noise variable is added into each sampled sub-dataset and then the distribution of variable importance of the added noise could be computed and serves as the common reference to evaluate the experimental variables. Further, by using the non-parametric Mann-Whitney U test, a P value can be assigned to each variable which describes to what extent the distributions of the gene variable and the noise variable are different. According to the computed P values, all the variables could be ranked and then a small subset of informative variables could be determined to build the model. Moreover, by NISPA, we are the first to distinguish the variables into a more detailed classification as informative, uninformative (noise) and interfering variables in comparison with other methods. In this study, two microarray datasets are employed to evaluate the performance of NISPA. The results show that the prediction errors of SVM classifiers could be significantly reduced by variable selection using NISPA. It is concluded that NISPA is a good alternative of variable selection algorithm.

摘要

选择一小部分信息丰富的基因对于准确预测临床肿瘤样本起着重要作用。基于模型人群分析,本研究提出了一种新的变量选择方法,称为噪声集成子窗口置换分析(NISPA),与支持向量机(SVM)一起使用。NISPA 的本质在于向每个采样子数据集中添加一个噪声变量,然后可以计算添加噪声的变量重要性分布,并将其作为评估实验变量的共同参考。此外,通过使用非参数曼-惠特尼 U 检验,可以为每个变量分配一个 P 值,该 P 值描述了基因变量和噪声变量的分布差异程度。根据计算出的 P 值,可以对所有变量进行排序,然后确定一小部分信息丰富的变量来构建模型。此外,通过 NISPA,我们是第一个将变量细分为更详细的分类,与其他方法相比,将变量分为信息丰富、无信息(噪声)和干扰变量。在这项研究中,使用了两个微阵列数据集来评估 NISPA 的性能。结果表明,通过使用 NISPA 进行变量选择,可以显著降低 SVM 分类器的预测误差。因此,可以得出结论,NISPA 是一种很好的变量选择算法的替代方法。

相似文献

1
Noise incorporated subwindow permutation analysis for informative gene selection using support vector machines.基于支持向量机的含噪子窗口排列分析在信息基因选择中的应用。
Analyst. 2011 Apr 7;136(7):1456-63. doi: 10.1039/c0an00667j. Epub 2011 Feb 14.
2
A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue.一种用于从癌组织基因表达数据中进行特征选择和规则提取的多核支持向量机方案。
Artif Intell Med. 2007 Oct;41(2):161-75. doi: 10.1016/j.artmed.2007.07.008. Epub 2007 Sep 11.
3
Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。
Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.
4
Gene selection using support vector machines with non-convex penalty.使用具有非凸惩罚项的支持向量机进行基因选择。
Bioinformatics. 2006 Jan 1;22(1):88-95. doi: 10.1093/bioinformatics/bti736. Epub 2005 Oct 25.
5
Gene extraction for cancer diagnosis by support vector machines--an improvement.用于癌症诊断的支持向量机基因提取——一项改进
Artif Intell Med. 2005 Sep-Oct;35(1-2):185-94. doi: 10.1016/j.artmed.2005.01.006.
6
Improving gene expression cancer molecular pattern discovery using nonnegative principal component analysis.使用非负主成分分析改进基因表达癌症分子模式发现
Genome Inform. 2008;21:200-11.
7
Genetic test bed for feature selection.用于特征选择的基因测试平台。
Bioinformatics. 2006 Apr 1;22(7):837-42. doi: 10.1093/bioinformatics/btl008. Epub 2006 Jan 20.
8
Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE.基于最大间隔准则的递归基因选择:与支持向量机递归特征消除法的比较
BMC Bioinformatics. 2006 Dec 25;7:543. doi: 10.1186/1471-2105-7-543.
9
CARSVM: a class association rule-based classification framework and its application to gene expression data.CARSVM:一种基于类关联规则的分类框架及其在基因表达数据中的应用。
Artif Intell Med. 2008 Sep;44(1):7-25. doi: 10.1016/j.artmed.2008.05.002. Epub 2008 Jun 30.
10
New variable selection method using interval segmentation purity with application to blockwise kernel transform support vector machine classification of high-dimensional microarray data.基于区间分割纯度的新变量选择方法及其在高维微阵列数据的分块核变换支持向量机分类中的应用
J Chem Inf Model. 2009 Aug;49(8):2002-9. doi: 10.1021/ci900032q.

引用本文的文献

1
Statistic and Network Features of RGB and Hyperspectral Imaging for Determination of Black Root Mold Infection in Apples.用于确定苹果黑根霉感染的RGB和高光谱成像的统计与网络特征
Foods. 2023 Apr 10;12(8):1608. doi: 10.3390/foods12081608.
2
An efficient variable selection method based on random frog for the multivariate calibration of NIR spectra.一种基于随机蛙跳算法的高效变量选择方法用于近红外光谱的多元校正
RSC Adv. 2020 Apr 23;10(28):16245-16253. doi: 10.1039/d0ra00922a.