• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用主成分的最优子集对高通量数据进行分类。

Classification for high-throughput data with an optimal subset of principal components.

作者信息

Song Joon Jin, Ren Yuan, Yan Fenglan

机构信息

Department of Mathematical Sciences, University of Arkansas, Fayetteville, AR 72701, USA.

出版信息

Comput Biol Chem. 2009 Oct;33(5):408-13. doi: 10.1016/j.compbiolchem.2009.07.017. Epub 2009 Aug 18.

DOI:10.1016/j.compbiolchem.2009.07.017
PMID:19748831
Abstract

High-throughput data have been widely used in biological and medical studies to discover gene and protein functions. Due to the high dimensionality, principal component analysis (PCA) is often involved for data dimension reduction. However, when a few principal components (PCs) are selected for dimension reduction or considered for dimension determination, they are typically ranked by their variances, eigenvalues. However, this approach is not always effective in subsequent multivariate analysis, particularly classification. To maximize information from data with a subset of the components, we apply a different ranking criterion, canonical variate criterion, which considers within- and between-group variance rather than total variance in the classical criterion. Four prevalent classification methods are considered and compared using leave-one-out cross-validation. These methods are illustrated with three real high-throughput data sets, two microarray data sets and a nuclear magnetic resonance spectra data set.

摘要

高通量数据已广泛应用于生物学和医学研究中,以发现基因和蛋白质的功能。由于数据的高维度性,主成分分析(PCA)经常被用于数据降维。然而,当选择少数主成分(PC)进行降维或用于维度确定时,它们通常是根据其方差(即特征值)进行排序的。然而,这种方法在随后的多变量分析中,尤其是分类分析中,并不总是有效。为了从数据的一个子集成分中最大化信息,我们应用了一种不同的排序标准,即典型变量标准,该标准考虑组内和组间方差,而不是经典标准中的总方差。使用留一法交叉验证来考虑和比较四种流行的分类方法。通过三个真实的高通量数据集(两个微阵列数据集和一个核磁共振光谱数据集)对这些方法进行了说明。

相似文献

1
Classification for high-throughput data with an optimal subset of principal components.利用主成分的最优子集对高通量数据进行分类。
Comput Biol Chem. 2009 Oct;33(5):408-13. doi: 10.1016/j.compbiolchem.2009.07.017. Epub 2009 Aug 18.
2
Robust PCA and classification in biosciences.生物科学中的鲁棒主成分分析与分类
Bioinformatics. 2004 Jul 22;20(11):1728-36. doi: 10.1093/bioinformatics/bth158. Epub 2004 Feb 26.
3
Partial least squares dimension reduction for microarray gene expression data with a censored response.具有删失响应的微阵列基因表达数据的偏最小二乘降维法
Math Biosci. 2005 Jan;193(1):119-37. doi: 10.1016/j.mbs.2004.10.007. Epub 2005 Jan 22.
4
A study of spectral integration and normalization in NMR-based metabonomic analyses.基于核磁共振的代谢组学分析中光谱积分与归一化的研究。
J Pharm Biomed Anal. 2005 Sep 15;39(3-4):830-6. doi: 10.1016/j.jpba.2005.05.012.
5
Enhanced classification for high-throughput data with an optimal projection and hybrid classifier.基于最优投影和混合分类器的高通量数据增强分类
Int J Data Min Bioinform. 2014;9(1):106-20. doi: 10.1504/ijdmb.2014.057783.
6
Improving gene expression cancer molecular pattern discovery using nonnegative principal component analysis.使用非负主成分分析改进基因表达癌症分子模式发现
Genome Inform. 2008;21:200-11.
7
Classification of array CGH data using smoothed logistic regression model.使用平滑逻辑回归模型对 array CGH 数据进行分类。
Stat Med. 2009 Dec 30;28(30):3798-810. doi: 10.1002/sim.3753.
8
Dimension reduction for classification with gene expression microarray data.利用基因表达微阵列数据进行分类的降维方法。
Stat Appl Genet Mol Biol. 2006;5:Article6. doi: 10.2202/1544-6115.1147. Epub 2006 Feb 24.
9
On using prototype reduction schemes to optimize kernel-based fisher discriminant analysis.关于使用原型约简方案优化基于核的Fisher判别分析
IEEE Trans Syst Man Cybern B Cybern. 2008 Apr;38(2):564-70. doi: 10.1109/TSMCB.2007.914446.
10
Block principal component analysis with application to gene microarray data classification.用于基因微阵列数据分类的块主成分分析
Stat Med. 2002 Nov 30;21(22):3465-74. doi: 10.1002/sim.1263.

引用本文的文献

1
A New Optical Sensor Based on Laser Speckle and Chemometrics for Precision Agriculture: Application to Sunflower Plant-Breeding.一种基于激光散斑和化学计量学的新型农业光学传感器:在向日葵植物育种中的应用。
Sensors (Basel). 2020 Aug 18;20(16):4652. doi: 10.3390/s20164652.