• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于收缩的对角判别分析及其在高维数据中的应用。

Shrinkage-based diagonal discriminant analysis and its applications in high-dimensional data.

作者信息

Pang Herbert, Tong Tiejun, Zhao Hongyu

机构信息

Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, North Carolina 27705, USA.

出版信息

Biometrics. 2009 Dec;65(4):1021-9. doi: 10.1111/j.1541-0420.2009.01200.x.

DOI:10.1111/j.1541-0420.2009.01200.x
PMID:19302409
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2794982/
Abstract

High-dimensional data such as microarrays have brought us new statistical challenges. For example, using a large number of genes to classify samples based on a small number of microarrays remains a difficult problem. Diagonal discriminant analysis, support vector machines, and k-nearest neighbor have been suggested as among the best methods for small sample size situations, but none was found to be superior to others. In this article, we propose an improved diagonal discriminant approach through shrinkage and regularization of the variances. The performance of our new approach along with the existing methods is studied through simulations and applications to real data. These studies show that the proposed shrinkage-based and regularization diagonal discriminant methods have lower misclassification rates than existing methods in many cases.

摘要

诸如微阵列之类的高维数据给我们带来了新的统计挑战。例如,基于少量微阵列使用大量基因对样本进行分类仍然是一个难题。对角判别分析、支持向量机和k近邻法被认为是小样本量情况下的最佳方法,但没有一种方法被发现优于其他方法。在本文中,我们通过对方差进行收缩和正则化提出了一种改进的对角判别方法。我们通过模拟和对实际数据的应用来研究我们新方法与现有方法的性能。这些研究表明,在许多情况下,所提出的基于收缩和正则化的对角判别方法比现有方法具有更低的误分类率。

相似文献

1
Shrinkage-based diagonal discriminant analysis and its applications in high-dimensional data.基于收缩的对角判别分析及其在高维数据中的应用。
Biometrics. 2009 Dec;65(4):1021-9. doi: 10.1111/j.1541-0420.2009.01200.x.
2
Improved mean estimation and its application to diagonal discriminant analysis.均值估计的改进及其在对角判别分析中的应用。
Bioinformatics. 2012 Feb 15;28(4):531-7. doi: 10.1093/bioinformatics/btr690. Epub 2011 Dec 14.
3
Regularized linear discriminant analysis and its application in microarrays.正则化线性判别分析及其在微阵列中的应用。
Biostatistics. 2007 Jan;8(1):86-100. doi: 10.1093/biostatistics/kxj035. Epub 2006 Apr 7.
4
Block-diagonal discriminant analysis and its bias-corrected rules.块对角判别分析及其偏差校正规则。
Stat Appl Genet Mol Biol. 2013 Jun;12(3):347-59. doi: 10.1515/sagmb-2012-0017.
5
Incorporating prior knowledge of gene functional groups into regularized discriminant analysis of microarray data.将基因功能组的先验知识纳入微阵列数据的正则化判别分析。
Bioinformatics. 2007 Dec 1;23(23):3170-7. doi: 10.1093/bioinformatics/btm488. Epub 2007 Oct 12.
6
Comparison of linear discriminant analysis methods for the classification of cancer based on gene expression data.基于基因表达数据的癌症分类的线性判别分析方法比较。
J Exp Clin Cancer Res. 2009 Dec 10;28(1):149. doi: 10.1186/1756-9966-28-149.
7
Gene expression data classification using consensus independent component analysis.使用一致性独立成分分析的基因表达数据分类
Genomics Proteomics Bioinformatics. 2008 Jun;6(2):74-82. doi: 10.1016/S1672-0229(08)60022-4.
8
Tumor classification by partial least squares using microarray gene expression data.利用微阵列基因表达数据通过偏最小二乘法进行肿瘤分类。
Bioinformatics. 2002 Jan;18(1):39-50. doi: 10.1093/bioinformatics/18.1.39.
9
Face recognition by regularized discriminant analysis.基于正则化判别分析的人脸识别。
IEEE Trans Syst Man Cybern B Cybern. 2007 Aug;37(4):1080-5. doi: 10.1109/tsmcb.2007.895363.
10
Feature weight estimation for gene selection: a local hyperlinear learning approach.特征权重估计在基因选择中的应用:一种局部超线性学习方法。
BMC Bioinformatics. 2014 Mar 14;15:70. doi: 10.1186/1471-2105-15-70.

引用本文的文献

1
Sparse Linear Discriminant Analysis using the Prior-Knowledge-Guided Block Covariance Matrix.使用先验知识引导的块协方差矩阵的稀疏线性判别分析
Chemometr Intell Lab Syst. 2020 Nov 15;206. doi: 10.1016/j.chemolab.2020.104142. Epub 2020 Aug 27.
2
Comparison of Variable Selection Methods for Time-to-Event Data in High-Dimensional Settings.高维环境下生存数据分析中变量选择方法的比较。
Comput Math Methods Med. 2020 Jul 1;2020:6795392. doi: 10.1155/2020/6795392. eCollection 2020.
3
Better-than-chance classification for signal detection.信号检测中优于随机概率的分类。
Biostatistics. 2021 Apr 10;22(2):365-380. doi: 10.1093/biostatistics/kxz035.
4
Robust estimation of the expected survival probabilities from high-dimensional Cox models with biomarker-by-treatment interactions in randomized clinical trials.在随机临床试验中,通过生物标志物与治疗的相互作用,从高维Cox模型中稳健估计预期生存概率。
BMC Med Res Methodol. 2017 May 22;17(1):83. doi: 10.1186/s12874-017-0354-0.
5
Nearest shrunken centroids via alternative genewise shrinkages.通过替代的基因特异性收缩实现最近收缩质心
PLoS One. 2017 Feb 15;12(2):e0171068. doi: 10.1371/journal.pone.0171068. eCollection 2017.
6
NBLDA: negative binomial linear discriminant analysis for RNA-Seq data.NBLDA:用于RNA测序数据的负二项式线性判别分析。
BMC Bioinformatics. 2016 Sep 13;17(1):369. doi: 10.1186/s12859-016-1208-1.
7
Boosting for high-dimensional two-class prediction.用于高维二类预测的提升算法。
BMC Bioinformatics. 2015 Sep 21;16:300. doi: 10.1186/s12859-015-0723-9.
8
Block-diagonal discriminant analysis and its bias-corrected rules.块对角判别分析及其偏差校正规则。
Stat Appl Genet Mol Biol. 2013 Jun;12(3):347-59. doi: 10.1515/sagmb-2012-0017.
9
Sample size considerations of prediction-validation methods in high-dimensional data for survival outcomes.高维生存数据中预测-验证方法的样本量考虑。
Genet Epidemiol. 2013 Apr;37(3):276-82. doi: 10.1002/gepi.21721. Epub 2013 Mar 7.
10
Improved shrunken centroid classifiers for high-dimensional class-imbalanced data.用于高维类不平衡数据的改进的收缩质心分类器。
BMC Bioinformatics. 2013 Feb 23;14:64. doi: 10.1186/1471-2105-14-64.

本文引用的文献

1
Meta-analysis of microarray results: challenges, opportunities, and recommendations for standardization.微阵列结果的荟萃分析:标准化面临的挑战、机遇及建议
Gene. 2007 Oct 15;401(1-2):12-8. doi: 10.1016/j.gene.2007.06.016. Epub 2007 Jul 3.
2
Meta-analysis of gene expression data: a predictor-based approach.基因表达数据的荟萃分析:一种基于预测因子的方法。
Bioinformatics. 2007 Jul 1;23(13):1599-606. doi: 10.1093/bioinformatics/btm149. Epub 2007 Apr 26.
3
Classification methods for the development of genomic signatures from high-dimensional data.用于从高维数据开发基因组特征的分类方法。
Genome Biol. 2006;7(12):R121. doi: 10.1186/gb-2006-7-12-r121.
4
Using uncorrelated discriminant analysis for tissue classification with gene expression data.使用非相关判别分析结合基因表达数据进行组织分类。
IEEE/ACM Trans Comput Biol Bioinform. 2004 Oct-Dec;1(4):181-90. doi: 10.1109/TCBB.2004.45.
5
Prognosis of stage II colon cancer by non-neoplastic mucosa gene expression profiling.通过非肿瘤性黏膜基因表达谱分析预测II期结肠癌的预后
Oncogene. 2007 Apr 19;26(18):2642-8. doi: 10.1038/sj.onc.1210060. Epub 2006 Oct 9.
6
Gene-expression signature of benign monoclonal gammopathy evident in multiple myeloma is linked to good prognosis.多发性骨髓瘤中明显的良性单克隆丙种球蛋白病的基因表达特征与良好预后相关。
Blood. 2007 Feb 15;109(4):1692-700. doi: 10.1182/blood-2006-07-037077. Epub 2006 Oct 5.
7
Gene expression signatures separate B-cell chronic lymphocytic leukaemia prognostic subgroups defined by ZAP-70 and CD38 expression status.基因表达特征可区分由ZAP-70和CD38表达状态定义的B细胞慢性淋巴细胞白血病预后亚组。
Leukemia. 2006 Oct;20(10):1774-82. doi: 10.1038/sj.leu.2404363. Epub 2006 Aug 17.
8
Next station in microarray data analysis: GEPAS.微阵列数据分析的下一个阶段:GEPAS。
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W486-91. doi: 10.1093/nar/gkl197.
9
Pathway analysis using random forests classification and regression.使用随机森林分类和回归的通路分析
Bioinformatics. 2006 Aug 15;22(16):2028-36. doi: 10.1093/bioinformatics/btl344. Epub 2006 Jun 29.
10
Independent component analysis-based penalized discriminant method for tumor classification using gene expression data.基于独立成分分析的惩罚判别方法用于利用基因表达数据进行肿瘤分类
Bioinformatics. 2006 Aug 1;22(15):1855-62. doi: 10.1093/bioinformatics/btl190. Epub 2006 May 18.