Weighted Distance Weighted Discrimination and Its Asymptotic Properties.

Authors

Qiao Xingye, Zhang Hao Helen, Liu Yufeng, Todd Michael J, Marron J S

Affiliation

Department of Statistics and Operations Research, University of North Carolina, Chapel Hill, NC 27599.

Publication information

J Am Stat Assoc. 2010 Mar 1;105(489):401-414. doi: 10.1198/jasa.2010.tm08487.

Abstract

While Distance Weighted Discrimination (DWD) is an appealing approach to classification in high dimensions, it was designed for balanced datasets. In the case of unequal costs, biased sampling, or unbalanced data, there are major improvements available, using appropriately weighted versions of DWD (wDWD). A major contribution of this paper is the development of optimal weighting schemes for various nonstandard classification problems. In addition, we discuss several alternative criteria and propose an adaptive weighting scheme (awDWD) and demonstrate its advantages over nonadaptive weighting schemes under some situations. The second major contribution is a theoretical study of weighted DWD. Both high-dimensional low sample-size asymptotics and Fisher consistency of DWD are studied. The performance of weighted DWD is evaluated using simulated examples and two real data examples. The theoretical results are also confirmed by simulations.
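To make the idea of class-weighted DWD concrete, here is a minimal Python sketch. It assumes the commonly stated form of the DWD loss as a function of the functional margin u = y f(x): V(u) = 2*sqrt(C) - C*u for u <= 1/sqrt(C), and 1/u otherwise. The function names `dwd_loss` and `weighted_dwd_risk`, the `class_weights` dictionary, and the example weights are hypothetical; the weights are supplied by the caller and are not the optimal weighting schemes derived in the paper. The unit-norm constraint on the direction vector is also left to the caller.

```python
import numpy as np


def dwd_loss(u, C=1.0):
    """DWD loss of the functional margin u = y * f(x) (assumed standard form).

    Linear for small margins, reciprocal for large ones; the two pieces
    meet continuously at u = 1 / sqrt(C).
    """
    u = np.asarray(u, dtype=float)
    threshold = 1.0 / np.sqrt(C)
    linear = 2.0 * np.sqrt(C) - C * u
    # Replace margins at or below the threshold by 1.0 so the reciprocal
    # branch never divides by zero where it is not used.
    safe_u = np.where(u > threshold, u, 1.0)
    return np.where(u > threshold, 1.0 / safe_u, linear)


def weighted_dwd_risk(X, y, w, b, class_weights, C=1.0):
    """Sum of class-weighted DWD losses for a linear rule f(x) = x @ w + b.

    class_weights maps the labels +1 and -1 to nonnegative weights
    (hypothetical interface; choose the weights to reflect costs or
    class proportions).
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    margins = y * (X @ np.asarray(w, dtype=float) + b)
    weights = np.where(y == 1, class_weights[1], class_weights[-1])
    return float(np.sum(weights * dwd_loss(margins, C)))


if __name__ == "__main__":
    X = [[2.0], [0.0]]
    y = [1, -1]
    # Up-weight the negative class, e.g. because it is rarer or costlier to miss.
    print(weighted_dwd_risk(X, y, w=[1.0], b=0.0, class_weights={1: 1.0, -1: 3.0}))
```

Setting both weights to 1 gives the unweighted DWD empirical loss; raising the weight of one class penalizes small or negative margins in that class more heavily, which pushes the fitted boundary away from it.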
