• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

加权距离加权判别及其渐近性质。

Weighted Distance Weighted Discrimination and Its Asymptotic Properties.

作者信息

Qiao Xingye, Zhang Hao Helen, Liu Yufeng, Todd Michael J, Marron J S

机构信息

Department of Statistics and Operations Research, University of North Carolina, Chapel Hill, NC 27599.

出版信息

J Am Stat Assoc. 2010 Mar 1;105(489):401-414. doi: 10.1198/jasa.2010.tm08487.

DOI:10.1198/jasa.2010.tm08487
PMID:21152360
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2996856/
Abstract

While Distance Weighted Discrimination (DWD) is an appealing approach to classification in high dimensions, it was designed for balanced datasets. In the case of unequal costs, biased sampling, or unbalanced data, there are major improvements available, using appropriately weighted versions of DWD (wDWD). A major contribution of this paper is the development of optimal weighting schemes for various nonstandard classification problems. In addition, we discuss several alternative criteria and propose an adaptive weighting scheme (awDWD) and demonstrate its advantages over nonadaptive weighting schemes under some situations. The second major contribution is a theoretical study of weighted DWD. Both high-dimensional low sample-size asymptotics and Fisher consistency of DWD are studied. The performance of weighted DWD is evaluated using simulated examples and two real data examples. The theoretical results are also confirmed by simulations.

摘要

虽然距离加权判别法(DWD)在高维分类中是一种很有吸引力的方法,但它是为平衡数据集设计的。在成本不平等、抽样有偏差或数据不平衡的情况下,可以使用适当加权的DWD版本(wDWD)进行重大改进。本文的一个主要贡献是为各种非标准分类问题开发了最优加权方案。此外,我们讨论了几种替代标准,并提出了一种自适应加权方案(awDWD),并在某些情况下证明了它相对于非自适应加权方案的优势。第二个主要贡献是对加权DWD的理论研究。研究了DWD的高维低样本量渐近性和Fisher一致性。使用模拟示例和两个实际数据示例评估了加权DWD的性能。模拟结果也证实了理论结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/290e/2996856/3aa70ae32b09/nihms250323f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/290e/2996856/fd697914a6be/nihms250323f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/290e/2996856/88fd8a7f4852/nihms250323f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/290e/2996856/848a81e02842/nihms250323f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/290e/2996856/172b82982ea9/nihms250323f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/290e/2996856/3aa70ae32b09/nihms250323f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/290e/2996856/fd697914a6be/nihms250323f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/290e/2996856/88fd8a7f4852/nihms250323f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/290e/2996856/848a81e02842/nihms250323f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/290e/2996856/172b82982ea9/nihms250323f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/290e/2996856/3aa70ae32b09/nihms250323f5.jpg

相似文献

1
Weighted Distance Weighted Discrimination and Its Asymptotic Properties.加权距离加权判别及其渐近性质。
J Am Stat Assoc. 2010 Mar 1;105(489):401-414. doi: 10.1198/jasa.2010.tm08487.
2
Sparse Multicategory Generalized Distance Weighted Discrimination in Ultra-High Dimensions.超高维稀疏多类别广义距离加权判别
Entropy (Basel). 2020 Nov 5;22(11):1257. doi: 10.3390/e22111257.
3
R/DWD: distance-weighted discrimination for classification, visualization and batch adjustment.R/DWD:用于分类、可视化和批量调整的距离加权判别。
Bioinformatics. 2012 Apr 15;28(8):1182-3. doi: 10.1093/bioinformatics/bts096. Epub 2012 Feb 24.
4
Bayesian Distance Weighted Discrimination.贝叶斯距离加权判别法
J Comput Graph Stat. 2022;31(4):1177-1188. doi: 10.1080/10618600.2022.2069778. Epub 2022 May 26.
5
Maximum Decentral Projection Margin Classifier for High Dimension and Low Sample Size problems.用于高维和小样本量问题的最大分散投影边际分类器
Neural Netw. 2023 Jan;157:147-159. doi: 10.1016/j.neunet.2022.10.017. Epub 2022 Oct 22.
6
Multiway sparse distance weighted discrimination.多路稀疏距离加权判别
J Comput Graph Stat. 2023;32(2):730-743. doi: 10.1080/10618600.2022.2099404. Epub 2022 Aug 30.
7
Discriminating sample groups with multi-way data.用多向数据区分样本组。
Biostatistics. 2017 Jul 1;18(3):434-450. doi: 10.1093/biostatistics/kxw057.
8
Bidirectional discrimination with application to data visualization.应用于数据可视化的双向判别
Biometrika. 2012 Dec;99(4):851-864. doi: 10.1093/biomet/ass029. Epub 2012 Jul 24.
9
Adaptive weighted learning for unbalanced multicategory classification.用于不平衡多类别分类的自适应加权学习
Biometrics. 2009 Mar;65(1):159-68. doi: 10.1111/j.1541-0420.2008.01017.x. Epub 2008 Mar 24.
10
Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case-control studies.在两阶段病例对照研究中,根据受试者工作特征曲线下面积评估和比较生物标志物。
Biostatistics. 2016 Jul;17(3):499-522. doi: 10.1093/biostatistics/kxw003. Epub 2016 Feb 16.

引用本文的文献

1
Prognostic value of residual disease (RD) biology and gene expression changes during the neoadjuvant treatment in patients with HER2-positive early breast cancer (EBC).HER2阳性早期乳腺癌(EBC)患者新辅助治疗期间残留病灶(RD)生物学及基因表达变化的预后价值
Ann Oncol. 2025 Apr;36(4):403-413. doi: 10.1016/j.annonc.2024.12.010. Epub 2024 Dec 18.
2
Image analysis-based identification of high risk ER-positive, HER2-negative breast cancers.基于图像分析的高危雌激素受体阳性、人表皮生长因子受体2阴性乳腺癌的识别
Breast Cancer Res. 2024 Dec 4;26(1):177. doi: 10.1186/s13058-024-01915-5.
3
Discovery of Plasma Lipids as Potential Biomarkers Distinguishing Breast Cancer Patients from Healthy Controls.发现血浆脂质可作为潜在的生物标志物,区分乳腺癌患者与健康对照者。
Int J Mol Sci. 2024 Oct 28;25(21):11559. doi: 10.3390/ijms252111559.
4
Comparison and development of cross-study normalization methods for inter-species transcriptional analysis.跨物种转录分析中跨研究标准化方法的比较与发展。
PLoS One. 2024 Sep 10;19(9):e0307997. doi: 10.1371/journal.pone.0307997. eCollection 2024.
5
Identification of autoantibodies as potential non-invasive biomarkers for intrahepatic cholangiocarcinoma.鉴定自身抗体作为肝内胆管癌潜在的非侵入性生物标志物。
Sci Rep. 2024 Aug 28;14(1):20012. doi: 10.1038/s41598-024-70595-0.
6
Measure of Strength of Evidence for Visually Observed Differences between Subpopulations.亚人群之间视觉观察到的差异的证据强度衡量。
J Comput Graph Stat. 2024;33(2):736-748. doi: 10.1080/10618600.2023.2276113. Epub 2023 Dec 26.
7
Uniformly shaped harmonization combines human transcriptomic data from different platforms while retaining their biological properties and differential gene expression patterns.形状一致的归一化整合了来自不同平台的人类转录组数据,同时保留其生物学特性和差异基因表达模式。
Front Mol Biosci. 2023 Sep 6;10:1237129. doi: 10.3389/fmolb.2023.1237129. eCollection 2023.
8
Multiway sparse distance weighted discrimination.多路稀疏距离加权判别
J Comput Graph Stat. 2023;32(2):730-743. doi: 10.1080/10618600.2022.2099404. Epub 2022 Aug 30.
9
An open-source solution for shape modeling and analysis of objects of challenging topologies.一种用于具有挑战性拓扑结构物体的形状建模与分析的开源解决方案。
Proc SPIE Int Soc Opt Eng. 2021 Feb;11600. doi: 10.1117/12.2579716. Epub 2021 Feb 15.
10
Prognostic and Predictive Value of Immune-Related Gene Expression Signatures vs Tumor-Infiltrating Lymphocytes in Early-Stage ERBB2/HER2-Positive Breast Cancer: A Correlative Analysis of the CALGB 40601 and PAMELA Trials.早期 ERBB2/HER2 阳性乳腺癌中免疫相关基因表达特征与肿瘤浸润淋巴细胞的预后和预测价值:CALGB 40601 和 PAMELA 试验的相关性分析。
JAMA Oncol. 2023 Apr 1;9(4):490-499. doi: 10.1001/jamaoncol.2022.6288.

本文引用的文献

1
High Dimensional Classification Using Features Annealed Independence Rules.使用特征退火独立规则的高维分类
Ann Stat. 2008;36(6):2605-2637. doi: 10.1214/07-AOS504.
2
Adaptive weighted learning for unbalanced multicategory classification.用于不平衡多类别分类的自适应加权学习
Biometrics. 2009 Mar;65(1):159-68. doi: 10.1111/j.1541-0420.2008.01017.x. Epub 2008 Mar 24.
3
A simple and efficient algorithm for gene selection using sparse logistic regression.一种使用稀疏逻辑回归进行基因选择的简单高效算法。
Bioinformatics. 2003 Nov 22;19(17):2246-53. doi: 10.1093/bioinformatics/btg308.
4
Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses.通过mRNA表达谱对人类肺癌进行分类揭示了不同的腺癌亚类。
Proc Natl Acad Sci U S A. 2001 Nov 20;98(24):13790-5. doi: 10.1073/pnas.191502998. Epub 2001 Nov 13.