• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

树分类的I型错误控制。

Type I error control for tree classification.

作者信息

Jung Sin-Ho, Chen Yong, Ahn Hongshik

机构信息

Department of Biostatistics and Bioinformatics, Duke University, Durham, NC 27710, USA.

Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794-3600, USA.

出版信息

Cancer Inform. 2014 Nov 16;13(Suppl 7):11-8. doi: 10.4137/CIN.S16342. eCollection 2014.

DOI:10.4137/CIN.S16342
PMID:25452689
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4237155/
Abstract

Binary tree classification has been useful for classifying the whole population based on the levels of outcome variable that is associated with chosen predictors. Often we start a classification with a large number of candidate predictors, and each predictor takes a number of different cutoff values. Because of these types of multiplicity, binary tree classification method is subject to severe type I error probability. Nonetheless, there have not been many publications to address this issue. In this paper, we propose a binary tree classification method to control the probability to accept a predictor below certain level, say 5%.

摘要

二叉树分类法已被用于根据与所选预测变量相关的结果变量水平对总体进行分类。通常,我们在进行分类时会有大量的候选预测变量,并且每个预测变量都有多个不同的截断值。由于存在这类多重性问题,二叉树分类方法容易出现严重的I型错误概率。尽管如此,针对这一问题的相关出版物并不多。在本文中,我们提出了一种二叉树分类方法,以控制接受某个预测变量的概率低于特定水平(如5%)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ff/4237155/988a17be2819/cin-suppl.7-2014-011f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ff/4237155/9602a59c2149/cin-suppl.7-2014-011f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ff/4237155/7a7cb0286409/cin-suppl.7-2014-011f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ff/4237155/f4724940978d/cin-suppl.7-2014-011f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ff/4237155/988a17be2819/cin-suppl.7-2014-011f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ff/4237155/9602a59c2149/cin-suppl.7-2014-011f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ff/4237155/7a7cb0286409/cin-suppl.7-2014-011f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ff/4237155/f4724940978d/cin-suppl.7-2014-011f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8ff/4237155/988a17be2819/cin-suppl.7-2014-011f4.jpg

相似文献

1
Type I error control for tree classification.树分类的I型错误控制。
Cancer Inform. 2014 Nov 16;13(Suppl 7):11-8. doi: 10.4137/CIN.S16342. eCollection 2014.
2
Multiclass cancer classification by using fuzzy support vector machine and binary decision tree with gene selection.利用模糊支持向量机和带有基因选择的二叉决策树进行多类别癌症分类。
J Biomed Biotechnol. 2005 Jun 30;2005(2):160-71. doi: 10.1155/JBB.2005.160.
3
Single-photon sources based on stepwise optimized binary-tree multiplexers.
Opt Express. 2024 May 6;32(10):17173-17188. doi: 10.1364/OE.516313.
4
Obtaining optimal cutoff values for tree classifiers using multiple biomarkers.使用多个生物标志物获取树分类器的最优截断值。
Biometrics. 2022 Mar;78(1):128-140. doi: 10.1111/biom.13409. Epub 2020 Dec 22.
5
Prediction trees with soft nodes for binary outcomes.
Stat Med. 2002 Apr 30;21(8):1145-65. doi: 10.1002/sim.1106.
6
Mobility Analysis of AmpuTees (MAAT 4): classification tree analysis for probability of lower limb prosthesis user functional potential.截肢者移动性分析(MAAT 4):下肢假肢使用者功能潜力概率的分类树分析
Disabil Rehabil Assist Technol. 2020 Feb;15(2):211-218. doi: 10.1080/17483107.2018.1555290. Epub 2019 Feb 11.
7
Neyman-Pearson classification algorithms and NP receiver operating characteristics.Neyman-Pearson 分类算法和 NP 接收机工作特性。
Sci Adv. 2018 Feb 2;4(2):eaao1659. doi: 10.1126/sciadv.aao1659. eCollection 2018 Feb.
8
Classification trees with neural network feature extraction.具有神经网络特征提取的分类树
IEEE Trans Neural Netw. 1992;3(6):923-33. doi: 10.1109/72.165594.
9
Semisupervised Multiclass Classification Problems With Scarcity of Labeled Data: A Theoretical Study.半监督多类分类问题与标签数据的稀缺:理论研究。
IEEE Trans Neural Netw Learn Syst. 2016 Dec;27(12):2602-2614. doi: 10.1109/TNNLS.2015.2498525. Epub 2015 Nov 23.
10
Variable selection for binary classification using error rate p-values applied to metabolomics data.将错误率p值应用于代谢组学数据进行二元分类的变量选择
BMC Bioinformatics. 2016 Jan 14;17:33. doi: 10.1186/s12859-015-0867-7.

引用本文的文献

1
K-Sample comparisons using propensity analysis.使用倾向分析进行K样本比较。
Biom J. 2019 May;61(3):698-713. doi: 10.1002/bimj.201800049. Epub 2019 Jan 7.

本文引用的文献

1
Bias correction for selecting the minimal-error classifier from many machine learning models.从众多机器学习模型中选择最小错误分类器的偏差校正。
Bioinformatics. 2014 Nov 15;30(22):3152-8. doi: 10.1093/bioinformatics/btu520. Epub 2014 Aug 1.
2
Correcting the optimal resampling-based error rate by estimating the error rate of wrapper algorithms.通过估计包装算法的错误率来校正基于最优重采样的错误率。
Biometrics. 2013 Sep;69(3):693-702. doi: 10.1111/biom.12041. Epub 2013 Jul 11.
3
Gene expression-based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study.
基于基因表达的肺腺癌生存预测:一项多中心、盲法验证研究。
Nat Med. 2008 Aug;14(8):822-7. doi: 10.1038/nm.1790. Epub 2008 Jul 20.
4
Short interspersed transposable elements (SINEs) are excluded from imprinted regions in the human genome.短散在转座元件(SINEs)被排除在人类基因组的印记区域之外。
Proc Natl Acad Sci U S A. 2002 Jan 8;99(1):327-32. doi: 10.1073/pnas.012539199. Epub 2001 Dec 26.
5
Genomic imprinting: parental influence on the genome.基因组印记:亲本对基因组的影响。
Nat Rev Genet. 2001 Jan;2(1):21-32. doi: 10.1038/35047554.