• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Probabilistic classifiers with high-dimensional data.高维数据的概率分类器。
Biostatistics. 2011 Jul;12(3):399-412. doi: 10.1093/biostatistics/kxq069. Epub 2010 Nov 17.
2
On the statistical assessment of classifiers using DNA microarray data.关于使用DNA微阵列数据对分类器进行统计评估
BMC Bioinformatics. 2006 Aug 19;7:387. doi: 10.1186/1471-2105-7-387.
3
A combinational feature selection and ensemble neural network method for classification of gene expression data.一种用于基因表达数据分类的组合特征选择与集成神经网络方法。
BMC Bioinformatics. 2004 Sep 27;5:136. doi: 10.1186/1471-2105-5-136.
4
Sample size planning for developing classifiers using high-dimensional DNA microarray data.使用高维DNA微阵列数据开发分类器的样本量规划
Biostatistics. 2007 Jan;8(1):101-17. doi: 10.1093/biostatistics/kxj036. Epub 2006 Apr 13.
5
Pattern classification with class probability output network.具有类概率输出网络的模式分类。
IEEE Trans Neural Netw. 2009 Oct;20(10):1659-73. doi: 10.1109/TNN.2009.2029103. Epub 2009 Sep 18.
6
Corrected small-sample estimation of the Bayes error.
Bioinformatics. 2003 May 22;19(8):944-51. doi: 10.1093/bioinformatics/btg144.
7
A hierarchical Naïve Bayes Model for handling sample heterogeneity in classification problems: an application to tissue microarrays.一种用于处理分类问题中样本异质性的分层朴素贝叶斯模型:在组织微阵列中的应用。
BMC Bioinformatics. 2006 Nov 24;7:514. doi: 10.1186/1471-2105-7-514.
8
Bayesian variable selection for disease classification using gene expression data.基于基因表达数据的疾病分类的贝叶斯变量选择。
Bioinformatics. 2010 Jan 15;26(2):215-22. doi: 10.1093/bioinformatics/btp638. Epub 2009 Nov 17.
9
A multi-class predictor based on a probabilistic model: application to gene expression profiling-based diagnosis of thyroid tumors.基于概率模型的多分类预测器:应用于基于基因表达谱的甲状腺肿瘤诊断
BMC Genomics. 2006 Jul 27;7:190. doi: 10.1186/1471-2164-7-190.
10
Multiclass cancer classification using gene expression profiling and probabilistic neural networks.使用基因表达谱和概率神经网络进行多类癌症分类。
Pac Symp Biocomput. 2003:5-16.

引用本文的文献

1
A gene signature that distinguishes conventional and leukemic nonnodal mantle cell lymphoma helps predict outcome.一个能够区分常规型和白血病性非结节性套细胞淋巴瘤的基因特征有助于预测预后。
Blood. 2018 Jul 26;132(4):413-422. doi: 10.1182/blood-2018-03-838136. Epub 2018 May 16.
2
Assessing rejection-related disease in kidney transplant biopsies based on archetypal analysis of molecular phenotypes.基于分子表型的原型分析评估肾移植活检中与排斥相关的疾病。
JCI Insight. 2017 Jun 15;2(12). doi: 10.1172/jci.insight.94197.
3
Transcriptome assists prognosis of disease severity in respiratory syncytial virus infected infants.转录组有助于预测呼吸道合胞病毒感染婴儿的疾病严重程度。
Sci Rep. 2016 Nov 11;6:36603. doi: 10.1038/srep36603.
4
Predicting Progression from Mild Cognitive Impairment to Alzheimer's Dementia Using Clinical, MRI, and Plasma Biomarkers via Probabilistic Pattern Classification.通过概率模式分类,利用临床、MRI和血浆生物标志物预测从轻度认知障碍到阿尔茨海默病痴呆的进展。
PLoS One. 2016 Feb 22;11(2):e0138866. doi: 10.1371/journal.pone.0138866. eCollection 2016.
5
Factors affecting the accuracy of a class prediction model in gene expression data.影响基因表达数据中类别预测模型准确性的因素。
BMC Bioinformatics. 2015 Jun 21;16:199. doi: 10.1186/s12859-015-0610-4.
6
Optimally splitting cases for training and testing high dimensional classifiers.最优地拆分高维分类器的训练和测试案例。
BMC Med Genomics. 2011 Apr 8;4:31. doi: 10.1186/1755-8794-4-31.

本文引用的文献

1
Regularization Paths for Generalized Linear Models via Coordinate Descent.基于坐标下降法的广义线性模型正则化路径
J Stat Softw. 2010;33(1):1-22.
2
A comparison of univariate and multivariate gene selection techniques for classification of cancer datasets.用于癌症数据集分类的单变量和多变量基因选择技术比较。
BMC Bioinformatics. 2006 May 2;7:235. doi: 10.1186/1471-2105-7-235.
3
A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics.一种用于大规模协方差矩阵估计的收缩方法及其对功能基因组学的影响。
Stat Appl Genet Mol Biol. 2005;4:Article32. doi: 10.2202/1544-6115.1175. Epub 2005 Nov 14.
4
Prediction error estimation: a comparison of resampling methods.预测误差估计:重采样方法的比较
Bioinformatics. 2005 Aug 1;21(15):3301-7. doi: 10.1093/bioinformatics/bti499. Epub 2005 May 19.
5
A protocol for building and evaluating predictors of disease state based on microarray data.一种基于微阵列数据构建和评估疾病状态预测指标的方案。
Bioinformatics. 2005 Oct 1;21(19):3755-62. doi: 10.1093/bioinformatics/bti429. Epub 2005 Apr 7.
6
BagBoosting for tumor classification with gene expression data.用于基于基因表达数据的肿瘤分类的BagBoosting算法
Bioinformatics. 2004 Dec 12;20(18):3583-93. doi: 10.1093/bioinformatics/bth447. Epub 2004 Oct 5.
7
A gene expression-based method to diagnose clinically distinct subgroups of diffuse large B cell lymphoma.一种基于基因表达的方法来诊断弥漫性大B细胞淋巴瘤的临床不同亚组。
Proc Natl Acad Sci U S A. 2003 Aug 19;100(17):9991-6. doi: 10.1073/pnas.1732008100. Epub 2003 Aug 4.
8
Diagnosis of multiple cancer types by shrunken centroids of gene expression.通过基因表达的收缩质心诊断多种癌症类型。
Proc Natl Acad Sci U S A. 2002 May 14;99(10):6567-72. doi: 10.1073/pnas.082099299.

高维数据的概率分类器。

Probabilistic classifiers with high-dimensional data.

机构信息

Biometric Research Branch, National Cancer Institute, 9000 Rockville Pike, MSC 7434, Bethesda, MD 20892-7434, USA.

出版信息

Biostatistics. 2011 Jul;12(3):399-412. doi: 10.1093/biostatistics/kxq069. Epub 2010 Nov 17.

DOI:10.1093/biostatistics/kxq069
PMID:21087946
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3138069/
Abstract

For medical classification problems, it is often desirable to have a probability associated with each class. Probabilistic classifiers have received relatively little attention for small n large p classification problems despite of their importance in medical decision making. In this paper, we introduce 2 criteria for assessment of probabilistic classifiers: well-calibratedness and refinement and develop corresponding evaluation measures. We evaluated several published high-dimensional probabilistic classifiers and developed 2 extensions of the Bayesian compound covariate classifier. Based on simulation studies and analysis of gene expression microarray data, we found that proper probabilistic classification is more difficult than deterministic classification. It is important to ensure that a probabilistic classifier is well calibrated or at least not "anticonservative" using the methods developed here. We provide this evaluation for several probabilistic classifiers and also evaluate their refinement as a function of sample size under weak and strong signal conditions. We also present a cross-validation method for evaluating the calibration and refinement of any probabilistic classifier on any data set.

摘要

对于医学分类问题,通常希望为每个类别关联一个概率。尽管在医学决策中具有重要意义,但概率分类器在小 n 大 p 分类问题中并没有受到太多关注。在本文中,我们引入了两个用于评估概率分类器的标准:校准良好和细化,并开发了相应的评估指标。我们评估了几种已发表的高维概率分类器,并开发了贝叶斯复合协变量分类器的两种扩展。基于模拟研究和基因表达微阵列数据分析,我们发现正确的概率分类比确定性分类更困难。使用这里开发的方法确保概率分类器校准良好或至少不“保守”非常重要。我们为几种概率分类器提供了这种评估,并还评估了它们在弱信号和强信号条件下随样本量的细化情况。我们还提出了一种交叉验证方法,用于评估任何概率分类器在任何数据集上的校准和细化情况。