• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将微阵列分类到最近的质心。

Classification of microarrays to nearest centroids.

作者信息

Dabney Alan R

机构信息

Department of Biostatistics, University of Washington, Seattle, 98195, USA.

出版信息

Bioinformatics. 2005 Nov 15;21(22):4148-54. doi: 10.1093/bioinformatics/bti681. Epub 2005 Sep 20.

DOI:10.1093/bioinformatics/bti681
PMID:16174683
Abstract

MOTIVATION

Classification of biological samples by microarrays is a topic of much interest. A number of methods have been proposed and successfully applied to this problem. It has recently been shown that classification by nearest centroids provides an accurate predictor that may outperform much more complicated methods. The 'Prediction Analysis of Microarrays' (PAM) approach is one such example, which the authors strongly motivate by its simplicity and interpretability. In this spirit, I seek to assess the performance of classifiers simpler than even PAM.

RESULTS

I surprisingly show that the modified t-statistics and shrunken centroids employed by PAM tend to increase misclassification error when compared with their simpler counterparts. Based on these observations, I propose a classification method called 'Classification to Nearest Centroids' (ClaNC). ClaNC ranks genes by standard t-statistics, does not shrink centroids and uses a class-specific gene-selection procedure. Because of these modifications, ClaNC is arguably simpler and easier to interpret than PAM, and it can be viewed as a traditional nearest centroid classifier that uses specially selected genes. I demonstrate that ClaNC error rates tend to be significantly less than those for PAM, for a given number of active genes.

AVAILABILITY

Point-and-click software is freely available at http://students.washington.edu/adabney/clanc.

摘要

动机

利用微阵列对生物样本进行分类是一个备受关注的课题。已经提出了许多方法并成功应用于该问题。最近有研究表明,最近质心分类法能提供一个准确的预测器,其性能可能优于更为复杂的方法。“微阵列预测分析”(PAM)方法就是这样一个例子,作者因其简单性和可解释性而大力推崇。本着这种精神,我试图评估比PAM甚至更简单的分类器的性能。

结果

我令人惊讶地发现,与更简单的对应方法相比,PAM所采用的修正t统计量和收缩质心往往会增加误分类误差。基于这些观察结果,我提出了一种名为“最近质心分类法”(ClaNC)的分类方法。ClaNC通过标准t统计量对基因进行排序,不收缩质心,并使用特定类别的基因选择程序。由于这些改进,ClaNC可以说是比PAM更简单且更易于解释,并且它可以被视为一种使用特别选择基因的传统最近质心分类器。我证明,对于给定数量的活跃基因,ClaNC的错误率往往显著低于PAM的错误率。

可用性

可通过点击式软件免费获取,网址为http://students.washington.edu/adabney/clanc 。

相似文献

1
Classification of microarrays to nearest centroids.将微阵列分类到最近的质心。
Bioinformatics. 2005 Nov 15;21(22):4148-54. doi: 10.1093/bioinformatics/bti681. Epub 2005 Sep 20.
2
ClaNC: point-and-click software for classifying microarrays to nearest centroids.ClaNC:用于将微阵列分类到最近质心的点击式软件。
Bioinformatics. 2006 Jan 1;22(1):122-3. doi: 10.1093/bioinformatics/bti756. Epub 2005 Nov 2.
3
Improved centroids estimation for the nearest shrunken centroid classifier.改进最近收缩质心分类器的质心估计
Bioinformatics. 2007 Apr 15;23(8):972-9. doi: 10.1093/bioinformatics/btm046. Epub 2007 Mar 24.
4
Bias in error estimation when using cross-validation for model selection.在使用交叉验证进行模型选择时误差估计中的偏差。
BMC Bioinformatics. 2006 Feb 23;7:91. doi: 10.1186/1471-2105-7-91.
5
Optimal approach for classification of acute leukemia subtypes based on gene expression data.基于基因表达数据的急性白血病亚型分类的优化方法。
Biotechnol Prog. 2002 Jul-Aug;18(4):847-54. doi: 10.1021/bp025517o.
6
Robust classification modeling on microarray data using misclassification penalized posterior.使用误分类惩罚后验对微阵列数据进行稳健分类建模。
Bioinformatics. 2005 Jun;21 Suppl 1:i423-30. doi: 10.1093/bioinformatics/bti1020.
7
Regularized linear discriminant analysis and its application in microarrays.正则化线性判别分析及其在微阵列中的应用。
Biostatistics. 2007 Jan;8(1):86-100. doi: 10.1093/biostatistics/kxj035. Epub 2006 Apr 7.
8
Classification of microarray data with factor mixture models.基于因子混合模型的微阵列数据分类
Bioinformatics. 2006 Jan 15;22(2):202-8. doi: 10.1093/bioinformatics/bti779. Epub 2005 Nov 15.
9
Classification based upon gene expression data: bias and precision of error rates.基于基因表达数据的分类:错误率的偏差与精度
Bioinformatics. 2007 Jun 1;23(11):1363-70. doi: 10.1093/bioinformatics/btm117. Epub 2007 Mar 28.
10
Differential gene expression detection and sample classification using penalized linear regression models.使用惩罚线性回归模型进行差异基因表达检测和样本分类。
Bioinformatics. 2006 Feb 15;22(4):472-6. doi: 10.1093/bioinformatics/bti827. Epub 2005 Dec 13.

引用本文的文献

1
Peripheral blood biomarkers predict viral rebound following antiretroviral therapy discontinuation in SIV-infected, early ART-treated rhesus macaques.外周血生物标志物可预测 SIV 感染、早期 ART 治疗的恒河猴中断抗逆转录病毒治疗后的病毒反弹。
Cell Rep Med. 2023 Jul 18;4(7):101122. doi: 10.1016/j.xcrm.2023.101122.
2
Association of Antifolate Response Signature Status and Clinical Activity of Pemetrexed-Platinum Chemotherapy in Non-Small Cell Lung Cancer: The Piedmont Study.抗叶酸药物反应特征状态与培美曲塞-铂类化疗在非小细胞肺癌中临床活性的相关性:皮埃蒙特研究。
Clin Cancer Res. 2023 Aug 15;29(16):3203-3213. doi: 10.1158/1078-0432.CCR-22-2558.
3
Incorporating RNA-based Risk Scores for Genomic Instability to Predict Breast Cancer Recurrence and Immunogenicity in a Diverse Population.
将基于 RNA 的基因组不稳定性风险评分纳入预测不同人群乳腺癌复发和免疫原性的模型。
Cancer Res Commun. 2023 Jan 5;3(1):12-20. doi: 10.1158/2767-9764.CRC-22-0267. eCollection 2023 Jan.
4
Distinct Predictive Immunogenomic Profiles of Response to Immune Checkpoint Inhibitors and IL2: A Real-world Evidence Study of Patients with Advanced Renal Cancer.免疫检查点抑制剂和 IL2 反应的预测性免疫基因组特征:晚期肾癌患者的真实世界证据研究。
Cancer Res Commun. 2022 Aug 30;2(8):894-903. doi: 10.1158/2767-9764.CRC-21-0153. eCollection 2022 Aug.
5
RNA-Based Classification of Homologous Recombination Deficiency in Racially Diverse Patients with Breast Cancer.基于 RNA 的乳腺癌种族多样化患者同源重组缺陷分类。
Cancer Epidemiol Biomarkers Prev. 2022 Dec 5;31(12):2136-2147. doi: 10.1158/1055-9965.EPI-22-0590.
6
Reassessment of Reliability and Reproducibility for Triple-Negative Breast Cancer Subtyping.三阴性乳腺癌亚型分类的可靠性和可重复性的重新评估
Cancers (Basel). 2022 May 24;14(11):2571. doi: 10.3390/cancers14112571.
7
A Series of Genes for Predicting Responses to Anti-Tumor Necrosis Factor α Therapy in Crohn's Disease.用于预测克罗恩病患者对抗肿瘤坏死因子α治疗反应的一系列基因
Front Pharmacol. 2022 Apr 20;13:870796. doi: 10.3389/fphar.2022.870796. eCollection 2022.
8
The Landscape of Immune Microenvironments in Racially Diverse Breast Cancer Patients.不同种族乳腺癌患者的免疫微环境全景图。
Cancer Epidemiol Biomarkers Prev. 2022 Jul 1;31(7):1341-1350. doi: 10.1158/1055-9965.EPI-21-1312.
9
Identification of a unique tumor cell subset employing myeloid transcriptional circuits to create an immunomodulatory microenvironment in glioblastoma.利用髓系转录回路鉴定胶质母细胞瘤中具有独特免疫调节微环境的肿瘤细胞亚群。
Oncoimmunology. 2022 Jan 26;11(1):2030020. doi: 10.1080/2162402X.2022.2030020. eCollection 2022.
10
Genomic characterization of rare molecular subclasses of hepatocellular carcinoma.肝细胞癌罕见分子亚型的基因组特征。
Commun Biol. 2021 Oct 4;4(1):1150. doi: 10.1038/s42003-021-02674-1.