将微阵列数据转化为结直肠癌临床相关诊断信息的统计方法。

Statistical methods of translating microarray data into clinically relevant diagnostic information in colorectal cancer.

作者信息

Kim Byung Soo, Kim Inyoung, Lee Sunho, Kim Sangcheol, Rha Sun Young, Chung Hyun Cheol

机构信息

Department of Applied Statistics, College of Medicine, Yonsei University Seoul, South Korea.

出版信息

Bioinformatics. 2005 Feb 15;21(4):517-28. doi: 10.1093/bioinformatics/bti029. Epub 2004 Sep 16.

DOI:10.1093/bioinformatics/bti029

PMID:15374865

Abstract

MOTIVATION

It is a common practice in cancer microarray experiments that a normal tissue is collected from the same individual from whom the tumor tissue was taken. The indirect design is usually adopted for the experiment that uses a common reference RNA hybridized both to normal and tumor tissues. However, it is often the case that the test material is not large enough for the experimenter to extract enough RNA to conduct the microarray experiment. Hence, collecting n cases does not necessarily end up with a matched pair sample of size n. Instead we usually have a matched pair sample of size n1, and two independent samples of sizes n2 and n3, respectively, for 'reference versus normal tissue only' and 'reference versus tumor tissue only' hybridizations (n=n1 + n2 + n3). Standard statistical methods need to be modified and new statistical procedures are developed for analyzing this mixed dataset.

RESULTS

We propose a new test statistic, t3, as a means of combining all the information in the mixed dataset for detecting differentially expressed (DE) genes between normal and tumor tissues. We employed the extended receiver operating characteristic approach to the mixed dataset. We devised a measure of disagreement between a RT-PCR experiment and a microarray experiment. Hotelling's T2 statistic is employed to detect a set of DE genes and its prediction rate is compared with the prediction rate of a univariate procedure. We observe that Hotelling's T2 statistic detects DE genes more efficiently than a univariate procedure and that further research is warranted on the formal test procedure using Hotelling's T2 statistic.

CONTACT

bskim@yonsei.ac.kr.

摘要

动机

在癌症微阵列实验中，从采集肿瘤组织的同一个体采集正常组织是一种常见做法。对于使用与正常组织和肿瘤组织均杂交的共同参考RNA的实验，通常采用间接设计。然而，实验材料往往不够大，实验者无法提取足够的RNA来进行微阵列实验。因此，收集n个病例不一定最终得到大小为n的匹配对样本。相反，我们通常有一个大小为n1的匹配对样本，以及分别用于“仅参考与正常组织”和“仅参考与肿瘤组织”杂交的大小为n2和n3的两个独立样本（n = n1 + n2 + n3）。需要对标准统计方法进行修改，并开发新的统计程序来分析这个混合数据集。

结果

我们提出了一种新的检验统计量t3，作为一种合并混合数据集中所有信息以检测正常组织和肿瘤组织之间差异表达（DE）基因的方法。我们对混合数据集采用了扩展的接收者操作特征方法。我们设计了一种衡量逆转录聚合酶链反应实验和微阵列实验之间不一致性的方法。采用霍特林T2统计量来检测一组DE基因，并将其预测率与单变量程序的预测率进行比较。我们观察到，霍特林T2统计量比单变量程序更有效地检测DE基因，并且有必要对使用霍特林T2统计量的正式检验程序进行进一步研究。

联系方式

bskim@yonsei.ac.kr

相似文献

Statistical methods of translating microarray data into clinically relevant diagnostic information in colorectal cancer.

Bioinformatics. 2005 Feb 15;21(4):517-28. doi: 10.1093/bioinformatics/bti029. Epub 2004 Sep 16.

A semiparametric approach for marker gene selection based on gene expression data.

Bioinformatics. 2005 Feb 15;21(4):529-36. doi: 10.1093/bioinformatics/bti032. Epub 2004 Sep 16.

Empirical Bayes screening of many p-values with applications to microarray studies.

Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2.

Detection of differentially expressed gene sets in a partially paired microarray data set.

Stat Appl Genet Mol Biol. 2012 Feb 15;11(3):Article 5. doi: 10.1515/1544-6115.1610.

A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis.

Bioinformatics. 2005 Mar 1;21(5):631-43. doi: 10.1093/bioinformatics/bti033. Epub 2004 Sep 16.

Empirical study of supervised gene screening.

BMC Bioinformatics. 2006 Dec 18;7:537. doi: 10.1186/1471-2105-7-537.

RankGene: identification of diagnostic genes based on expression data.

Bioinformatics. 2003 Aug 12;19(12):1578-9. doi: 10.1093/bioinformatics/btg179.

Hotelling's T2 multivariate profiling for detecting differential expression in microarrays.

Bioinformatics. 2005 Jul 15;21(14):3105-13. doi: 10.1093/bioinformatics/bti496. Epub 2005 May 19.

Regularized binormal ROC method in disease classification using microarray data.

BMC Bioinformatics. 2006 May 9;7:253. doi: 10.1186/1471-2105-7-253.

Classification using partial least squares with penalized logistic regression.

Bioinformatics. 2005 Apr 1;21(7):1104-11. doi: 10.1093/bioinformatics/bti114. Epub 2004 Nov 5.

引用本文的文献

Optimal weighted two-sample -test with partially paired data in a unified framework.

J Appl Stat. 2020 Apr 20;48(6):961-976. doi: 10.1080/02664763.2020.1753027. eCollection 2021.

Analyzing partially paired data: when can the unpaired portion(s) be safely ignored?

J Appl Stat. 2020 Dec 23;49(6):1402-1420. doi: 10.1080/02664763.2020.1864813. eCollection 2022.

Propensity score method for partially matched omics studies.

Cancer Inform. 2014 Oct 29;13(Suppl 7):1-10. doi: 10.4137/CIN.S16352. eCollection 2014.

A simple and robust method for partially matched samples using the p-values pooling approach.

Stat Med. 2013 Aug 30;32(19):3247-59. doi: 10.1002/sim.5758. Epub 2013 Feb 17.

Analysis of high dimensional data using pre-defined set and subset information, with applications to genomic data.

BMC Bioinformatics. 2012 Jul 24;13:177. doi: 10.1186/1471-2105-13-177.

Concordant release of glycolysis proteins into the plasma preceding a diagnosis of ER+ breast cancer.

Cancer Res. 2012 Apr 15;72(8):1935-42. doi: 10.1158/0008-5472.CAN-11-3266. Epub 2012 Feb 24.

Improving the prediction accuracy in classification using the combined data sets by ranks of gene expressions.

BMC Bioinformatics. 2008 Jun 16;9:283. doi: 10.1186/1471-2105-9-283.

A multivariate approach for integrating genome-wide expression data and biological knowledge.

Bioinformatics. 2006 Oct 1;22(19):2373-80. doi: 10.1093/bioinformatics/btl401. Epub 2006 Jul 28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

将微阵列数据转化为结直肠癌临床相关诊断信息的统计方法。

Statistical methods of translating microarray data into clinically relevant diagnostic information in colorectal cancer.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

CONTACT

动机

结果

联系方式

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献