Statistical challenges in preprocessing in microarray experiments in cancer.

Authors

Owzar Kouros, Barry William T, Jung Sin-Ho, Sohn Insuk, George Stephen L

Affiliation

Department of Biostatistics and Bioinformatics, and Cancer and Leukemia Group B Statistical Center, Duke University School of Medicine, 2424 Erwin Road, Durham, NC 27705, USA.

Publication information

Clin Cancer Res. 2008 Oct 1;14(19):5959-66. doi: 10.1158/1078-0432.CCR-07-4532.

DOI:10.1158/1078-0432.CCR-07-4532

PMID:18829474

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3529914/

Abstract

Many clinical studies incorporate genomic experiments to investigate the potential associations between high-dimensional molecular data and clinical outcome. A critical first step in the statistical analyses of these experiments is that the molecular data are preprocessed. This article provides an overview of preprocessing methods, including summary algorithms and quality control metrics for microarrays. Some of the ramifications and effects that preprocessing methods have on the statistical results are illustrated. The discussions are centered around a microarray experiment based on lung cancer tumor samples with survival as the clinical outcome of interest. The procedures that are presented focus on the array platform used in this study. However, many of these issues are more general and are applicable to other instruments for genome-wide investigation. The discussions here will provide insight into the statistical challenges in preprocessing microarrays used in clinical studies of cancer. These challenges should not be viewed as inconsequential nuisances but rather as important issues that need to be addressed so that informed conclusions can be drawn.
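To make the idea of a preprocessing step concrete, here is a minimal sketch of quantile normalization, one widely used way of putting several arrays on a common intensity scale. The abstract does not say which methods the article evaluates, so this should be read as an illustration only, not as the authors' procedure. The function name `quantile_normalize` and the intensity values are hypothetical, and ties are broken by position, which is a simplification.

```python
from statistics import mean


def quantile_normalize(arrays):
    """Force every array (a list of probe intensities) onto one distribution.

    arrays: list of equal-length lists, one list per microarray.
    Returns new lists; the inputs are not modified.
    Ties are broken by position, which is a simplification.
    """
    n_probes = len(arrays[0])
    if any(len(a) != n_probes for a in arrays):
        raise ValueError("all arrays must have the same number of probes")

    # For each array, probe indices ordered from lowest to highest intensity.
    orders = [sorted(range(n_probes), key=a.__getitem__) for a in arrays]

    # Reference distribution: mean of the k-th smallest value across arrays.
    reference = [
        mean(a[order[k]] for a, order in zip(arrays, orders))
        for k in range(n_probes)
    ]

    # Give each probe the reference value matching its within-array rank.
    normalized = []
    for order in orders:
        out = [0.0] * n_probes
        for rank, probe in enumerate(order):
            out[probe] = reference[rank]
        normalized.append(out)
    return normalized


# Hypothetical intensities: three arrays, four probes each.
raw = [
    [5.0, 2.0, 3.0, 4.0],
    [4.0, 1.0, 4.5, 2.0],
    [3.0, 4.0, 6.0, 8.0],
]
normalized = quantile_normalize(raw)
```

After this step every array contains the same set of values, and only the ranking of probes within each array is preserved. That is one way a preprocessing choice can change the data that a downstream survival analysis sees.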





