Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data.

Author information

Chen James J, Hsueh Huey-Miin, Delongchamp Robert R, Lin Chien-Ju, Tsai Chen-An

Affiliation

Division of Personalized Nutrition and Medicine, National Center for Toxicological Research, Food and Drug Administration, Jefferson, Arkansas 72079, USA.

Publication information

BMC Bioinformatics. 2007 Oct 25;8:412. doi: 10.1186/1471-2105-8-412.

DOI:10.1186/1471-2105-8-412

PMID:17961233

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2204045/

Abstract

BACKGROUND

Many researchers are concerned with the comparability and reliability of microarray gene expression data. The recent completion of the MicroArray Quality Control (MAQC) project provides a unique opportunity to assess reproducibility across multiple sites and comparability across multiple platforms. The analysis that MAQC presented to support its conclusion of inter- and intra-platform comparability/reproducibility of microarray gene expression measurements is inadequate. We evaluate the reproducibility/comparability of the MAQC data for 12901 common genes in four titration samples generated from five high-density one-color microarray platforms and the TaqMan technology. We discuss some of the problems with using the correlation coefficient as a metric to evaluate inter- and intra-platform reproducibility, and with MAQC's use of the percent of overlapping genes (POG) as a measure for evaluating a gene selection procedure.
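One way to see why a correlation coefficient can be a weak reproducibility metric is a small numerical sketch. The values below are hypothetical, not MAQC data: platform Y is simply assumed to report log2 ratios compressed to 30% of platform X's values. The `pearson` helper is a plain textbook implementation written for this illustration.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical log2 ratios between two samples for six genes on platform X.
log2_ratio_x = [-3.0, -2.0, -1.0, 1.0, 2.0, 3.0]
# Assumption for illustration: platform Y compresses every ratio to 30%.
log2_ratio_y = [0.3 * v for v in log2_ratio_x]

r = pearson(log2_ratio_x, log2_ratio_y)

# Fold-change of the last gene on each platform (back-transformed from log2).
fold_x = 2 ** abs(log2_ratio_x[-1])
fold_y = 2 ** abs(log2_ratio_y[-1])

print(f"correlation = {r:.3f}, fold-change X = {fold_x:.2f}, Y = {fold_y:.2f}")
```

In this toy case the two platforms are perfectly correlated, yet the same gene passes a 2-fold cutoff on platform X and fails it on platform Y. Correlation is insensitive to this kind of scale difference.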

RESULTS

A total of 293 arrays were used in the intra- and inter-platform analysis. A hierarchical cluster analysis shows distinct differences in the measured intensities among the five platforms. A number of genes show a small fold-change in one platform and a large fold-change in another platform, even though the correlations between platforms are high. An analysis of variance shows that, for thirty percent of the genes, the expression patterns of the samples are inconsistent across the five platforms. We illustrate that POG does not reflect the accuracy of a selected gene list: a non-overlapping gene can be truly differentially expressed under a stringent cutoff, and an overlapping gene can be non-differentially expressed under a non-stringent cutoff. In addition, POG is an unusable selection criterion: it can increase or decrease irregularly as the cutoff changes, and there is no criterion for choosing a cutoff that optimizes POG.
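The irregular behavior of POG under a changing cutoff can be sketched with two made-up ranked gene lists. This assumes POG is computed as the share of genes common to the top-k lists from two sites or platforms; the abstract does not spell out the formula, and the gene names and the `pog_top_k` helper are hypothetical.

```python
def pog_top_k(rank_a, rank_b, k):
    """Percent of overlapping genes between the top-k entries of two rankings."""
    if k < 1 or k > min(len(rank_a), len(rank_b)):
        raise ValueError("k must be between 1 and the shorter list length")
    shared = set(rank_a[:k]) & set(rank_b[:k])
    return 100.0 * len(shared) / k

# Hypothetical gene rankings (most significant first) from two platforms.
rank_a = ["G1", "G2", "G3", "G4", "G5", "G6"]
rank_b = ["G2", "G1", "G7", "G8", "G3", "G4"]

# POG for every list size from 1 to 6, rounded to one decimal place.
pog_by_cutoff = [round(pog_top_k(rank_a, rank_b, k), 1) for k in range(1, 7)]
print(pog_by_cutoff)  # [0.0, 100.0, 66.7, 50.0, 60.0, 66.7]
```

As the list grows, POG in this example rises, falls, and rises again, so no cutoff stands out as the natural optimum. The overlap also says nothing about whether the shared genes are truly differentially expressed.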

CONCLUSION

Using various statistical methods, we demonstrate that there are differences in the intensities measured by different platforms and by different sites within a platform. Within each platform, the patterns of expression are generally consistent, but there is site-by-site variability. Evaluation of data analysis methods for use in regulatory decisions should take the case of no treatment effect into consideration: when there is no treatment effect, "a fold-change cutoff with a non-stringent p-value cutoff" could result in 100% false positive error selection.
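The no-treatment-effect case can be illustrated with a small simulation. This is a sketch under stated assumptions, not the authors' analysis: the gene count, replicate number, noise level, 1.5-fold cutoff, and 0.05 p-value cutoff are all hypothetical, and a z-test with known standard deviation stands in for whatever test a real pipeline would use.

```python
import math
import random
from statistics import NormalDist, mean

random.seed(0)
N_GENES, N_REPS, SIGMA = 2000, 5, 0.5  # hypothetical simulation settings
FC_CUTOFF = math.log2(1.5)             # fold-change cutoff on the log2 scale
P_CUTOFF = 0.05                        # non-stringent p-value cutoff

# Standard error of the difference between two group means (known SIGMA).
std_err = SIGMA * math.sqrt(2.0 / N_REPS)

selected = []
for gene in range(N_GENES):
    # Both groups come from the same distribution: no treatment effect.
    group_a = [random.gauss(0.0, SIGMA) for _ in range(N_REPS)]
    group_b = [random.gauss(0.0, SIGMA) for _ in range(N_REPS)]
    diff = mean(group_b) - mean(group_a)
    p_value = 2.0 * (1.0 - NormalDist().cdf(abs(diff / std_err)))
    if abs(diff) >= FC_CUTOFF and p_value < P_CUTOFF:
        selected.append(gene)

truly_changed = set()  # empty by construction: nothing is differentially expressed
false_positives = [g for g in selected if g not in truly_changed]
false_positive_share = len(false_positives) / len(selected) if selected else 0.0

print(f"selected {len(selected)} genes; "
      f"{100 * false_positive_share:.0f}% are false positives")
```

Because no gene is truly changed, every gene that passes both cutoffs is a false positive. The selection rule still returns a non-empty list, which is why a method evaluation that never considers the null case can look better than it is.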

Similar articles

Performance comparison of two microarray platforms to assess differential gene expression in human monocyte and macrophage cells.

BMC Genomics. 2008 Jun 25;9:302. doi: 10.1186/1471-2164-9-302.

Cross-platform comparison of SYBR Green real-time PCR with TaqMan PCR, microarrays and other gene expression measurement technologies evaluated in the MicroArray Quality Control (MAQC) study.

BMC Genomics. 2008 Jul 11;9:328. doi: 10.1186/1471-2164-9-328.

The balance of reproducibility, sensitivity, and specificity of lists of differentially expressed genes in microarray studies.

BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S10. doi: 10.1186/1471-2105-9-S9-S10.

Evaluation of DNA microarray results with quantitative gene expression platforms.

Nat Biotechnol. 2006 Sep;24(9):1115-22. doi: 10.1038/nbt1236.

The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements.

Nat Biotechnol. 2006 Sep;24(9):1151-61. doi: 10.1038/nbt1239.

Investigating the concordance of Gene Ontology terms reveals the intra- and inter-platform reproducibility of enrichment analysis.

BMC Bioinformatics. 2013 Apr 29;14:143. doi: 10.1186/1471-2105-14-143.

Evaluation of gene expression data generated from expired Affymetrix GeneChip® microarrays using MAQC reference RNA samples.

BMC Bioinformatics. 2010 Oct 7;11 Suppl 6(Suppl 6):S10. doi: 10.1186/1471-2105-11-S6-S10.

Rat toxicogenomic study reveals analytical consistency across microarray platforms.

Nat Biotechnol. 2006 Sep;24(9):1162-9. doi: 10.1038/nbt1238.

Evaluating methods for ranking differentially expressed genes applied to microArray quality control data.

BMC Bioinformatics. 2011 Jun 6;12:227. doi: 10.1186/1471-2105-12-227.

Cited by

Multi-Omics Analysis Identified Drug Repurposing Targets for Chronic Obstructive Pulmonary Disease.

Int J Mol Sci. 2024 Oct 16;25(20):11106. doi: 10.3390/ijms252011106.

A simplified machine learning model utilizing platelet-related genes for predicting poor prognosis in sepsis.

Front Immunol. 2023 Nov 20;14:1286203. doi: 10.3389/fimmu.2023.1286203. eCollection 2023.

Uniformly shaped harmonization combines human transcriptomic data from different platforms while retaining their biological properties and differential gene expression patterns.

Front Mol Biosci. 2023 Sep 6;10:1237129. doi: 10.3389/fmolb.2023.1237129. eCollection 2023.

Multi-omics data integration using ratio-based quantitative profiling with Quartet reference materials.

Nat Biotechnol. 2024 Jul;42(7):1133-1149. doi: 10.1038/s41587-023-01934-1. Epub 2023 Sep 7.

Towards reproducible research in recurrent pregnancy loss immunology: Learning from cancer microenvironment deconvolution.

Front Immunol. 2023 Feb 23;14:1082087. doi: 10.3389/fimmu.2023.1082087. eCollection 2023.

Systematic Review of the Diagnostic and Clinical Utility of Salivary microRNAs in Traumatic Brain Injury (TBI).

Int J Mol Sci. 2022 Oct 29;23(21):13160. doi: 10.3390/ijms232113160.

Transcriptomic Harmonization as the Way for Suppressing Cross-Platform Bias and Batch Effect.

Biomedicines. 2022 Sep 18;10(9):2318. doi: 10.3390/biomedicines10092318.

Evaluation of connectivity map shows limited reproducibility in drug repositioning.

Sci Rep. 2021 Sep 2;11(1):17624. doi: 10.1038/s41598-021-97005-z.

Full pathogen characterisation: species identification including the detection of virulence factors and antibiotic resistance genes via multiplex DNA-assays.

Sci Rep. 2021 Mar 16;11(1):6001. doi: 10.1038/s41598-021-85438-5.

Identification of candidate repurposable drugs to combat COVID-19 using a signature-based approach.

Sci Rep. 2021 Feb 24;11(1):4495. doi: 10.1038/s41598-021-84044-9.

References

MAQC papers over the cracks.

Nat Biotechnol. 2007 Jan;25(1):27-8; author reply 28-9. doi: 10.1038/nbt0107-27.

Statistical methods and microarray data.

Nat Biotechnol. 2007 Jan;25(1):25-6; author reply 26-7. doi: 10.1038/nbt0107-25.

Rat toxicogenomic study reveals analytical consistency across microarray platforms.

Nat Biotechnol. 2006 Sep;24(9):1162-9. doi: 10.1038/nbt1238.

The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements.

Nat Biotechnol. 2006 Sep;24(9):1151-61. doi: 10.1038/nbt1239.

Performance comparison of one-color and two-color platforms within the MicroArray Quality Control (MAQC) project.

Nat Biotechnol. 2006 Sep;24(9):1140-50. doi: 10.1038/nbt1242.

Evaluation of DNA microarray results with quantitative gene expression platforms.

Nat Biotechnol. 2006 Sep;24(9):1115-22. doi: 10.1038/nbt1236.

Reliability and reproducibility issues in DNA microarray measurements.

Trends Genet. 2006 Feb;22(2):101-9. doi: 10.1016/j.tig.2005.12.005. Epub 2005 Dec 27.

An array of problems.

Nat Rev Drug Discov. 2005 May;4(5):362-3. doi: 10.1038/nrd1746.

Standardizing global gene expression analysis between laboratories and across platforms.

Nat Methods. 2005 May;2(5):351-6. doi: 10.1038/nmeth754. Epub 2005 Apr 21.

Multiple-laboratory comparison of microarray platforms.

Nat Methods. 2005 May;2(5):345-50. doi: 10.1038/nmeth756. Epub 2005 Apr 21.
