• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

纠正 Illumina BeadChip 数据中的实验内变异性对于生成稳健的基因表达谱是必要的。

Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles.

机构信息

School of Physics, University of Edinburgh, 10 Crichton Street, Edinburgh, EH8 9AB, UK.

出版信息

BMC Genomics. 2010 Feb 24;11:134. doi: 10.1186/1471-2164-11-134.

DOI:10.1186/1471-2164-11-134
PMID:20181233
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2843619/
Abstract

BACKGROUND

Microarray technology is a popular means of producing whole genome transcriptional profiles, however high cost and scarcity of mRNA has led many studies to be conducted based on the analysis of single samples. We exploit the design of the Illumina platform, specifically multiple arrays on each chip, to evaluate intra-experiment technical variation using repeated hybridisations of universal human reference RNA (UHRR) and duplicate hybridisations of primary breast tumour samples from a clinical study.

RESULTS

A clear batch-specific bias was detected in the measured expressions of both the UHRR and clinical samples. This bias was found to persist following standard microarray normalisation techniques. However, when mean-centering or empirical Bayes batch-correction methods (ComBat) were applied to the data, inter-batch variation in the UHRR and clinical samples were greatly reduced. Correlation between replicate UHRR samples improved by two orders of magnitude following batch-correction using ComBat (ranging from 0.9833-0.9991 to 0.9997-0.9999) and increased the consistency of the gene-lists from the duplicate clinical samples, from 11.6% in quantile normalised data to 66.4% in batch-corrected data. The use of UHRR as an inter-batch calibrator provided a small additional benefit when used in conjunction with ComBat, further increasing the agreement between the two gene-lists, up to 74.1%.

CONCLUSION

In the interests of practicalities and cost, these results suggest that single samples can generate reliable data, but only after careful compensation for technical bias in the experiment. We recommend that investigators appreciate the propensity for such variation in the design stages of a microarray experiment and that the use of suitable correction methods become routine during the statistical analysis of the data.

摘要

背景

微阵列技术是产生全基因组转录谱的一种流行手段,然而,mRNA 的高成本和稀缺性导致许多研究基于单个样本的分析进行。我们利用 Illumina 平台的设计,特别是每个芯片上的多个阵列,通过重复杂交通用人类参考 RNA(UHRR)和临床研究中来自原发性乳腺癌样本的重复杂交来评估实验内技术变异。

结果

在测量的 UHRR 和临床样本的表达中均检测到明显的批次特异性偏差。在应用标准微阵列归一化技术后,仍发现存在这种偏差。但是,当对数据应用均值中心化或经验贝叶斯批次校正方法(ComBat)时,UHRR 和临床样本中的批次间差异大大减少。使用 ComBat 进行批次校正后,重复 UHRR 样本之间的相关性提高了两个数量级(范围从 0.9833-0.9991 到 0.9997-0.9999),并增加了来自重复临床样本的基因列表的一致性,从定量归一化数据中的 11.6%增加到批次校正数据中的 66.4%。在与 ComBat 结合使用时,UHRR 作为批次间校准器的使用提供了额外的益处,进一步提高了两个基因列表之间的一致性,最高可达 74.1%。

结论

为了实际和成本的考虑,这些结果表明,单个样本可以生成可靠的数据,但仅在仔细补偿实验中的技术偏差后才能实现。我们建议研究人员在微阵列实验的设计阶段注意到这种变异的倾向,并建议在数据的统计分析中常规使用合适的校正方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1dc/2843619/aa14af8a8bf5/1471-2164-11-134-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1dc/2843619/534d17fd803c/1471-2164-11-134-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1dc/2843619/c6fe8c0cbf65/1471-2164-11-134-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1dc/2843619/d6fdda4e1b5c/1471-2164-11-134-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1dc/2843619/c962a30fa100/1471-2164-11-134-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1dc/2843619/aa14af8a8bf5/1471-2164-11-134-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1dc/2843619/534d17fd803c/1471-2164-11-134-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1dc/2843619/c6fe8c0cbf65/1471-2164-11-134-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1dc/2843619/d6fdda4e1b5c/1471-2164-11-134-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1dc/2843619/c962a30fa100/1471-2164-11-134-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1dc/2843619/aa14af8a8bf5/1471-2164-11-134-5.jpg

相似文献

1
Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles.纠正 Illumina BeadChip 数据中的实验内变异性对于生成稳健的基因表达谱是必要的。
BMC Genomics. 2010 Feb 24;11:134. doi: 10.1186/1471-2164-11-134.
2
Relative impact of key sources of systematic noise in Affymetrix and Illumina gene-expression microarray experiments.Affymetrix 和 Illumina 基因表达微阵列实验中系统噪声关键来源的相对影响。
BMC Genomics. 2011 Dec 1;12:589. doi: 10.1186/1471-2164-12-589.
3
Empirical Bayes accomodation of batch-effects in microarray data using identical replicate reference samples: application to RNA expression profiling of blood from Duchenne muscular dystrophy patients.使用相同重复参考样本对微阵列数据中的批次效应进行经验贝叶斯调整:应用于杜兴氏肌营养不良症患者血液的RNA表达谱分析。
BMC Genomics. 2008 Oct 20;9:494. doi: 10.1186/1471-2164-9-494.
4
Universal Reference RNA as a standard for microarray experiments.通用参考RNA作为微阵列实验的标准。
BMC Genomics. 2004 Mar 9;5(1):20. doi: 10.1186/1471-2164-5-20.
5
Removing Batch Effects from Longitudinal Gene Expression - Quantile Normalization Plus ComBat as Best Approach for Microarray Transcriptome Data.去除纵向基因表达中的批次效应——分位数标准化加ComBat是微阵列转录组数据的最佳方法
PLoS One. 2016 Jun 7;11(6):e0156594. doi: 10.1371/journal.pone.0156594. eCollection 2016.
6
Replication, variation and normalisation in microarray experiments.微阵列实验中的复制、变异与标准化
Appl Bioinformatics. 2005;4(1):33-44. doi: 10.2165/00822942-200504010-00004.
7
Analyzing illumina gene expression microarray data from different tissues: methodological aspects of data analysis in the metaxpress consortium.分析来自不同组织的 Illumina 基因表达微阵列数据:Metaxpress 联盟中数据分析的方法学方面。
PLoS One. 2012;7(12):e50938. doi: 10.1371/journal.pone.0050938. Epub 2012 Dec 7.
8
Batch effect correction for genome-wide methylation data with Illumina Infinium platform.基于 Illumina Infinium 平台的全基因组甲基化数据的批次效应校正。
BMC Med Genomics. 2011 Dec 16;4:84. doi: 10.1186/1755-8794-4-84.
9
Direct integration of intensity-level data from Affymetrix and Illumina microarrays improves statistical power for robust reanalysis.直接整合 Affymetrix 和 Illumina 微阵列的强度水平数据可提高稳健重分析的统计功效。
BMC Med Genomics. 2012 Aug 21;5:35. doi: 10.1186/1755-8794-5-35.
10
Adjusting batch effects in microarray expression data using empirical Bayes methods.使用经验贝叶斯方法调整微阵列表达数据中的批次效应。
Biostatistics. 2007 Jan;8(1):118-27. doi: 10.1093/biostatistics/kxj037. Epub 2006 Apr 21.

引用本文的文献

1
Transcriptomic and Proteomic Insights into Host Immune Responses in Pediatric Severe Malarial Anemia: Dysregulation in HSP60-70-TLR2/4 Signaling and Altered Glutamine Metabolism.小儿严重疟疾贫血宿主免疫反应的转录组学和蛋白质组学见解:HSP60-70-TLR2/4信号通路失调与谷氨酰胺代谢改变
Pathogens. 2024 Oct 3;13(10):867. doi: 10.3390/pathogens13100867.
2
A multi-omics dataset of human transcriptome and proteome stable reference.人类转录组和蛋白质组稳定参考的多组学数据集。
Sci Data. 2023 Jul 13;10(1):455. doi: 10.1038/s41597-023-02359-w.
3
Mitochondrial respiration variability and simulations in human skeletal muscle: The Gene SMART study.

本文引用的文献

1
Sources of bias in specimens for research about molecular markers for cancer.用于癌症分子标志物研究标本的偏倚来源。
J Clin Oncol. 2010 Feb 1;28(4):698-704. doi: 10.1200/JCO.2009.25.6065. Epub 2009 Dec 28.
2
A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data.Illumina BeadArray 重注释管道:改善基因表达数据的解读。
Nucleic Acids Res. 2010 Jan;38(3):e17. doi: 10.1093/nar/gkp942. Epub 2009 Nov 18.
3
Illumina WG-6 BeadChip strips should be normalized separately.Illumina WG-6 BeadChip 条带应分别进行标准化处理。
人类骨骼肌中线粒体呼吸变异性和模拟:Gene SMART 研究。
FASEB J. 2020 Feb;34(2):2978-2986. doi: 10.1096/fj.201901997RR. Epub 2020 Jan 9.
4
An airway epithelial IL-17A response signature identifies a steroid-unresponsive COPD patient subgroup.气道上皮细胞 IL-17A 反应特征可识别出一类对类固醇治疗无应答的 COPD 患者亚群。
J Clin Invest. 2019 Jan 2;129(1):169-181. doi: 10.1172/JCI121087. Epub 2018 Nov 26.
5
Alternative empirical Bayes models for adjusting for batch effects in genomic studies.用于调整基因组研究中批次效应的替代经验贝叶斯模型。
BMC Bioinformatics. 2018 Jul 13;19(1):262. doi: 10.1186/s12859-018-2263-6.
6
Re-evaluating data quality of dog mitochondrial, Y chromosomal, and autosomal SNPs genotyped by SNP array.重新评估通过 SNP 芯片基因分型的犬线粒体、Y 染色体和常染色体 SNPs 的数据质量。
Zool Res. 2016 Nov 18;37(6):356-360. doi: 10.13918/j.issn.2095-8137.2016.6.356.
7
Immune-Signatures for Lung Cancer Diagnostics: Evaluation of Protein Microarray Data Normalization Strategies.用于肺癌诊断的免疫特征:蛋白质微阵列数据标准化策略的评估
Microarrays (Basel). 2015 Apr 2;4(2):162-87. doi: 10.3390/microarrays4020162.
8
Tumour sampling method can significantly influence gene expression profiles derived from neoadjuvant window studies.肿瘤采样方法可能会显著影响新辅助窗口期研究中得出的基因表达谱。
Sci Rep. 2016 Jul 7;6:29434. doi: 10.1038/srep29434.
9
Removing Batch Effects from Longitudinal Gene Expression - Quantile Normalization Plus ComBat as Best Approach for Microarray Transcriptome Data.去除纵向基因表达中的批次效应——分位数标准化加ComBat是微阵列转录组数据的最佳方法
PLoS One. 2016 Jun 7;11(6):e0156594. doi: 10.1371/journal.pone.0156594. eCollection 2016.
10
Comparative Analysis of Matrix Metalloproteinase Family Members Reveals That MMP9 Predicts Survival and Response to Temozolomide in Patients with Primary Glioblastoma.基质金属蛋白酶家族成员的比较分析表明,MMP9可预测原发性胶质母细胞瘤患者的生存期及对替莫唑胺的反应。
PLoS One. 2016 Mar 29;11(3):e0151815. doi: 10.1371/journal.pone.0151815. eCollection 2016.
BMC Bioinformatics. 2009 Nov 11;10:372. doi: 10.1186/1471-2105-10-372.
4
Design and optimization of reverse-transcription quantitative PCR experiments.逆转录定量PCR实验的设计与优化
Clin Chem. 2009 Oct;55(10):1816-23. doi: 10.1373/clinchem.2009.126201. Epub 2009 Jul 30.
5
The development of a comparison approach for Illumina bead chips unravels unexpected challenges applying newest generation microarrays.一种针对Illumina珠芯片的比较方法的开发揭示了应用最新一代微阵列时意想不到的挑战。
BMC Bioinformatics. 2009 Jun 18;10:186. doi: 10.1186/1471-2105-10-186.
6
Repeatability of published microarray gene expression analyses.已发表的微阵列基因表达分析的可重复性。
Nat Genet. 2009 Feb;41(2):149-55. doi: 10.1038/ng.295. Epub 2008 Jan 28.
7
Bioinformatics and breast cancer: what can high-throughput genomic approaches actually tell us?生物信息学与乳腺癌:高通量基因组方法究竟能告诉我们什么?
J Clin Pathol. 2009 Oct;62(10):879-85. doi: 10.1136/jcp.2008.060376. Epub 2009 Jan 27.
8
Empirical Bayes accomodation of batch-effects in microarray data using identical replicate reference samples: application to RNA expression profiling of blood from Duchenne muscular dystrophy patients.使用相同重复参考样本对微阵列数据中的批次效应进行经验贝叶斯调整:应用于杜兴氏肌营养不良症患者血液的RNA表达谱分析。
BMC Genomics. 2008 Oct 20;9:494. doi: 10.1186/1471-2164-9-494.
9
Comparison of the diagnostic performance of human whole genome microarrays using mixed-tissue RNA reference samples.使用混合组织RNA参考样本对人类全基因组微阵列诊断性能的比较。
Toxicol Lett. 2009 Apr 10;186(1):58-61. doi: 10.1016/j.toxlet.2008.08.018. Epub 2008 Sep 7.
10
The removal of multiplicative, systematic bias allows integration of breast cancer gene expression datasets - improving meta-analysis and prediction of prognosis.消除乘法性系统偏差可实现乳腺癌基因表达数据集的整合,从而改进荟萃分析和预后预测。
BMC Med Genomics. 2008 Sep 21;1:42. doi: 10.1186/1755-8794-1-42.