• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

直接整合 Affymetrix 和 Illumina 微阵列的强度水平数据可提高稳健重分析的统计功效。

Direct integration of intensity-level data from Affymetrix and Illumina microarrays improves statistical power for robust reanalysis.

机构信息

Breakthrough Research Unit, University of Edinburgh, Crewe Road South, Edinburgh, EH4 2XR, UK.

出版信息

BMC Med Genomics. 2012 Aug 21;5:35. doi: 10.1186/1755-8794-5-35.

DOI:10.1186/1755-8794-5-35
PMID:22909195
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3443058/
Abstract

BACKGROUND

Affymetrix GeneChips and Illumina BeadArrays are the most widely used commercial single channel gene expression microarrays. Public data repositories are an extremely valuable resource, providing array-derived gene expression measurements from many thousands of experiments. Unfortunately many of these studies are underpowered and it is desirable to improve power by combining data from more than one study; we sought to determine whether platform-specific bias precludes direct integration of probe intensity signals for combined reanalysis.

RESULTS

Using Affymetrix and Illumina data from the microarray quality control project, from our own clinical samples, and from additional publicly available datasets we evaluated several approaches to directly integrate intensity level expression data from the two platforms. After mapping probe sequences to Ensembl genes we demonstrate that, ComBat and cross platform normalisation (XPN), significantly outperform mean-centering and distance-weighted discrimination (DWD) in terms of minimising inter-platform variance. In particular we observed that DWD, a popular method used in a number of previous studies, removed systematic bias at the expense of genuine biological variability, potentially reducing legitimate biological differences from integrated datasets.

CONCLUSION

Normalised and batch-corrected intensity-level data from Affymetrix and Illumina microarrays can be directly combined to generate biologically meaningful results with improved statistical power for robust, integrated reanalysis.

摘要

背景

Affymetrix GeneChips 和 Illumina BeadArrays 是最广泛使用的商业单通道基因表达微阵列。公共数据存储库是一个极其有价值的资源,提供了来自数千个实验的基于阵列的基因表达测量值。不幸的是,这些研究中的许多都没有足够的功效,因此希望通过合并来自多个研究的数据来提高功效;我们试图确定平台特异性偏差是否会阻止直接集成探针强度信号以进行联合重新分析。

结果

使用微阵列质量控制项目中的 Affymetrix 和 Illumina 数据、我们自己的临床样本以及其他可公开获得的数据集,我们评估了几种方法,以直接整合来自两个平台的强度水平表达数据。在将探针序列映射到 Ensembl 基因后,我们证明 ComBat 和跨平台归一化(XPN)在最小化平台间方差方面明显优于均值中心化和距离加权判别(DWD)。特别是,我们观察到 DWD(一种在许多先前研究中使用的流行方法)以牺牲真实的生物变异性为代价消除了系统偏差,这可能会降低来自集成数据集的合法生物学差异。

结论

可以直接合并 Affymetrix 和 Illumina 微阵列的归一化和批次校正的强度水平数据,以生成具有改进统计功效的生物学上有意义的结果,从而进行稳健的综合重新分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6dc/3443058/211ecca877d2/1755-8794-5-35-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6dc/3443058/32e74df7df39/1755-8794-5-35-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6dc/3443058/930688fff33f/1755-8794-5-35-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6dc/3443058/23de61753018/1755-8794-5-35-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6dc/3443058/f9a053994edd/1755-8794-5-35-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6dc/3443058/211ecca877d2/1755-8794-5-35-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6dc/3443058/32e74df7df39/1755-8794-5-35-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6dc/3443058/930688fff33f/1755-8794-5-35-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6dc/3443058/23de61753018/1755-8794-5-35-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6dc/3443058/f9a053994edd/1755-8794-5-35-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6dc/3443058/211ecca877d2/1755-8794-5-35-5.jpg

相似文献

1
Direct integration of intensity-level data from Affymetrix and Illumina microarrays improves statistical power for robust reanalysis.直接整合 Affymetrix 和 Illumina 微阵列的强度水平数据可提高稳健重分析的统计功效。
BMC Med Genomics. 2012 Aug 21;5:35. doi: 10.1186/1755-8794-5-35.
2
Redefinition of Affymetrix probe sets by sequence overlap with cDNA microarray probes reduces cross-platform inconsistencies in cancer-associated gene expression measurements.通过与cDNA微阵列探针的序列重叠来重新定义Affymetrix探针集,可减少癌症相关基因表达测量中跨平台的不一致性。
BMC Bioinformatics. 2005 Apr 25;6:107. doi: 10.1186/1471-2105-6-107.
3
Cross-platform expression microarray performance in a mouse model of mitochondrial disease therapy.用于治疗线粒体疾病的小鼠模型中的跨平台表达微阵列性能。
Mol Genet Metab. 2010 Mar;99(3):309-18. doi: 10.1016/j.ymgme.2009.10.179. Epub 2009 Oct 30.
4
On the necessity of different statistical treatment for Illumina BeadChip and Affymetrix GeneChip data and its significance for biological interpretation.关于Illumina BeadChip和Affymetrix GeneChip数据不同统计处理方法的必要性及其对生物学解释的意义
Biol Direct. 2008 Jun 3;3:23. doi: 10.1186/1745-6150-3-23.
5
Experimental comparison and cross-validation of the Affymetrix and Illumina gene expression analysis platforms.Affymetrix和Illumina基因表达分析平台的实验比较与交叉验证
Nucleic Acids Res. 2005 Oct 19;33(18):5914-23. doi: 10.1093/nar/gki890. Print 2005.
6
Are data from different gene expression microarray platforms comparable?来自不同基因表达微阵列平台的数据具有可比性吗?
Genomics. 2004 Jun;83(6):1164-8. doi: 10.1016/j.ygeno.2004.01.004.
7
Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles.纠正 Illumina BeadChip 数据中的实验内变异性对于生成稳健的基因表达谱是必要的。
BMC Genomics. 2010 Feb 24;11:134. doi: 10.1186/1471-2164-11-134.
8
MAID : an effect size based model for microarray data integration across laboratories and platforms.MAID:一种基于效应量的模型,用于跨实验室和平台整合微阵列数据。
BMC Bioinformatics. 2008 Jul 10;9:305. doi: 10.1186/1471-2105-9-305.
9
Evaluation of the similarity of gene expression data estimated with SAGE and Affymetrix GeneChips.用SAGE和Affymetrix基因芯片评估基因表达数据的相似性。
BMC Genomics. 2005 Jun 14;6:91. doi: 10.1186/1471-2164-6-91.
10
Statistical analysis of an RNA titration series evaluates microarray precision and sensitivity on a whole-array basis.对RNA滴定系列进行统计分析可在全阵列基础上评估微阵列的精度和灵敏度。
BMC Bioinformatics. 2006 Nov 22;7:511. doi: 10.1186/1471-2105-7-511.

引用本文的文献

1
Using meta-analysis and machine learning to investigate the transcriptional response of immune cells to Leishmania infection.利用荟萃分析和机器学习研究免疫细胞对利什曼原虫感染的转录反应。
PLoS Negl Trop Dis. 2024 Jan 8;18(1):e0011892. doi: 10.1371/journal.pntd.0011892. eCollection 2024 Jan.
2
An integrated in silico-in vitro approach for identifying therapeutic targets against osteoarthritis.一种综合的计算机模拟-体外方法,用于鉴定治疗骨关节炎的治疗靶点。
BMC Biol. 2022 Nov 9;20(1):253. doi: 10.1186/s12915-022-01451-8.
3
A Novel Approach for the Discovery of Biomarkers of Radiotherapy Response in Breast Cancer.

本文引用的文献

1
Comprehensive literature review and statistical considerations for microarray meta-analysis.综合文献回顾和微阵列荟萃分析的统计考虑。
Nucleic Acids Res. 2012 May;40(9):3785-99. doi: 10.1093/nar/gkr1265. Epub 2012 Jan 19.
2
The sva package for removing batch effects and other unwanted variation in high-throughput experiments.sva 包用于去除高通量实验中的批次效应和其他不需要的变异。
Bioinformatics. 2012 Mar 15;28(6):882-3. doi: 10.1093/bioinformatics/bts034. Epub 2012 Jan 17.
3
Empirical comparison of cross-platform normalization methods for gene expression data.
一种发现乳腺癌放疗反应生物标志物的新方法。
J Pers Med. 2021 Aug 14;11(8):796. doi: 10.3390/jpm11080796.
4
Reliability of microarray analysis for studying periodontitis: low consistency in 2 periodontitis cohort data sets from different platforms and an integrative meta-analysis.用于研究牙周炎的基因芯片分析的可靠性:来自不同平台的两个牙周炎队列数据集的低一致性及整合荟萃分析
J Periodontal Implant Sci. 2021 Feb;51(1):18-29. doi: 10.5051/jpis.2002120106.
5
Comparative Analysis of the Development of Acquired Radioresistance in Canine and Human Mammary Cancer Cell Lines.犬类和人类乳腺癌细胞系获得性放射抗性发展的比较分析
Front Vet Sci. 2020 Jul 23;7:439. doi: 10.3389/fvets.2020.00439. eCollection 2020.
6
Automated Hypothesis Generation to Identify Signals Relevant in the Development of Mammalian Cell and Tissue Bioprocesses, With Validation in a Retinal Culture System.自动生成假设以识别与哺乳动物细胞和组织生物过程开发相关的信号,并在视网膜培养系统中进行验证。
Front Bioeng Biotechnol. 2020 Jun 4;8:534. doi: 10.3389/fbioe.2020.00534. eCollection 2020.
7
Human Pluripotent Stem Cell-Derived Hepatocytes Show Higher Transcriptional Correlation with Adult Liver Tissue than with Fetal Liver Tissue.人多能干细胞来源的肝细胞与成人肝脏组织的转录相关性高于与胎儿肝脏组织的转录相关性。
ACS Omega. 2020 Mar 2;5(10):4816-4827. doi: 10.1021/acsomega.9b03514. eCollection 2020 Mar 17.
8
Unlocking the transcriptomic potential of formalin-fixed paraffin embedded clinical tissues: comparison of gene expression profiling approaches.解析福尔马林固定石蜡包埋临床组织的转录组潜力:基因表达谱分析方法的比较。
BMC Bioinformatics. 2020 Jan 28;21(1):30. doi: 10.1186/s12859-020-3365-5.
9
Integrative Analysis of Axolotl Gene Expression Data from Regenerative and Wound Healing Limb Tissues.蝾螈再生和创伤愈合肢体组织基因表达数据的综合分析。
Sci Rep. 2019 Dec 30;9(1):20280. doi: 10.1038/s41598-019-56829-6.
10
Development and characterisation of acquired radioresistant breast cancer cell lines.获得性放射抗拒乳腺癌细胞系的发展与特征。
Radiat Oncol. 2019 Apr 15;14(1):64. doi: 10.1186/s13014-019-1268-2.
基于基因表达数据的跨平台归一化方法的实证比较。
BMC Bioinformatics. 2011 Dec 7;12:467. doi: 10.1186/1471-2105-12-467.
4
Relative impact of key sources of systematic noise in Affymetrix and Illumina gene-expression microarray experiments.Affymetrix 和 Illumina 基因表达微阵列实验中系统噪声关键来源的相对影响。
BMC Genomics. 2011 Dec 1;12:589. doi: 10.1186/1471-2164-12-589.
5
ProfileChaser: searching microarray repositories based on genome-wide patterns of differential expression.ProfileChaser:基于全基因组差异表达模式搜索微阵列数据库。
Bioinformatics. 2011 Dec 1;27(23):3317-8. doi: 10.1093/bioinformatics/btr548. Epub 2011 Oct 3.
6
Independent surrogate variable analysis to deconvolve confounding factors in large-scale microarray profiling studies.独立的替代变量分析可在大规模微阵列分析研究中去卷积混杂因素。
Bioinformatics. 2011 Jun 1;27(11):1496-505. doi: 10.1093/bioinformatics/btr171. Epub 2011 Apr 6.
7
Microarray-based class discovery for molecular classification of breast cancer: analysis of interobserver agreement.基于微阵列的乳腺癌分子分类的分类发现:观察者间一致性分析。
J Natl Cancer Inst. 2011 Apr 20;103(8):662-73. doi: 10.1093/jnci/djr071. Epub 2011 Mar 18.
8
The Gene Expression Barcode: leveraging public data repositories to begin cataloging the human and murine transcriptomes.基因表达条形码:利用公共数据存储库开始编目人类和小鼠转录组。
Nucleic Acids Res. 2011 Jan;39(Database issue):D1011-5. doi: 10.1093/nar/gkq1259.
9
Content-based microarray search using differential expression profiles.基于差异表达谱的基于内容的微阵列搜索。
BMC Bioinformatics. 2010 Dec 21;11:603. doi: 10.1186/1471-2105-11-603.
10
Tackling the widespread and critical impact of batch effects in high-throughput data.解决高通量数据中广泛存在且极具影响力的批次效应问题。
Nat Rev Genet. 2010 Oct;11(10):733-9. doi: 10.1038/nrg2825. Epub 2010 Sep 14.