• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合来自不同代寡核苷酸阵列的基因表达数据。

Combining gene expression data from different generations of oligonucleotide arrays.

作者信息

Hwang Kyu-Baek, Kong Sek Won, Greenberg Steve A, Park Peter J

机构信息

School of Computer Science and Engineering, Seoul National University, Seoul 151-742, Korea.

出版信息

BMC Bioinformatics. 2004 Oct 25;5:159. doi: 10.1186/1471-2105-5-159.

DOI:10.1186/1471-2105-5-159
PMID:15504239
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC528726/
Abstract

BACKGROUND

One of the important challenges in microarray analysis is to take full advantage of previously accumulated data, both from one's own laboratory and from public repositories. Through a comparative analysis on a variety of datasets, a more comprehensive view of the underlying mechanism or structure can be obtained. However, as we discover in this work, continual changes in genomic sequence annotations and probe design criteria make it difficult to compare gene expression data even from different generations of the same microarray platform.

RESULTS

We first describe the extent of discordance between the results derived from two generations of Affymetrix oligonucleotide arrays, as revealed in cluster analysis and in identification of differentially expressed genes. We then propose a method for increasing comparability. The dataset we use consists of a set of 14 human muscle biopsy samples from patients with inflammatory myopathies that were hybridized on both HG-U95Av2 and HG-U133A human arrays. We find that the use of the probe set matching table for comparative analysis provided by Affymetrix produces better results than matching by UniGene or LocusLink identifiers but still remains inadequate. Rescaling of expression values for each gene across samples and data filtering by expression values enhance comparability but only for few specific analyses. As a generic method for improving comparability, we select a subset of probes with overlapping sequence segments in the two array types and recalculate expression values based only on the selected probes. We show that this filtering of probes significantly improves the comparability while retaining a sufficient number of probe sets for further analysis.

CONCLUSIONS

Compatibility between high-density oligonucleotide arrays is significantly affected by probe-level sequence information. With a careful filtering of the probes based on their sequence overlaps, data from different generations of microarrays can be combined more effectively.

摘要

背景

微阵列分析中的一个重要挑战是充分利用先前积累的数据,这些数据既来自自己的实验室,也来自公共数据库。通过对各种数据集进行比较分析,可以更全面地了解潜在的机制或结构。然而,正如我们在这项工作中所发现的,基因组序列注释和探针设计标准的不断变化使得即使是来自同一微阵列平台不同代次的基因表达数据也难以进行比较。

结果

我们首先描述了在聚类分析和差异表达基因鉴定中所揭示的两代Affymetrix寡核苷酸阵列结果之间的不一致程度。然后我们提出了一种提高可比性的方法。我们使用的数据集包括一组来自炎性肌病患者的14个人类肌肉活检样本,这些样本在HG-U95Av2和HG-U133A人类阵列上进行了杂交。我们发现,使用Affymetrix提供的探针集匹配表进行比较分析比通过UniGene或LocusLink标识符进行匹配产生的结果更好,但仍然不够。对每个基因在样本间的表达值进行重新缩放以及按表达值进行数据过滤可提高可比性,但仅适用于少数特定分析。作为一种提高可比性的通用方法,我们在两种阵列类型中选择具有重叠序列片段的探针子集,并仅基于所选探针重新计算表达值。我们表明,这种探针过滤显著提高了可比性,同时保留了足够数量的探针集用于进一步分析。

结论

高密度寡核苷酸阵列之间的兼容性受到探针水平序列信息的显著影响。通过基于序列重叠仔细过滤探针,可以更有效地组合来自不同代次微阵列的数据。

相似文献

1
Combining gene expression data from different generations of oligonucleotide arrays.整合来自不同代寡核苷酸阵列的基因表达数据。
BMC Bioinformatics. 2004 Oct 25;5:159. doi: 10.1186/1471-2105-5-159.
2
Redefinition of Affymetrix probe sets by sequence overlap with cDNA microarray probes reduces cross-platform inconsistencies in cancer-associated gene expression measurements.通过与cDNA微阵列探针的序列重叠来重新定义Affymetrix探针集,可减少癌症相关基因表达测量中跨平台的不一致性。
BMC Bioinformatics. 2005 Apr 25;6:107. doi: 10.1186/1471-2105-6-107.
3
Integrating probe-level expression changes across generations of Affymetrix arrays.整合跨代Affymetrix阵列的探针水平表达变化。
Nucleic Acids Res. 2005 Dec 14;33(22):e193. doi: 10.1093/nar/gni193.
4
CrossChip: a system supporting comparative analysis of different generations of Affymetrix arrays.CrossChip:一个支持对不同代Affymetrix阵列进行比较分析的系统。
Bioinformatics. 2005 May 1;21(9):2116-7. doi: 10.1093/bioinformatics/bti288. Epub 2005 Jan 31.
5
Transformation of expression intensities across generations of Affymetrix microarrays using sequence matching and regression modeling.使用序列匹配和回归建模对各代Affymetrix微阵列的表达强度进行转换。
Nucleic Acids Res. 2005 Oct 13;33(18):e157. doi: 10.1093/nar/gni159.
6
A verification protocol for the probe sequences of Affymetrix genome arrays reveals high probe accuracy for studies in mouse, human and rat.一项针对Affymetrix基因组阵列探针序列的验证方案显示,该探针在小鼠、人类和大鼠研究中具有很高的准确性。
BMC Bioinformatics. 2007 Apr 20;8:132. doi: 10.1186/1471-2105-8-132.
7
Alternative mapping of probes to genes for Affymetrix chips.针对Affymetrix芯片的探针到基因的替代映射。
BMC Bioinformatics. 2004 Aug 14;5:111. doi: 10.1186/1471-2105-5-111.
8
Leveraging two-way probe-level block design for identifying differential gene expression with high-density oligonucleotide arrays.利用双向探针水平块设计通过高密度寡核苷酸阵列鉴定差异基因表达。
BMC Bioinformatics. 2004 Apr 20;5:42. doi: 10.1186/1471-2105-5-42.
9
Sequence-matched probes produce increased cross-platform consistency and more reproducible biological results in microarray-based gene expression measurements.在基于微阵列的基因表达测量中,序列匹配的探针可提高跨平台一致性,并产生更具可重复性的生物学结果。
Nucleic Acids Res. 2004 May 25;32(9):e74. doi: 10.1093/nar/gnh071.
10
Long versus short oligonucleotide microarrays for the study of gene expression in nonhuman primates.用于研究非人类灵长类动物基因表达的长寡核苷酸与短寡核苷酸微阵列
J Neurosci Methods. 2006 Apr 15;152(1-2):179-89. doi: 10.1016/j.jneumeth.2005.09.007. Epub 2005 Oct 25.

引用本文的文献

1
Identification of gene expression profiles in myocardial infarction: a systematic review and meta-analysis.心肌梗死中基因表达谱的鉴定:系统评价和荟萃分析。
BMC Med Genomics. 2018 Nov 27;11(1):109. doi: 10.1186/s12920-018-0427-x.
2
virtualArray: a R/bioconductor package to merge raw data from different microarray platforms.virtualArray:一个 R/bioconductor 软件包,用于合并来自不同微阵列平台的原始数据。
BMC Bioinformatics. 2013 Mar 2;14:75. doi: 10.1186/1471-2105-14-75.
3
Probe mapping across multiple microarray platforms.在多个微阵列平台上进行探针映射。

本文引用的文献

1
Arabidopsis transcript profiling on Affymetrix GeneChip arrays.拟南芥在Affymetrix基因芯片阵列上的转录本分析。
Plant Mol Biol. 2003 Nov;53(4):457-65. doi: 10.1023/B:PLAN.0000019069.23317.97.
2
GeneAnnot: interfacing GeneCards with high-throughput gene expression compendia.基因注释:连接基因卡片与高通量基因表达汇编。
Brief Bioinform. 2003 Dec;4(4):349-60. doi: 10.1093/bib/4.4.349.
3
Comparing cDNA and oligonucleotide array data: concordance of gene expression across platforms for the NCI-60 cancer cells.比较互补DNA(cDNA)和寡核苷酸芯片数据:NCI - 60癌细胞跨平台基因表达的一致性
Brief Bioinform. 2012 Sep;13(5):547-54. doi: 10.1093/bib/bbr076. Epub 2011 Dec 23.
4
Modeling and analysis of retinoic acid induced differentiation of uncommitted precursor cells.视黄酸诱导未分化前体细胞分化的建模与分析。
Integr Biol (Camb). 2011 May;3(5):578-91. doi: 10.1039/c0ib00141d. Epub 2011 Mar 24.
5
Analysis of kinase gene expression patterns across 5681 human tissue samples reveals functional genomic taxonomy of the kinome.分析 5681 个人体组织样本中的激酶基因表达模式,揭示激酶组的功能基因组分类学。
PLoS One. 2010 Dec 3;5(12):e15068. doi: 10.1371/journal.pone.0015068.
6
Amacrine cell gene expression and survival signaling: differences from neighboring retinal ganglion cells.无
Invest Ophthalmol Vis Sci. 2010 Jul;51(7):3800-12. doi: 10.1167/iovs.09-4540. Epub 2010 May 5.
7
Standards affecting the consistency of gene expression arrays in clinical applications.影响临床应用中基因表达谱一致性的标准。
Cancer Epidemiol Biomarkers Prev. 2010 Apr;19(4):1000-3. doi: 10.1158/1055-9965.EPI-10-0044. Epub 2010 Mar 23.
8
Integration of heterogeneous expression data sets extends the role of the retinol pathway in diabetes and insulin resistance.异质表达数据集的整合扩展了视黄醇途径在糖尿病和胰岛素抵抗中的作用。
Bioinformatics. 2009 Dec 1;25(23):3121-7. doi: 10.1093/bioinformatics/btp559. Epub 2009 Sep 28.
9
Meta-analysis of glioblastoma multiforme versus anaplastic astrocytoma identifies robust gene markers.多形性胶质母细胞瘤与间变性星形细胞瘤的荟萃分析确定了可靠的基因标志物。
Mol Cancer. 2009 Sep 4;8:71. doi: 10.1186/1476-4598-8-71.
10
Comparison of three microarray probe annotation pipelines: differences in strategies and their effect on downstream analysis.三种微阵列探针注释流程的比较:策略差异及其对下游分析的影响。
BMC Proc. 2009 Jul 16;3 Suppl 4(Suppl 4):S1. doi: 10.1186/1753-6561-3-S4-S1.
Genome Biol. 2003;4(12):R82. doi: 10.1186/gb-2003-4-12-r82. Epub 2003 Nov 25.
4
A comparison of oligonucleotide and cDNA-based microarray systems.基于寡核苷酸和基于cDNA的微阵列系统的比较。
Physiol Genomics. 2004 Feb 13;16(3):361-70. doi: 10.1152/physiolgenomics.00080.2003.
5
Probe selection for high-density oligonucleotide arrays.高密度寡核苷酸阵列的探针选择
Proc Natl Acad Sci U S A. 2003 Sep 30;100(20):11237-42. doi: 10.1073/pnas.1534744100. Epub 2003 Sep 19.
6
Evaluation of gene expression measurements from commercial microarray platforms.商业微阵列平台基因表达测量的评估。
Nucleic Acids Res. 2003 Oct 1;31(19):5676-84. doi: 10.1093/nar/gkg763.
7
Microarrays: how many do you need?微阵列:你需要多少?
J Comput Biol. 2003;10(3-4):653-67. doi: 10.1089/10665270360688246.
8
Congruence of tissue expression profiles from Gene Expression Atlas, SAGEmap and TissueInfo databases.来自基因表达图谱、SAGEmap和组织信息数据库的组织表达谱的一致性。
BMC Genomics. 2003 Jul 29;4(1):31. doi: 10.1186/1471-2164-4-31.
9
Reproducibility of gene expression across generations of Affymetrix microarrays.Affymetrix微阵列各代间基因表达的可重复性。
BMC Bioinformatics. 2003 Jun 25;4:27. doi: 10.1186/1471-2105-4-27.
10
Spotted long oligonucleotide arrays for human gene expression analysis.用于人类基因表达分析的斑点长寡核苷酸阵列。
Genome Res. 2003 Jul;13(7):1775-85. doi: 10.1101/gr.1048803. Epub 2003 Jun 12.