• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高密度寡核苷酸阵列探针水平数据的探索、标准化及汇总

Exploration, normalization, and summaries of high density oligonucleotide array probe level data.

作者信息

Irizarry Rafael A, Hobbs Bridget, Collin Francois, Beazer-Barclay Yasmin D, Antonellis Kristen J, Scherf Uwe, Speed Terence P

机构信息

Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21205, USA.

出版信息

Biostatistics. 2003 Apr;4(2):249-64. doi: 10.1093/biostatistics/4.2.249.

DOI:10.1093/biostatistics/4.2.249
PMID:12925520
Abstract

In this paper we report exploratory analyses of high-density oligonucleotide array data from the Affymetrix GeneChip system with the objective of improving upon currently used measures of gene expression. Our analyses make use of three data sets: a small experimental study consisting of five MGU74A mouse GeneChip arrays, part of the data from an extensive spike-in study conducted by Gene Logic and Wyeth's Genetics Institute involving 95 HG-U95A human GeneChip arrays; and part of a dilution study conducted by Gene Logic involving 75 HG-U95A GeneChip arrays. We display some familiar features of the perfect match and mismatch probe (PM and MM) values of these data, and examine the variance-mean relationship with probe-level data from probes believed to be defective, and so delivering noise only. We explain why we need to normalize the arrays to one another using probe level intensities. We then examine the behavior of the PM and MM using spike-in data and assess three commonly used summary measures: Affymetrix's (i) average difference (AvDiff) and (ii) MAS 5.0 signal, and (iii) the Li and Wong multiplicative model-based expression index (MBEI). The exploratory data analyses of the probe level data motivate a new summary measure that is a robust multi-array average (RMA) of background-adjusted, normalized, and log-transformed PM values. We evaluate the four expression summary measures using the dilution study data, assessing their behavior in terms of bias, variance and (for MBEI and RMA) model fit. Finally, we evaluate the algorithms in terms of their ability to detect known levels of differential expression using the spike-in data. We conclude that there is no obvious downside to using RMA and attaching a standard error (SE) to this quantity using a linear model which removes probe-specific affinities.

摘要

在本文中,我们报告了对来自Affymetrix GeneChip系统的高密度寡核苷酸阵列数据的探索性分析,目的是改进当前使用的基因表达测量方法。我们的分析使用了三个数据集:一个由五个MGU74A小鼠基因芯片阵列组成的小型实验研究;基因逻辑公司(Gene Logic)和惠氏遗传学研究所(Wyeth's Genetics Institute)进行的一项广泛的掺入研究中的部分数据,该研究涉及95个HG-U95A人类基因芯片阵列;以及基因逻辑公司进行的一项稀释研究中的部分数据,该研究涉及75个HG-U95A基因芯片阵列。我们展示了这些数据的完全匹配和错配探针(PM和MM)值的一些常见特征,并检查了与被认为有缺陷、因此只产生噪声的探针的探针水平数据的方差-均值关系。我们解释了为什么需要使用探针水平强度将阵列相互归一化。然后,我们使用掺入数据检查PM和MM的行为,并评估三种常用的汇总指标:Affymetrix的(i)平均差异(AvDiff)和(ii)MAS 5.0信号,以及(iii)基于李和王乘法模型的表达指数(MBEI)。对探针水平数据的探索性数据分析催生了一种新的汇总指标,即背景调整、归一化和对数转换后的PM值的稳健多阵列平均值(RMA)。我们使用稀释研究数据评估这四种表达汇总指标,从偏差、方差以及(对于MBEI和RMA)模型拟合方面评估它们的行为。最后,我们使用掺入数据评估这些算法检测已知差异表达水平的能力。我们得出结论,使用RMA并使用去除探针特异性亲和力的线性模型为该量附加标准误差(SE)没有明显的缺点。

相似文献

1
Exploration, normalization, and summaries of high density oligonucleotide array probe level data.高密度寡核苷酸阵列探针水平数据的探索、标准化及汇总
Biostatistics. 2003 Apr;4(2):249-64. doi: 10.1093/biostatistics/4.2.249.
2
Statistical analysis of high-density oligonucleotide arrays: a multiplicative noise model.高密度寡核苷酸阵列的统计分析:一种乘性噪声模型。
Bioinformatics. 2002 Dec;18(12):1633-40. doi: 10.1093/bioinformatics/18.12.1633.
3
A new summarization method for Affymetrix probe level data.一种针对Affymetrix探针水平数据的新汇总方法。
Bioinformatics. 2006 Apr 15;22(8):943-9. doi: 10.1093/bioinformatics/btl033. Epub 2006 Feb 10.
4
SUM: a new way to incorporate mismatch probe measurements.SUM:一种纳入错配探针测量的新方法。
Genomics. 2004 Oct;84(4):767-77. doi: 10.1016/j.ygeno.2004.06.013.
5
Summaries of Affymetrix GeneChip probe level data.Affymetrix基因芯片探针水平数据摘要。
Nucleic Acids Res. 2003 Feb 15;31(4):e15. doi: 10.1093/nar/gng015.
6
A Distribution-Free Convolution Model for background correction of oligonucleotide microarray data.一种用于寡核苷酸微阵列数据背景校正的无分布卷积模型。
BMC Genomics. 2009 Jul 7;10 Suppl 1(Suppl 1):S19. doi: 10.1186/1471-2164-10-S1-S19.
7
An expression index for Affymetrix GeneChips based on the generalized logarithm.基于广义对数的Affymetrix基因芯片表达指数。
Bioinformatics. 2005 Nov 1;21(21):3983-9. doi: 10.1093/bioinformatics/bti665. Epub 2005 Sep 13.
8
Utilization of two sample t-test statistics from redundant probe sets to evaluate different probe set algorithms in GeneChip studies.利用来自冗余探针集的双样本t检验统计量来评估基因芯片研究中的不同探针集算法。
BMC Bioinformatics. 2006 Jan 10;7:12. doi: 10.1186/1471-2105-7-12.
9
Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error application.基于模型的寡核苷酸阵列分析:模型验证、设计问题及标准误差应用
Genome Biol. 2001;2(8):RESEARCH0032. doi: 10.1186/gb-2001-2-8-research0032. Epub 2001 Aug 3.
10
Interactively optimizing signal-to-noise ratios in expression profiling: project-specific algorithm selection and detection p-value weighting in Affymetrix microarrays.在表达谱分析中交互式优化信噪比:Affymetrix微阵列中特定项目的算法选择和检测p值加权
Bioinformatics. 2004 Nov 1;20(16):2534-44. doi: 10.1093/bioinformatics/bth280. Epub 2004 Apr 29.

引用本文的文献

1
Establishing a glycolysis-linked multigene prognostic signature in lung adenocarcinoma: a multicenter integrative approach.在肺腺癌中建立与糖酵解相关的多基因预后特征:一种多中心综合方法。
J Thorac Dis. 2025 Aug 31;17(8):6214-6228. doi: 10.21037/jtd-2025-1438. Epub 2025 Aug 28.
2
Identification and characterization of gene networks and key genes related to the high-yield production of milk in high-yield cows using meta-analysis of microarray data.利用微阵列数据的荟萃分析鉴定和表征高产奶牛高产奶相关的基因网络和关键基因。
Biochem Biophys Rep. 2025 Jun 14;43:102090. doi: 10.1016/j.bbrep.2025.102090. eCollection 2025 Sep.
3
Construction of prognostic risk prediction model of endometrial carcinoma based on bioinformatics analysis.
基于生物信息学分析构建子宫内膜癌预后风险预测模型
Medicine (Baltimore). 2025 Aug 29;104(35):e44193. doi: 10.1097/MD.0000000000044193.
4
Core Perturbomes of and Using a Machine Learning Approach.使用机器学习方法的[具体研究对象1]和[具体研究对象2]的核心扰动组。 (你原文中“of and ”表述不完整,这里是根据常见情况补充后翻译的,你可根据实际调整。)
Pathogens. 2025 Aug 7;14(8):788. doi: 10.3390/pathogens14080788.
5
Transformative insights from transcriptome analysis of colorectal cancer patient tissues: identification of four key prognostic genes.结直肠癌患者组织转录组分析的变革性见解:四个关键预后基因的鉴定
PeerJ. 2025 Aug 20;13:e19852. doi: 10.7717/peerj.19852. eCollection 2025.
6
Characterization of the tumor immune microenvironment in pregnancy-associated breast cancer through multiplex immunohistochemistry and transcriptome analyses.通过多重免疫组织化学和转录组分析对妊娠相关乳腺癌的肿瘤免疫微环境进行表征。
Breast Cancer Res. 2025 Aug 26;27(1):154. doi: 10.1186/s13058-025-02097-4.
7
miR-361-5p contributes to the pathogenesis of Alzheimer's disease.微小RNA-361-5p促成阿尔茨海默病的发病机制。
Sci Rep. 2025 Aug 23;15(1):31046. doi: 10.1038/s41598-025-17112-z.
8
Integrative machine learning and RT-qPCR analysis identify key stress-responsive genes in Thermus thermophilus HB8.整合机器学习和逆转录定量聚合酶链反应分析鉴定嗜热栖热菌HB8中的关键应激反应基因。
Genetica. 2025 Aug 20;153(1):28. doi: 10.1007/s10709-025-00243-6.
9
NADPH oxidase 1/4 dual inhibition impairs transforming growth factor-beta protumorigenic effects in cholangiocarcinoma cancer-associated fibroblasts.NADPH氧化酶1/4双重抑制削弱胆管癌癌相关成纤维细胞中转化生长因子-β的促肿瘤作用。
Signal Transduct Target Ther. 2025 Aug 18;10(1):257. doi: 10.1038/s41392-025-02347-z.
10
Downregulation of and its prognostic significance in squamous cell carcinoma of the cervix: a bioinformatics analysis based on public datasets.在子宫颈鳞状细胞癌中[具体基因或因子名称]的下调及其预后意义:基于公共数据集的生物信息学分析
Transl Cancer Res. 2025 Jul 30;14(7):4034-4045. doi: 10.21037/tcr-2025-307. Epub 2025 Jul 25.