一种用于Affymetrix基因芯片阵列的无分布汇总方法。

A distribution free summarization method for Affymetrix GeneChip arrays.

作者信息

Chen Zhongxue, McGee Monnie, Liu Qingzhong, Scheuermann Richard H

机构信息

Department of Statistical Science, Southern Methodist University, Dallas, TX 75275, USA.

出版信息

Bioinformatics. 2007 Feb 1;23(3):321-7. doi: 10.1093/bioinformatics/btl609. Epub 2006 Dec 5.

DOI:10.1093/bioinformatics/btl609

PMID:17148508

Abstract

MOTIVATION

Affymetrix GeneChip arrays require summarization in order to combine the probe-level intensities into one value representing the expression level of a gene. However, probe intensity measurements are expected to be affected by different levels of non-specific- and cross-hybridization to non-specific transcripts. Here, we present a new summarization technique, the Distribution Free Weighted method (DFW), which uses information about the variability in probe behavior to estimate the extent of non-specific and cross-hybridization for each probe. The contribution of the probe is weighted accordingly during summarization, without making any distributional assumptions for the probe-level data.

RESULTS

We compare DFW with several popular summarization methods on spike-in datasets, via both our own calculations and the 'Affycomp II' competition. The results show that DFW outperforms other methods when sensitivity and specificity are considered simultaneously. With the Affycomp spike-in datasets, the area under the receiver operating characteristic curve for DFW is nearly 1.0 (a perfect value), indicating that DFW can identify all differentially expressed genes with a few false positives. The approach used is also computationally faster than most other methods in current use.

AVAILABILITY

The R code for DFW is available upon request.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

Affymetrix基因芯片阵列需要进行数据汇总，以便将探针水平的强度合并为一个代表基因表达水平的值。然而，探针强度测量预计会受到与非特异性转录本不同程度的非特异性杂交和交叉杂交的影响。在此，我们提出一种新的数据汇总技术，即无分布加权法（DFW），该方法利用探针行为变异性的信息来估计每个探针的非特异性杂交和交叉杂交程度。在汇总过程中，会相应地对探针的贡献进行加权，而无需对探针水平的数据做出任何分布假设。

结果

我们通过自己的计算以及“Affycomp II”竞赛，在掺入数据集上比较了DFW与几种流行的数据汇总方法。结果表明，在同时考虑敏感性和特异性时，DFW优于其他方法。对于Affycomp掺入数据集，DFW的受试者工作特征曲线下面积接近1.0（完美值），表明DFW可以识别所有差异表达基因，且假阳性较少。所使用的方法在计算上也比当前使用的大多数其他方法更快。

可用性

可根据要求提供DFW的R代码。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

A distribution free summarization method for Affymetrix GeneChip arrays.一种用于Affymetrix基因芯片阵列的无分布汇总方法。

Bioinformatics. 2007 Feb 1;23(3):321-7. doi: 10.1093/bioinformatics/btl609. Epub 2006 Dec 5.

A new summarization method for Affymetrix probe level data.一种针对Affymetrix探针水平数据的新汇总方法。

Bioinformatics. 2006 Apr 15;22(8):943-9. doi: 10.1093/bioinformatics/btl033. Epub 2006 Feb 10.

Joint estimation of calibration and expression for high-density oligonucleotide arrays.高密度寡核苷酸阵列校准与表达的联合估计

Bioinformatics. 2006 Oct 1;22(19):2381-7. doi: 10.1093/bioinformatics/btl399. Epub 2006 Jul 28.

Exploiting sample variability to enhance multivariate analysis of microarray data.利用样本变异性增强微阵列数据的多变量分析。

Bioinformatics. 2007 Oct 15;23(20):2733-40. doi: 10.1093/bioinformatics/btm441. Epub 2007 Sep 7.

Characterization of mismatch and high-signal intensity probes associated with Affymetrix genechips.与Affymetrix基因芯片相关的错配和高信号强度探针的表征

Bioinformatics. 2007 Aug 15;23(16):2088-95. doi: 10.1093/bioinformatics/btm306. Epub 2007 Jun 6.

Probe rank approaches for gene selection in oligonucleotide arrays with a small number of replicates.探索少量重复样本的寡核苷酸阵列中基因选择的排序方法。

Bioinformatics. 2005 Jun 15;21(12):2861-6. doi: 10.1093/bioinformatics/bti413. Epub 2005 Apr 6.

Comparison of Affymetrix GeneChip expression measures.Affymetrix基因芯片表达量测量结果的比较

Bioinformatics. 2006 Apr 1;22(7):789-94. doi: 10.1093/bioinformatics/btk046. Epub 2006 Jan 12.

Segmentation and intensity estimation of microarray images using a gamma-t mixture model.使用伽马-t混合模型对微阵列图像进行分割和强度估计。

Bioinformatics. 2007 Feb 15;23(4):458-65. doi: 10.1093/bioinformatics/btl630. Epub 2006 Dec 12.

Multidimensional local false discovery rate for microarray studies.微阵列研究的多维局部错误发现率

Bioinformatics. 2006 Mar 1;22(5):556-65. doi: 10.1093/bioinformatics/btk013. Epub 2005 Dec 20.

An enhanced quantile approach for assessing differential gene expressions.一种用于评估差异基因表达的增强分位数方法。

Biometrics. 2008 Jun;64(2):449-57. doi: 10.1111/j.1541-0420.2007.00903.x. Epub 2008 Mar 5.

引用本文的文献

Simulating neuronal development: exploring potential mechanisms for central nervous system metastasis in acute lymphoblastic leukemia.模拟神经元发育：探索急性淋巴细胞白血病中枢神经系统转移的潜在机制

Front Oncol. 2024 Jan 4;13:1331802. doi: 10.3389/fonc.2023.1331802. eCollection 2023.

Gene Expression over Time during Cell Transformation Due to Non-Genotoxic Carcinogen Treatment of Bhas 42 Cells.Bhas 42 细胞经非遗传毒性致癌物处理后细胞转化过程中的基因表达随时间的变化。

Int J Mol Sci. 2022 Mar 16;23(6):3216. doi: 10.3390/ijms23063216.

Linking Diabetes Mellitus with Alzheimer's Disease: Bioinformatics Analysis for the Potential Pathways and Characteristic Genes.将糖尿病与阿尔茨海默病联系起来：潜在途径和特征基因的生物信息学分析

Biochem Genet. 2022 Jun;60(3):1049-1075. doi: 10.1007/s10528-021-10154-8. Epub 2021 Nov 15.

Network Pharmacology Reveals That Resveratrol Can Alleviate COVID-19-Related Hyperinflammation.网络药理学揭示白藜芦醇可缓解 COVID-19 相关的过度炎症。

Dis Markers. 2021 Sep 22;2021:4129993. doi: 10.1155/2021/4129993. eCollection 2021.

Bioinformatics analysis identified shared differentially expressed genes as potential biomarkers for Hashimoto's thyroiditis-related papillary thyroid cancer.生物信息学分析鉴定出共同差异表达基因作为桥本甲状腺炎相关甲状腺乳头状癌的潜在生物标志物。

Int J Med Sci. 2021 Aug 13;18(15):3478-3487. doi: 10.7150/ijms.63402. eCollection 2021.

CXCL12/CXCR4 axis as a key mediator in atrial fibrillation via bioinformatics analysis and functional identification.通过生物信息学分析和功能鉴定发现 CXCL12/CXCR4 轴作为心房颤动的关键介质。

Cell Death Dis. 2021 Aug 27;12(9):813. doi: 10.1038/s41419-021-04109-5.

The transcriptome of circulating cells indicates potential biomarkers and therapeutic targets in the course of hypertension-related myocardial infarction.循环细胞的转录组揭示了高血压相关性心肌梗死病程中的潜在生物标志物和治疗靶点。

Genes Dis. 2020 Jan 21;8(4):555-568. doi: 10.1016/j.gendis.2020.01.007. eCollection 2021 Jul.

In Silico Prediction of Molecular Targets of Astragaloside IV for Alleviation of COVID-19 Hyperinflammation by Systems Network Pharmacology and Bioinformatic Gene Expression Analysis.基于系统网络药理学和生物信息基因表达分析的黄芪甲苷减轻 COVID-19 过度炎症分子靶点的计算机模拟预测

Front Pharmacol. 2020 Sep 16;11:556984. doi: 10.3389/fphar.2020.556984. eCollection 2020.

A maple syrup extract alters lipid metabolism in obese type 2 diabetic model mice.一种枫糖浆提取物可改变肥胖2型糖尿病模型小鼠的脂质代谢。

Nutr Metab (Lond). 2019 Dec 4;16:84. doi: 10.1186/s12986-019-0403-2. eCollection 2019.

Ribosome Reconstruction during Recovery from High-Hydrostatic-Pressure-Induced Injury in Bacillus subtilis.枯草芽孢杆菌在从高静水压诱导损伤中恢复过程中的核糖体重建

Appl Environ Microbiol. 2019 Dec 13;86(1). doi: 10.1128/AEM.01640-19.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种用于Affymetrix基因芯片阵列的无分布汇总方法。

A distribution free summarization method for Affymetrix GeneChip arrays.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

SUPPLEMENTARY INFORMATION

动机

结果

可用性

补充信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献