• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于寡核苷酸表达值不呈正态分布的说明。

A note on oligonucleotide expression values not being normally distributed.

作者信息

Hardin Johanna, Wilson Jason

机构信息

Department of Mathematics, Pomona College, 610 North College Avenue, Claremont, CA 91711, USA.

出版信息

Biostatistics. 2009 Jul;10(3):446-50. doi: 10.1093/biostatistics/kxp003. Epub 2009 Mar 10.

DOI:10.1093/biostatistics/kxp003
PMID:19276243
Abstract

Novel techniques for analyzing microarray data are constantly being developed. Though many of the methods contribute to biological discoveries, inability to properly evaluate the novel techniques limits their ability to advance science. Because the underlying distribution of microarray data is unknown, novel methods are typically tested against the assumed normal distribution. However, microarray data are not, in fact, normally distributed, and assuming so can have misleading consequences. Using an Affymetrix technical replicate spike-in data set, we show that oligonucleotide expression values are not normally distributed for any of the standard methods for calculating expression values. The resulting data tend to have a large proportion of skew and heavy tailed genes. Additionally, we show that standard methods can give unexpected and misleading results when the data are not well approximated by the normal distribution. Robust methods are therefore recommended when analyzing microarray data. Additionally, new techniques should be evaluated with skewed and/or heavy-tailed data distributions.

摘要

用于分析微阵列数据的新技术不断涌现。尽管许多方法有助于生物学发现,但无法正确评估这些新技术限制了它们推动科学发展的能力。由于微阵列数据的潜在分布未知,新方法通常根据假定的正态分布进行测试。然而,事实上微阵列数据并非正态分布,如此假设可能会产生误导性后果。使用Affymetrix技术重复掺入数据集,我们表明,对于任何计算表达值的标准方法,寡核苷酸表达值都不是正态分布的。所得数据往往有很大比例的偏态和重尾基因。此外,我们表明,当数据不能很好地用正态分布近似时,标准方法可能会给出意想不到的和误导性的结果。因此,在分析微阵列数据时推荐使用稳健方法。此外,新技术应该用偏态和/或重尾数据分布进行评估。

相似文献

1
A note on oligonucleotide expression values not being normally distributed.关于寡核苷酸表达值不呈正态分布的说明。
Biostatistics. 2009 Jul;10(3):446-50. doi: 10.1093/biostatistics/kxp003. Epub 2009 Mar 10.
2
A moment-based method for estimating the proportion of true null hypotheses and its application to microarray gene expression data.一种基于时刻估计真零假设比例的方法及其在微阵列基因表达数据中的应用。
Biostatistics. 2007 Oct;8(4):744-55. doi: 10.1093/biostatistics/kxm002. Epub 2007 Jan 22.
3
Robust segmentation and analysis of DNA microarray spots using an adaptative split and merge algorithm.
Comput Methods Programs Biomed. 2006 Feb;81(2):174-80. doi: 10.1016/j.cmpb.2005.12.001. Epub 2006 Jan 19.
4
Probability fold change: a robust computational approach for identifying differentially expressed gene lists.概率倍数变化:一种用于识别差异表达基因列表的稳健计算方法。
Comput Methods Programs Biomed. 2009 Feb;93(2):124-39. doi: 10.1016/j.cmpb.2008.07.013. Epub 2008 Oct 7.
5
A comparison of normalization techniques for microRNA microarray data.微小RNA微阵列数据标准化技术的比较
Stat Appl Genet Mol Biol. 2008;7(1):Article22. doi: 10.2202/1544-6115.1287. Epub 2008 Jul 21.
6
Mistakes in validating the accuracy of a prediction classifier in high-dimensional but small-sample microarray data.在高维但小样本的微阵列数据中验证预测分类器准确性时的错误。
Stat Methods Med Res. 2008 Dec;17(6):635-42. doi: 10.1177/0962280207084839. Epub 2008 Mar 28.
7
Random data set generation to support microarray analysis.
Methods Enzymol. 2006;411:312-25. doi: 10.1016/S0076-6879(06)11016-2.
8
Evaluation of nine strategies for analyzing a cDNA toxicology microarray data set.
J Biopharm Stat. 2005;15(3):403-18. doi: 10.1081/BIP-200056518.
9
A simple and robust algorithm for microarray data clustering based on gene population-variance ratio metric.一种基于基因群体方差比度量的简单且稳健的微阵列数据聚类算法。
Biotechnol J. 2009 Sep;4(9):1357-61. doi: 10.1002/biot.200800219.
10
On the probability of correct selection for large k populations, with application to microarray data.关于从大量k个总体中正确选择的概率及其在微阵列数据中的应用。
Biom J. 2008 Oct;50(5):870-83. doi: 10.1002/bimj.200710457.

引用本文的文献

1
Generating synthetic multidimensional molecular time series data for machine learning: considerations.为机器学习生成合成多维分子时间序列数据:注意事项。
Front Syst Biol. 2023 Jul 25;3:1188009. doi: 10.3389/fsysb.2023.1188009. eCollection 2023.
2
Hepcidin expression in the trigeminal ganglion and the oral mucosa in an oral ulcerative mucositis rat model.三叉神经节和口腔黏膜炎大鼠模型中hepcidin 的表达。
PLoS One. 2023 Apr 20;18(4):e0284617. doi: 10.1371/journal.pone.0284617. eCollection 2023.
3
Low protein expression enhances phenotypic evolvability by intensifying selection on folding stability.
低蛋白表达通过强化折叠稳定性选择来增强表型可进化性。
Nat Ecol Evol. 2022 Aug;6(8):1155-1164. doi: 10.1038/s41559-022-01797-w. Epub 2022 Jul 7.
4
Altered miRNA expression network in locus coeruleus of depressed suicide subjects.抑郁自杀者蓝斑核中改变的 miRNA 表达网络。
Sci Rep. 2017 Jun 29;7(1):4387. doi: 10.1038/s41598-017-04300-9.
5
Systems Biomedicine of Rabies Delineates the Affected Signaling Pathways.狂犬病的系统生物医学描绘了受影响的信号通路。
Front Microbiol. 2016 Nov 7;7:1688. doi: 10.3389/fmicb.2016.01688. eCollection 2016.
6
PERT: a method for expression deconvolution of human blood samples from varied microenvironmental and developmental conditions.PERT:一种用于从不同微环境和发育条件的人类血液样本中进行表达解卷积的方法。
PLoS Comput Biol. 2012;8(12):e1002838. doi: 10.1371/journal.pcbi.1002838. Epub 2012 Dec 20.
7
Non-gaussian distributions affect identification of expression patterns, functional annotation, and prospective classification in human cancer genomes.非高斯分布会影响人类癌症基因组中表达模式、功能注释和前瞻性分类的识别。
PLoS One. 2012;7(10):e46935. doi: 10.1371/journal.pone.0046935. Epub 2012 Oct 31.
8
β-empirical Bayes inference and model diagnosis of microarray data.基于经验贝叶斯的微阵列数据分析推断和模型诊断。
BMC Bioinformatics. 2012 Jun 19;13:135. doi: 10.1186/1471-2105-13-135.
9
Modeling skewness in human transcriptomes.人类转录组偏度建模。
PLoS One. 2012;7(6):e38919. doi: 10.1371/journal.pone.0038919. Epub 2012 Jun 11.
10
MicroRNA expression is down-regulated and reorganized in prefrontal cortex of depressed suicide subjects.抑郁自杀患者前额皮质中的 microRNA 表达下调和重排。
PLoS One. 2012;7(3):e33201. doi: 10.1371/journal.pone.0033201. Epub 2012 Mar 9.