• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在cDNA微阵列数据中普遍存在的三参数对数正态分布及其在参数数据处理中的应用。

Three-parameter lognormal distribution ubiquitously found in cDNA microarray data and its application to parametric data treatment.

作者信息

Konishi Tomokazu

机构信息

Faculty of Bioresource Sciences, Akita Prefectural University, Akita 010-0195, Japan.

出版信息

BMC Bioinformatics. 2004 Jan 13;5:5. doi: 10.1186/1471-2105-5-5.

DOI:10.1186/1471-2105-5-5
PMID:14718068
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC333424/
Abstract

BACKGROUND

To cancel experimental variations, microarray data must be normalized prior to analysis. Where an appropriate model for statistical data distribution is available, a parametric method can normalize a group of data sets that have common distributions. Although such models have been proposed for microarray data, they have not always fit the distribution of real data and thus have been inappropriate for normalization. Consequently, microarray data in most cases have been normalized with non-parametric methods that adjust data in a pair-wise manner. However, data analysis and the integration of resultant knowledge among experiments have been difficult, since such normalization concepts lack a universal standard.

RESULTS

A three-parameter lognormal distribution model was tested on over 300 sets of microarray data. The model treats the hybridization background, which is difficult to identify from images of hybridization, as one of the parameters. A rigorous coincidence of the model to data sets was found, proving the model's appropriateness for microarray data. In fact, a closer fitting to Northern analysis was obtained. The model showed inconsistency only at very strong or weak data intensities. Measurement of z-scores as well as calculated ratios was reproducible only among data in the model-consistent intensity range; also, the ratios were independent of signal intensity at the corresponding range.

CONCLUSION

The model could provide a universal standard for data, simplifying data analysis and knowledge integration. It was deduced that the ranges of inconsistency were caused by experimental errors or additive noise in the data; therefore, excluding the data corresponding to those marginal ranges will prevent misleading analytical conclusions.

摘要

背景

为消除实验差异,微阵列数据在分析前必须进行标准化处理。当存在适用于统计数据分布的模型时,参数化方法可对具有共同分布的一组数据集进行标准化。尽管已针对微阵列数据提出了此类模型,但它们并不总是能拟合实际数据的分布,因此不适用于标准化处理。因此,在大多数情况下,微阵列数据已采用以成对方式调整数据的非参数方法进行标准化。然而,由于此类标准化概念缺乏通用标准,数据分析以及实验间所得知识的整合一直很困难。

结果

在300多组微阵列数据上测试了三参数对数正态分布模型。该模型将难以从杂交图像中识别的杂交背景视为参数之一。发现该模型与数据集高度吻合,证明了该模型适用于微阵列数据。实际上,与Northern分析的拟合度更高。该模型仅在数据强度非常强或非常弱时表现出不一致。z分数的测量以及计算出的比率仅在模型一致的强度范围内的数据之间是可重复的;此外,在相应范围内,比率与信号强度无关。

结论

该模型可为数据提供通用标准,简化数据分析和知识整合。据推断,不一致的范围是由实验误差或数据中的加性噪声引起的;因此,排除与那些边缘范围相对应的数据将防止得出误导性的分析结论。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/10dfdcc37c64/1471-2105-5-5-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/4a5a99c9c1e7/1471-2105-5-5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/e715957884ca/1471-2105-5-5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/073d249bf15e/1471-2105-5-5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/1fb52ce01816/1471-2105-5-5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/538546d2c0c4/1471-2105-5-5-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/cf42efcab84c/1471-2105-5-5-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/0b68b8f7121a/1471-2105-5-5-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/10dfdcc37c64/1471-2105-5-5-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/4a5a99c9c1e7/1471-2105-5-5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/e715957884ca/1471-2105-5-5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/073d249bf15e/1471-2105-5-5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/1fb52ce01816/1471-2105-5-5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/538546d2c0c4/1471-2105-5-5-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/cf42efcab84c/1471-2105-5-5-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/0b68b8f7121a/1471-2105-5-5-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc7b/333424/10dfdcc37c64/1471-2105-5-5-8.jpg

相似文献

1
Three-parameter lognormal distribution ubiquitously found in cDNA microarray data and its application to parametric data treatment.在cDNA微阵列数据中普遍存在的三参数对数正态分布及其在参数数据处理中的应用。
BMC Bioinformatics. 2004 Jan 13;5:5. doi: 10.1186/1471-2105-5-5.
2
Segmentation and intensity estimation of microarray images using a gamma-t mixture model.使用伽马-t混合模型对微阵列图像进行分割和强度估计。
Bioinformatics. 2007 Feb 15;23(4):458-65. doi: 10.1093/bioinformatics/btl630. Epub 2006 Dec 12.
3
Effect of local background intensities in the normalization of cDNA microarray data with a skewed expression profiles.局部背景强度在具有偏态表达谱的cDNA微阵列数据标准化中的作用。
Exp Mol Med. 2002 Jul 31;34(3):224-32. doi: 10.1038/emm.2002.31.
4
Systematic approaches for incorporating control spots and data quality information to improve normalization of cDNA microarray data.纳入对照点和数据质量信息以改善cDNA微阵列数据标准化的系统方法。
J Biopharm Stat. 2007;17(3):415-31. doi: 10.1080/10543400701199544.
5
Two-stage normalization using background intensities in cDNA microarray data.利用cDNA微阵列数据中的背景强度进行两阶段标准化。
BMC Bioinformatics. 2004 Jul 21;5:97. doi: 10.1186/1471-2105-5-97.
6
Statistical monitoring of weak spots for improvement of normalization and ratio estimates in microarrays.用于改进微阵列归一化和比率估计的弱点的统计监测。
BMC Bioinformatics. 2004 May 5;5:53. doi: 10.1186/1471-2105-5-53.
7
Unfolding of microarray data.微阵列数据的展开
J Comput Biol. 2001;8(4):443-61. doi: 10.1089/106652701752236232.
8
Analysis of microarray data using Z score transformation.使用Z分数转换分析微阵列数据。
J Mol Diagn. 2003 May;5(2):73-81. doi: 10.1016/S1525-1578(10)60455-2.
9
A generalized additive model for microarray gene expression data analysis.一种用于微阵列基因表达数据分析的广义相加模型。
J Biopharm Stat. 2004 Aug;14(3):553-73. doi: 10.1081/BIP-200025648.
10
Evaluation of experimental designs for two-color cDNA microarrays.双色 cDNA 微阵列实验设计的评估
J Comput Biol. 2005 Nov;12(9):1202-20. doi: 10.1089/cmb.2005.12.1202.

引用本文的文献

1
Theanine, a Tea-Leaf-Specific Amino Acid, Alleviates Stress through Modulation of Npas4 Expression in Group-Housed Older Mice.茶氨酸,一种茶叶特有的氨基酸,通过调节群居老年小鼠 Npas4 的表达来缓解压力。
Int J Mol Sci. 2023 Feb 16;24(4):3983. doi: 10.3390/ijms24043983.
2
L-Arginine Exerts Excellent Anti-Stress Effects on Stress-Induced Shortened Lifespan, Cognitive Decline and Depression.精氨酸对压力引起的寿命缩短、认知能力下降和抑郁有极好的抗应激作用。
Int J Mol Sci. 2021 Jan 6;22(2):508. doi: 10.3390/ijms22020508.
3
Mutation in Sodium-Glucose Cotransporter 2 Results in Down-Regulation of Amyloid Beta (A4) Precursor-Like Protein 1 in Young Age, Which May Lead to Poor Memory Retention in Old Age.

本文引用的文献

1
Ratio-based decisions and the quantitative analysis of cDNA microarray images.基于比率的决策与cDNA微阵列图像的定量分析
J Biomed Opt. 1997 Oct;2(4):364-74. doi: 10.1117/12.281504.
2
Arabidopsis microarrays identify conserved and differentially expressed genes involved in shoot growth and development from distantly related plant species.拟南芥微阵列鉴定出参与远缘植物物种茎生长和发育的保守且差异表达的基因。
Plant J. 2003 Apr;34(1):125-34. doi: 10.1046/j.1365-313x.2003.01706.x.
3
The Stanford Microarray Database: data access and quality assessment tools.
钠-葡萄糖共转运蛋白 2 基因突变导致年轻时期淀粉样前体样蛋白 1 下调,这可能导致老年时记忆保留能力差。
Int J Mol Sci. 2020 Aug 4;21(15):5579. doi: 10.3390/ijms21155579.
4
Green Tea Catechins Trigger Immediate-Early Genes in the Hippocampus and Prevent Cognitive Decline and Lifespan Shortening.绿茶儿茶素可引发海马体即刻早期基因表达,并预防认知能力下降和寿命缩短。
Molecules. 2020 Mar 25;25(7):1484. doi: 10.3390/molecules25071484.
5
Theanine, the Main Amino Acid in Tea, Prevents Stress-Induced Brain Atrophy by Modifying Early Stress Responses.茶氨酸,茶叶中的主要氨基酸,通过调节早期应激反应预防应激诱导的大脑萎缩。
Nutrients. 2020 Jan 8;12(1):174. doi: 10.3390/nu12010174.
6
Finite-size effects in transcript sequencing count distribution: its power-law correction necessarily precedes downstream normalization and comparative analysis.转录测序计数分布中的有限大小效应:幂律修正必然先于下游的标准化和比较分析。
Biol Direct. 2018 Feb 12;13(1):2. doi: 10.1186/s13062-018-0204-y.
7
Allantoin, a stress-related purine metabolite, can activate jasmonate signaling in a MYC2-regulated and abscisic acid-dependent manner.尿囊素是一种与胁迫相关的嘌呤代谢产物,它能够以一种受MYC2调控且依赖脱落酸的方式激活茉莉酸信号传导。
J Exp Bot. 2016 Apr;67(8):2519-2532. doi: 10.1093/jxb/erw071. Epub 2016 Mar 1.
8
MeSH ORA framework: R/Bioconductor packages to support MeSH over-representation analysis.医学主题词表ORA框架:用于支持医学主题词表过度表达分析的R/Bioconductor软件包。
BMC Bioinformatics. 2015 Feb 15;16:45. doi: 10.1186/s12859-015-0453-z.
9
Pyramidal neurons in the superficial layers of rat retrosplenial cortex exhibit a late-spiking firing property.大鼠后穹窿皮质浅层的锥体神经元表现出迟发放电的特性。
Brain Struct Funct. 2013 Jan;218(1):239-54. doi: 10.1007/s00429-012-0398-1. Epub 2012 Mar 1.
10
High-throughput processing and normalization of one-color microarrays for transcriptional meta-analyses.高通量处理和归一化单色微阵列用于转录组元分析。
BMC Bioinformatics. 2011 Oct 18;12 Suppl 10(Suppl 10):S2. doi: 10.1186/1471-2105-12-S10-S2.
斯坦福微阵列数据库:数据访问与质量评估工具。
Nucleic Acids Res. 2003 Jan 1;31(1):94-6. doi: 10.1093/nar/gkg078.
4
Microarray data normalization and transformation.微阵列数据归一化与转换
Nat Genet. 2002 Dec;32 Suppl:496-501. doi: 10.1038/ng1032.
5
Fundamentals of experimental design for cDNA microarrays.cDNA微阵列实验设计基础
Nat Genet. 2002 Dec;32 Suppl:490-5. doi: 10.1038/ng1031.
6
A new non-linear normalization method for reducing variability in DNA microarray experiments.一种用于减少DNA微阵列实验变异性的新型非线性归一化方法。
Genome Biol. 2002 Aug 30;3(9):research0048. doi: 10.1186/gb-2002-3-9-research0048.
7
Normalization and analysis of DNA microarray data by self-consistency and local regression.通过自一致性和局部回归对DNA微阵列数据进行归一化和分析。
Genome Biol. 2002 Jun 28;3(7):RESEARCH0037. doi: 10.1186/gb-2002-3-7-research0037.
8
A variance-stabilizing transformation for gene-expression microarray data.一种用于基因表达微阵列数据的方差稳定变换。
Bioinformatics. 2002;18 Suppl 1:S105-10. doi: 10.1093/bioinformatics/18.suppl_1.s105.
9
Variance stabilization applied to microarray data calibration and to the quantification of differential expression.方差稳定化应用于微阵列数据校准和差异表达定量分析。
Bioinformatics. 2002;18 Suppl 1:S96-104. doi: 10.1093/bioinformatics/18.suppl_1.s96.
10
Deriving quantitative conclusions from microarray expression data.
Bioinformatics. 2002 Jul;18(7):961-70. doi: 10.1093/bioinformatics/18.7.961.