• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

如何做出决定?从短寡核苷酸阵列数据计算基因表达的不同方法会得出不同的结果。

How to decide? Different methods of calculating gene expression from short oligonucleotide array data will give different results.

作者信息

Millenaar Frank F, Okyere John, May Sean T, van Zanten Martijn, Voesenek Laurentius A C J, Peeters Anton J M

机构信息

Plant Ecophysiology, Institute of Environmental Biology, Faculty of Science, Utrecht University, Sorbonnelaan 16, 3584 CA Utrecht, The Netherlands.

出版信息

BMC Bioinformatics. 2006 Mar 15;7:137. doi: 10.1186/1471-2105-7-137.

DOI:10.1186/1471-2105-7-137
PMID:16539732
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1431565/
Abstract

BACKGROUND

Short oligonucleotide arrays for transcript profiling have been available for several years. Generally, raw data from these arrays are analysed with the aid of the Microarray Analysis Suite or GeneChip Operating Software (MAS or GCOS) from Affymetrix. Recently, more methods to analyse the raw data have become available. Ideally all these methods should come up with more or less the same results. We set out to evaluate the different methods and include work on our own data set, in order to test which method gives the most reliable results.

RESULTS

Calculating gene expression with 6 different algorithms (MAS5, dChip PMMM, dChip PM, RMA, GC-RMA and PDNN) using the same (Arabidopsis) data, results in different calculated gene expression levels. Consequently, depending on the method used, different genes will be identified as differentially regulated. Surprisingly, there was only 27 to 36% overlap between the different methods. Furthermore, 47.5% of the genes/probe sets showed good correlation between the mismatch and perfect match intensities.

CONCLUSION

After comparing six algorithms, RMA gave the most reproducible results and showed the highest correlation coefficients with Real Time RT-PCR data on genes identified as differentially expressed by all methods. However, we were not able to verify, by Real Time RT-PCR, the microarray results for most genes that were solely calculated by RMA. Furthermore, we conclude that subtraction of the mismatch intensity from the perfect match intensity results most likely in a significant underestimation for at least 47.5% of the expression values. Not one algorithm produced significant expression values for genes present in quantities below 1 pmol. If the only purpose of the microarray experiment is to find new candidate genes, and too many genes are found, then mutual exclusion of the genes predicted by contrasting methods can be used to narrow down the list of new candidate genes by 64 to 73%.

摘要

背景

用于转录谱分析的短寡核苷酸阵列已经问世数年。通常,这些阵列的原始数据借助Affymetrix的微阵列分析套件或基因芯片操作软件(MAS或GCOS)进行分析。最近,有了更多分析原始数据的方法。理想情况下,所有这些方法应该得出大致相同的结果。我们着手评估不同的方法,并纳入对我们自己数据集的研究,以测试哪种方法能给出最可靠的结果。

结果

使用相同的(拟南芥)数据,用6种不同算法(MAS5、dChip PMMM、dChip PM、RMA、GC - RMA和PDNN)计算基因表达,得出不同的计算基因表达水平。因此,根据所使用的方法,会鉴定出不同的差异调节基因。令人惊讶的是,不同方法之间只有27%至36%的重叠。此外,47.5%的基因/探针集在错配和完全匹配强度之间显示出良好的相关性。

结论

在比较六种算法后,RMA给出了最可重复的结果,并且与所有方法鉴定为差异表达的基因的实时RT - PCR数据显示出最高的相关系数。然而,我们无法通过实时RT - PCR验证大多数仅由RMA计算的基因的微阵列结果。此外,我们得出结论,从完全匹配强度中减去错配强度很可能导致至少47.5%的表达值被显著低估。没有一种算法能为含量低于1皮摩尔的基因产生显著的表达值。如果微阵列实验的唯一目的是寻找新的候选基因,并且发现了太多基因,那么通过对比方法预测的基因相互排除可用于将新候选基因列表缩小64%至73%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/96c135313fb0/1471-2105-7-137-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/1a8724c1491e/1471-2105-7-137-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/c3bb11df6b59/1471-2105-7-137-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/8a8ad1027a39/1471-2105-7-137-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/6a75289185eb/1471-2105-7-137-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/b46fde947c57/1471-2105-7-137-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/588834a79fcd/1471-2105-7-137-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/96c135313fb0/1471-2105-7-137-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/1a8724c1491e/1471-2105-7-137-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/c3bb11df6b59/1471-2105-7-137-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/8a8ad1027a39/1471-2105-7-137-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/6a75289185eb/1471-2105-7-137-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/b46fde947c57/1471-2105-7-137-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/588834a79fcd/1471-2105-7-137-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd73/1431565/96c135313fb0/1471-2105-7-137-7.jpg

相似文献

1
How to decide? Different methods of calculating gene expression from short oligonucleotide array data will give different results.如何做出决定?从短寡核苷酸阵列数据计算基因表达的不同方法会得出不同的结果。
BMC Bioinformatics. 2006 Mar 15;7:137. doi: 10.1186/1471-2105-7-137.
2
Interactively optimizing signal-to-noise ratios in expression profiling: project-specific algorithm selection and detection p-value weighting in Affymetrix microarrays.在表达谱分析中交互式优化信噪比:Affymetrix微阵列中特定项目的算法选择和检测p值加权
Bioinformatics. 2004 Nov 1;20(16):2534-44. doi: 10.1093/bioinformatics/bth280. Epub 2004 Apr 29.
3
Utilization of two sample t-test statistics from redundant probe sets to evaluate different probe set algorithms in GeneChip studies.利用来自冗余探针集的双样本t检验统计量来评估基因芯片研究中的不同探针集算法。
BMC Bioinformatics. 2006 Jan 10;7:12. doi: 10.1186/1471-2105-7-12.
4
The utility of MAS5 expression summary and detection call algorithms.MAS5表达汇总及检测调用算法的效用。
BMC Bioinformatics. 2007 Jul 30;8:273. doi: 10.1186/1471-2105-8-273.
5
Normality of oligonucleotide microarray data and implications for parametric statistical analyses.寡核苷酸微阵列数据的正态性及其对参数统计分析的影响。
Bioinformatics. 2003 Nov 22;19(17):2254-62. doi: 10.1093/bioinformatics/btg311.
6
A comparison of statistical methods for analysis of high density oligonucleotide array data.用于分析高密度寡核苷酸阵列数据的统计方法比较。
Bioinformatics. 2003 Aug 12;19(12):1469-76. doi: 10.1093/bioinformatics/btg202.
7
Evaluation of methods for oligonucleotide array data via quantitative real-time PCR.通过定量实时PCR评估寡核苷酸阵列数据的方法
BMC Bioinformatics. 2006 Jan 17;7:23. doi: 10.1186/1471-2105-7-23.
8
A high performance test of differential gene expression for oligonucleotide arrays.寡核苷酸阵列差异基因表达的高性能测试。
Genome Biol. 2003;4(10):R67. doi: 10.1186/gb-2003-4-10-r67. Epub 2003 Sep 10.
9
Statistical analysis of high-density oligonucleotide arrays: a multiplicative noise model.高密度寡核苷酸阵列的统计分析:一种乘性噪声模型。
Bioinformatics. 2002 Dec;18(12):1633-40. doi: 10.1093/bioinformatics/18.12.1633.
10
Effects of filtering by Present call on analysis of microarray experiments.当前通话过滤对微阵列实验分析的影响。
BMC Bioinformatics. 2006 Jan 31;7:49. doi: 10.1186/1471-2105-7-49.

引用本文的文献

1
Prediction and expression analysis of deleterious nonsynonymous SNPs of Arabidopsis ACD11 gene by combining computational algorithms and molecular docking approach.通过结合计算算法和分子对接方法预测和分析拟南芥 ACD11 基因的有害非同义 SNP 及其表达。
PLoS Comput Biol. 2022 Jun 16;18(6):e1009539. doi: 10.1371/journal.pcbi.1009539. eCollection 2022 Jun.
2
Transcriptomic Response of Huanglongbing-Infected Following Field Application of a Microbial Fermentation Product.微生物发酵产品田间施用后黄龙病感染植株的转录组反应
Front Plant Sci. 2021 Nov 30;12:754391. doi: 10.3389/fpls.2021.754391. eCollection 2021.
3

本文引用的文献

1
Ethylene-induced differential growth of petioles in Arabidopsis. Analyzing natural variation, response kinetics, and regulation.乙烯诱导拟南芥叶柄的差异生长。分析自然变异、反应动力学和调控。
Plant Physiol. 2005 Mar;137(3):998-1008. doi: 10.1104/pp.104.053967. Epub 2005 Feb 22.
2
Benchmarking the CATMA microarray. A novel tool for Arabidopsis transcriptome analysis.对CATMA微阵列进行基准测试。一种用于拟南芥转录组分析的新工具。
Plant Physiol. 2005 Feb;137(2):588-601. doi: 10.1104/pp.104.051300.
3
Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset.
Relationship between insulin sensitivity and gene expression in human skeletal muscle.
胰岛素敏感性与人类骨骼肌基因表达的关系。
BMC Endocr Disord. 2021 Feb 27;21(1):32. doi: 10.1186/s12902-021-00687-9.
4
The sensitivity of transcriptomics BMD modeling to the methods used for microarray data normalization.转录组学 BMD 建模对微阵列数据标准化方法的敏感性。
PLoS One. 2020 May 15;15(5):e0232955. doi: 10.1371/journal.pone.0232955. eCollection 2020.
5
A big data pipeline: Identifying dynamic gene regulatory networks from time-course Gene Expression Omnibus data with applications to influenza infection.大数据管道:从时间序列基因表达综合数据库中识别动态基因调控网络及其在流感感染中的应用。
Stat Methods Med Res. 2018 Jul;27(7):1930-1955. doi: 10.1177/0962280217746719.
6
Framework for Parallel Preprocessing of Microarray Data Using Hadoop.使用Hadoop进行微阵列数据并行预处理的框架
Adv Bioinformatics. 2018 Mar 29;2018:9391635. doi: 10.1155/2018/9391635. eCollection 2018.
7
Pathway analysis of transcriptomic data shows immunometabolic effects of vitamin D.转录组数据分析显示维生素 D 具有免疫代谢作用。
J Mol Endocrinol. 2018 Feb;60(2):95-108. doi: 10.1530/JME-17-0186. Epub 2017 Dec 12.
8
A Four-Biomarker Blood Signature Discriminates Systemic Inflammation Due to Viral Infection Versus Other Etiologies.四项生物标志物血液特征可鉴别病毒感染引起的全身炎症与其他病因。
Sci Rep. 2017 Jun 6;7(1):2914. doi: 10.1038/s41598-017-02325-8.
9
Tracing the dynamics of gene transcripts after organismal death.追踪生物体死亡后基因转录本的动态变化。
Open Biol. 2017 Jan;7(1). doi: 10.1098/rsob.160267.
10
Comparison of gene expression microarray data with count-based RNA measurements informs microarray interpretation.将基因表达微阵列数据与基于计数的RNA测量结果进行比较有助于微阵列的解读。
BMC Genomics. 2014 Aug 4;15(1):649. doi: 10.1186/1471-2164-15-649.
由完全定义的对照数据集揭示的Affymetrix基因芯片的首选分析方法。
Genome Biol. 2005;6(2):R16. doi: 10.1186/gb-2005-6-2-r16. Epub 2005 Jan 28.
4
Preprocessing of oligonucleotide array data.寡核苷酸阵列数据的预处理。
Nat Biotechnol. 2004 Jun;22(6):656-8; author reply 658. doi: 10.1038/nbt0604-656b.
5
Interactively optimizing signal-to-noise ratios in expression profiling: project-specific algorithm selection and detection p-value weighting in Affymetrix microarrays.在表达谱分析中交互式优化信噪比:Affymetrix微阵列中特定项目的算法选择和检测p值加权
Bioinformatics. 2004 Nov 1;20(16):2534-44. doi: 10.1093/bioinformatics/bth280. Epub 2004 Apr 29.
6
Real-time RT-PCR profiling of over 1400 Arabidopsis transcription factors: unprecedented sensitivity reveals novel root- and shoot-specific genes.对1400多个拟南芥转录因子进行实时逆转录聚合酶链式反应分析:前所未有的灵敏度揭示了新的根和地上部特异性基因。
Plant J. 2004 Apr;38(2):366-79. doi: 10.1111/j.1365-313X.2004.02051.x.
7
A benchmark for Affymetrix GeneChip expression measures.Affymetrix基因芯片表达量测量的一个基准。
Bioinformatics. 2004 Feb 12;20(3):323-31. doi: 10.1093/bioinformatics/btg410.
8
Guidelines for incorporating non-perfectly matched oligonucleotides into target-specific hybridization probes for a DNA microarray.将非完全匹配的寡核苷酸掺入用于DNA微阵列的靶标特异性杂交探针的指南。
Nucleic Acids Res. 2004 Feb 2;32(2):681-90. doi: 10.1093/nar/gkh196. Print 2004.
9
Comparative analysis of algorithms for signal quantitation from oligonucleotide microarrays.来自寡核苷酸微阵列的信号定量算法的比较分析
Bioinformatics. 2004 Apr 12;20(6):839-46. doi: 10.1093/bioinformatics/btg487. Epub 2004 Jan 29.
10
Exploration, normalization, and summaries of high density oligonucleotide array probe level data.高密度寡核苷酸阵列探针水平数据的探索、标准化及汇总
Biostatistics. 2003 Apr;4(2):249-64. doi: 10.1093/biostatistics/4.2.249.