Centre for Plant Integrative Biology, University of Nottingham, Nottingham, LE12 5RD, UK.
School of Mathematical Sciences, University of Nottingham, Nottingham, NG7 2RD, UK.
Plant Methods. 2010 Mar 15;6:9. doi: 10.1186/1746-4811-6-9.
Microarrays are a powerful tool used for the determination of global RNA expression. There is an increasing requirement to focus on profiling gene expression in tissues where it is difficult to obtain large quantities of material, for example individual tissues within organs such as the root, or individual isolated cells. From such samples, it is difficult to produce the amount of RNA required for labelling and hybridisation in microarray experiments, thus a process of amplification is usually adopted. Despite the increasing use of two-cycle amplification for transcriptomic analyses on the Affymetrix ATH1 array, there has been no report investigating any potential bias in gene representation that may occur as a result.
Here we compare transcriptomic data generated using Affymetrix one-cycle (standard labelling protocol), two-cycle (small-sample protocol) and IVT-Express protocols with the Affymetrix ATH1 array using Arabidopsis root samples. Results obtained with each protocol are broadly similar. However, we show that there are 35 probe sets (of a total of 22810) that are misrepresented in the two-cycle data sets. Of these, 33 probe sets were classed as mis-amplified when comparisons of two independent publicly available data sets were undertaken.
Given the unreliable nature of the highlighted probes, we caution against using data associated with the corresponding genes in analyses involving transcriptomic data generated with two-cycle amplification protocols. We have shown that the Affymetrix IVT-E labelling protocol produces data with less associated bias than the two-cycle protocol, and as such, would recommend this kit for new experiments that involve small samples.
微阵列是一种用于确定全局 RNA 表达的强大工具。越来越需要专注于在难以获得大量材料的组织中进行基因表达谱分析,例如器官内的单个组织,如根,或单个分离的细胞。从这些样本中,很难产生用于微阵列实验标记和杂交的所需量的 RNA,因此通常采用扩增过程。尽管在 Affymetrix ATH1 阵列上进行转录组分析时越来越多地使用两轮扩增,但尚未有报道调查可能由此产生的基因表达代表性的任何潜在偏差。
在这里,我们使用 Arabidopsis 根样本比较了使用 Affymetrix 单轮(标准标记方案)、两轮(小样本方案)和 IVT-Express 方案与 Affymetrix ATH1 阵列生成的转录组数据。每个方案获得的结果大致相似。然而,我们表明,有 35 个探针集(总共 22810 个)在两轮数据集中标记错误。在进行两个独立的公开可用数据集的比较时,其中 33 个探针集被归类为过度扩增。
鉴于突出探针的不可靠性质,我们警告不要在涉及两轮扩增方案生成的转录组数据的分析中使用与相应基因相关的数据。我们已经表明,Affymetrix IVT-E 标记方案产生的数据与两轮方案相比具有较少的相关偏差,因此,我们建议在涉及小样本的新实验中使用该试剂盒。