Gold David, Coombes Kevin, Medhane Dina, Ramaswamy Anitha, Ju Zhenlin, Strong Louise, Koo Ja Seok, Kapoor Mini
Department of Biostatistics, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, USA.
BMC Genomics. 2004 Jan 6;5(1):2. doi: 10.1186/1471-2164-5-2.
To generate specific transcript profiles, one must isolate homogenous cell populations using techniques that often yield small amounts of RNA, requiring researchers to employ RNA amplification methods. The data generated by using these methods must be extensively evaluated to determine any technique dependent distortion of the expression profiles.
High-density oligonucleotide microarrays were used to perform experiments for comparing data generated by using two protocols, an in vitro transcription (IVT) protocol that requires 5 microg of total RNA and a double in vitro transcription (dIVT) protocol that requires 200 ng of total RNA for target preparation from RNA samples extracted from a normal and a cancer cell line. In both cell lines, about 10% more genes were detected with IVT than with dIVT. Genes were filtered to exclude those that were undetected on all arrays. Hierarchical clustering using the 9,482 genes that passed the filter showed that the variation attributable to biological differences between samples was greater than that introduced by differences in the protocols. We analyzed the behavior of these genes separately for each protocol by using a statistical model to estimate the posterior probability of various levels of fold change. At each level, more differentially expressed genes were detected with IVT than with dIVT. When we checked for genes that had a posterior probability greater than 99% of fold change greater than 2, in data generated by IVT but not dIVT, more than 60% of these genes had posterior probabilities greater than 90% in data generated by dIVT. Both protocols identified the same functional gene categories to be differentially expressed. Differential expression of selected genes was confirmed using quantitative real-time PCR.
Using nanogram quantities on total RNA, the usage of dIVT protocol identified differentially expressed genes and functional categories consistent with those detected by the IVT protocol. There was a loss in sensitivity of about 10% when detecting differentially expressed genes using the dIVT protocol. However, the lower amount of RNA required for this protocol, as compared to the IVT protocol, renders this methodology a highly desirable one for biological systems where sample amounts are limiting.
为了生成特定的转录本谱,必须使用通常只能产生少量RNA的技术分离出同质细胞群体,这就要求研究人员采用RNA扩增方法。使用这些方法生成的数据必须经过广泛评估,以确定表达谱中是否存在任何技术依赖性的失真。
使用高密度寡核苷酸微阵列进行实验,以比较两种方案生成的数据,一种是体外转录(IVT)方案,该方案需要5微克总RNA,另一种是双体外转录(dIVT)方案,该方案需要200纳克总RNA用于从正常细胞系和癌细胞系提取的RNA样本中制备靶标。在两种细胞系中,IVT检测到的基因比dIVT多约10%。对基因进行筛选,排除那些在所有阵列上均未检测到的基因。使用通过筛选的9482个基因进行层次聚类分析表明,样本间生物学差异导致的变异大于方案差异引入的变异。我们使用统计模型分别针对每个方案分析这些基因的行为,以估计不同水平的倍数变化的后验概率。在每个水平上,IVT检测到的差异表达基因比dIVT多。当我们检查在IVT生成的数据中后验概率大于99%且倍数变化大于2,但在dIVT生成的数据中未出现的基因时,这些基因中有超过60%在dIVT生成的数据中的后验概率大于90%。两种方案都鉴定出相同的功能基因类别存在差异表达。使用定量实时PCR确认了所选基因的差异表达。
使用纳克级的总RNA量,dIVT方案的使用鉴定出与IVT方案检测到的差异表达基因和功能类别一致。使用dIVT方案检测差异表达基因时,灵敏度损失约10%。然而,与IVT方案相比,该方案所需的RNA量更低,使得该方法对于样本量有限的生物系统非常适用。