Yuan Joshua S, Burris Jason, Stewart Nathan R, Mentewab Ayalew, Stewart C Neal
Department of Plant Sciences, University of Tennessee, Knoxville, TN 37996, USA.
BMC Bioinformatics. 2007 Nov 1;8 Suppl 7(Suppl 7):S6. doi: 10.1186/1471-2105-8-S7-S6.
As compared with traditional transgene copy number detection technologies such as Southern blot analysis, real-time PCR provides a fast, inexpensive and high-throughput alternative. However, the real-time PCR based transgene copy number estimation tends to be ambiguous and subjective stemming from the lack of proper statistical analysis and data quality control to render a reliable estimation of copy number with a prediction value. Despite the recent progresses in statistical analysis of real-time PCR, few publications have integrated these advancements in real-time PCR based transgene copy number determination.
Three experimental designs and four data quality control integrated statistical models are presented. For the first method, external calibration curves are established for the transgene based on serially-diluted templates. The Ct number from a control transgenic event and putative transgenic event are compared to derive the transgene copy number or zygosity estimation. Simple linear regression and two group T-test procedures were combined to model the data from this design. For the second experimental design, standard curves were generated for both an internal reference gene and the transgene, and the copy number of transgene was compared with that of internal reference gene. Multiple regression models and ANOVA models can be employed to analyze the data and perform quality control for this approach. In the third experimental design, transgene copy number is compared with reference gene without a standard curve, but rather, is based directly on fluorescence data. Two different multiple regression models were proposed to analyze the data based on two different approaches of amplification efficiency integration. Our results highlight the importance of proper statistical treatment and quality control integration in real-time PCR-based transgene copy number determination.
These statistical methods allow the real-time PCR-based transgene copy number estimation to be more reliable and precise with a proper statistical estimation. Proper confidence intervals are necessary for unambiguous prediction of trangene copy number. The four different statistical methods are compared for their advantages and disadvantages. Moreover, the statistical methods can also be applied for other real-time PCR-based quantification assays including transfection efficiency analysis and pathogen quantification.
与传统的转基因拷贝数检测技术如Southern印迹分析相比,实时荧光定量PCR提供了一种快速、廉价且高通量的替代方法。然而,基于实时荧光定量PCR的转基因拷贝数估计往往不明确且主观,这是由于缺乏适当的统计分析和数据质量控制,无法通过预测值可靠地估计拷贝数。尽管最近在实时荧光定量PCR的统计分析方面取得了进展,但很少有出版物将这些进展整合到基于实时荧光定量PCR的转基因拷贝数测定中。
提出了三种实验设计和四种数据质量控制综合统计模型。对于第一种方法,基于系列稀释的模板为转基因建立外部校准曲线。比较对照转基因事件和推定转基因事件的Ct值,以得出转基因拷贝数或纯合度估计值。将简单线性回归和两组T检验程序结合起来对该设计的数据进行建模。对于第二种实验设计,为内参基因和转基因都生成标准曲线,并将转基因的拷贝数与内参基因的拷贝数进行比较。可以采用多元回归模型和方差分析模型来分析数据并对该方法进行质量控制。在第三种实验设计中,转基因拷贝数与参考基因进行比较,无需标准曲线,而是直接基于荧光数据。基于两种不同的扩增效率整合方法,提出了两种不同的多元回归模型来分析数据。我们的结果强调了在基于实时荧光定量PCR的转基因拷贝数测定中进行适当统计处理和质量控制整合的重要性。
这些统计方法通过适当的统计估计,使基于实时荧光定量PCR的转基因拷贝数估计更加可靠和精确。明确预测转基因拷贝数需要适当的置信区间。比较了四种不同统计方法的优缺点。此外,这些统计方法还可应用于其他基于实时荧光定量PCR的定量分析,包括转染效率分析和病原体定量。