Department of Statistics, TU Dortmund University, Dortmund, Germany.
PLoS One. 2023 Oct 20;18(10):e0293180. doi: 10.1371/journal.pone.0293180. eCollection 2023.
In toxicological concentration-response studies, a frequent goal is the determination of an 'alert concentration', i.e. the lowest concentration where a notable change in the response in comparison to the control is observed. In high-throughput gene expression experiments, e.g. based on microarray or RNA-seq technology, concentration-response profiles can be measured for thousands of genes simultaneously. One approach for determining the alert concentration is given by fitting a parametric model to the data which allows interpolation between the tested concentrations. It is well known that the quality of a model fit improves with the number of measured data points. However, adding new replicates for existing concentrations or even several replicates for new concentrations is time-consuming and expensive. Here, we propose an empirical Bayes approach to information sharing across genes, where in essence a weighted mean of the individual estimate for one specific parameter of a fitted model and the mean of all estimates of the entire set of genes is calculated as a result. Results of a controlled plasmode simulation study show that for many genes a notable improvement in terms of the mean squared error (MSE) between estimate and true underlying value of the parameter can be observed. However, for some genes, the MSE increases, and this cannot be prevented by using a more sophisticated prior distribution in the Bayesian approach.
在毒理学浓度-反应研究中,一个常见的目标是确定“警报浓度”,即与对照相比,观察到反应明显变化的最低浓度。在高通量基因表达实验中,例如基于微阵列或 RNA-seq 技术,可以同时测量数千个基因的浓度-反应曲线。确定警报浓度的一种方法是拟合数据的参数模型,该模型允许在测试浓度之间进行插值。众所周知,模型拟合的质量随着测量数据点数量的增加而提高。然而,为现有浓度添加新的重复实验,甚至为新浓度添加多个重复实验既耗时又昂贵。在这里,我们提出了一种跨基因信息共享的经验贝叶斯方法,实质上是为拟合模型的一个特定参数的个体估计值和整个基因集的所有估计值的平均值计算加权平均值。受控质粒模拟研究的结果表明,对于许多基因,可以观察到参数估计值与真实值之间的均方误差(MSE)显著提高。然而,对于一些基因,MSE 增加,并且通过在贝叶斯方法中使用更复杂的先验分布,无法防止这种情况发生。