Biosystems Data Analysis, Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, The Netherlands.
Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands.
PLoS One. 2018 Apr 26;13(4):e0195939. doi: 10.1371/journal.pone.0195939. eCollection 2018.
Combining different metabolomics platforms can contribute significantly to the discovery of complementary processes expressed under different conditions. However, analysing the fused data might be hampered by the difference in their quality. In metabolomics data, one often observes that measurement errors increase with increasing measurement level and that different platforms have different measurement error variance. In this paper we compare three different approaches to correct for the measurement error heterogeneity, by transformation of the raw data, by weighted filtering before modelling and by a modelling approach using a weighted sum of residuals. For an illustration of these different approaches we analyse data from healthy obese and diabetic obese individuals, obtained from two metabolomics platforms. Concluding, the filtering and modelling approaches that both estimate a model of the measurement error did not outperform the data transformation approaches for this application. This is probably due to the limited difference in measurement error and the fact that estimation of measurement error models is unstable due to the small number of repeats available. A transformation of the data improves the classification of the two groups.
将不同的代谢组学平台相结合可以显著促进在不同条件下表达的互补过程的发现。然而,融合数据的分析可能会受到其质量差异的阻碍。在代谢组学数据中,人们经常观察到测量误差随着测量水平的增加而增加,并且不同的平台具有不同的测量误差方差。在本文中,我们比较了三种不同的方法来纠正测量误差异质性,方法是对原始数据进行转换、在建模前进行加权过滤以及使用加权残差和进行建模。为了说明这些不同的方法,我们分析了来自两个代谢组学平台的健康肥胖和糖尿病肥胖个体的数据。结论是,对于这种应用,过滤和建模方法都没有优于数据转换方法,因为测量误差的差异有限,并且由于可用的重复次数较少,测量误差模型的估计不稳定。对数据的转换可以改善两组的分类。