Bongaerts Michiel, Kulkarni Purva, Zammit Alan, Bonte Ramon, Kluijtmans Leo A J, Blom Henk J, Engelke Udo F H, Tax David M J, Ruijter George J G, Reinders Marcel J T
Department of Clinical Genetics, University Medical Center Rotterdam, Dr. Molewaterplein 40, 3015 GD Rotterdam, The Netherlands.
Department of Human Genetics, Radboud University Medical Center, 6525 GA Nijmegen, The Netherlands.
Metabolites. 2023 Jan 7;13(1):97. doi: 10.3390/metabo13010097.
Untargeted metabolomics (UM) is increasingly being deployed as a strategy for screening patients that are suspected of having an inborn error of metabolism (IEM). In this study, we examined the potential of existing outlier detection methods to detect IEM patient profiles. We benchmarked 30 different outlier detection methods when applied to three untargeted metabolomics datasets. Our results show great differences in IEM detection performances across the various methods. The methods and performed most consistently across the three metabolomics datasets. For datasets with a more balanced number of samples-to-features ratio, we found that , and also performed well. Furthermore, we demonstrated the importance of a PCA transform prior to applying an outlier detection method since we observed that this increases the performance of several outlier detection methods. For only one of the three metabolomics datasets, we observed clinically satisfying performances for some outlier detection methods, where we were able to detect 90% of the IEM patient samples while detecting no false positives. These results suggest that outlier detection methods have the potential to aid the clinical investigator in routine screening for IEM using untargeted metabolomics data, but also show that further improvements are needed to ensure clinically satisfying performances.
非靶向代谢组学(UM)越来越多地被用作一种筛查疑似患有先天性代谢缺陷(IEM)患者的策略。在本研究中,我们考察了现有异常值检测方法检测IEM患者谱的潜力。当应用于三个非靶向代谢组学数据集时,我们对30种不同的异常值检测方法进行了基准测试。我们的结果表明,各种方法在IEM检测性能上存在很大差异。[此处原文缺失具体方法名称]方法在三个代谢组学数据集中表现最为一致。对于样本与特征比例较为均衡的数据集,我们发现[此处原文缺失具体方法名称]方法也表现良好。此外,我们证明了在应用异常值检测方法之前进行主成分分析(PCA)变换的重要性,因为我们观察到这会提高几种异常值检测方法的性能。对于三个代谢组学数据集中的仅一个,我们观察到某些异常值检测方法具有临床满意的性能,即我们能够检测出90%的IEM患者样本,同时未检测到假阳性。这些结果表明,异常值检测方法有潜力帮助临床研究人员利用非靶向代谢组学数据对IEM进行常规筛查,但也表明需要进一步改进以确保临床满意的性能。