Centre for Applied Mathematics and Bioinformatics, Department of Mathematics and Natural Sciences, Gulf University for Science and Technology, P.O. Box 7207, Hawally, 32093, Kuwait.
Department of Mathematics and Computer Science, University of Southern Denmark, Campusvej 55, 5230, Odense M, Denmark.
Sci Rep. 2019 Jul 4;9(1):9684. doi: 10.1038/s41598-019-46097-9.
Establishing a cytokine signature associated to some medical condition is an important task in immunology. Increasingly, large numbers of cytokines are used for signatures, via lists of reference ranges for each individual cytokine or ratios of cytokines. Here we argue that this common approach has weaknesses, especially when many different cytokines are analysed. Instead, we propose that establishing signatures can be framed as a multivariate anomaly detection problem, and hence exploit the many statistical methods available for this. In this framework, whether or not a given subject's profile matches the cytokine signature of some condition is determined by whether or not the profile is typical of reference samples of that condition, as judged by an anomaly detection algorithm. We examine previously published cytokine data sets associated to pregnancy complications, brain tumours, and rheumatoid arthritis, as well as normal healthy control samples, and test the performance of a range of anomaly detection algorithms on these data, identifying the best performing methods. Finally, we suggest that this anomaly detection approach could be adopted more widely for general multi-biomarker signatures.
建立与某些医学病症相关的细胞因子特征是免疫学中的一项重要任务。越来越多的细胞因子被用于特征分析,通常通过每个细胞因子的参考范围列表或细胞因子的比值来实现。在这里,我们认为这种常见的方法存在一些弱点,特别是在分析许多不同的细胞因子时。相反,我们提出可以将特征的建立视为一个多元异常检测问题,并利用为此提供的许多统计方法。在这种框架下,给定个体的特征是否与某种病症的细胞因子特征相匹配,取决于该特征是否与该病症的参考样本特征一致,这是由异常检测算法来判断的。我们检查了与妊娠并发症、脑肿瘤和类风湿性关节炎以及正常健康对照样本相关的以前发表的细胞因子数据集,并在这些数据上测试了一系列异常检测算法的性能,确定了表现最好的方法。最后,我们建议更广泛地采用这种异常检测方法来进行一般的多生物标志物特征分析。