Personalised Medicine Centre, School of Medicine, Ulster University, C-TRIC Building, Altnagelvin Area Hospital, Glenshane Road, Londonderry, BT47 6SB, UK.
Proteomics. 2023 Jan;23(2):e2200252. doi: 10.1002/pmic.202200252. Epub 2022 Oct 3.
Multidimensional omic datasets often have correlated features leading to the possibility of discovering multiple biological signatures with similar predictive performance for a phenotype. However, their exploration is limited by low sample size and the exponential nature of the combinatorial search leading to high computational cost. To address these issues, we have developed an algorithm muSignAl (multiple signature algorithm) which selects multiple signatures with similar predictive performance while systematically bypassing the requirement of exploring all the combinations of features. We demonstrated the workflow of this algorithm with an example of proteomics dataset. muSignAl is applicable in various bioinformatics-driven explorations, such as understanding the relationship between multiple biological feature sets and phenotypes, and discovery and development of biomarker panels while providing the opportunity of optimising their development cost with the help of equally good multiple signatures. Source code of muSignAl is freely available at https://github.com/ShuklaLab/muSignAl.
多维组学数据集通常具有相关的特征,这使得有可能发现多个具有相似预测性能的生物特征,从而对表型进行预测。然而,由于样本量小和组合搜索的指数性质,导致计算成本高,其探索受到限制。为了解决这些问题,我们开发了一种算法 muSignAl(多特征算法),该算法可以选择具有相似预测性能的多个特征,同时系统地绕过探索所有特征组合的要求。我们通过一个蛋白质组学数据集的示例展示了该算法的工作流程。muSignAl 适用于各种基于生物信息学的探索,例如理解多个生物特征集与表型之间的关系,以及标志物面板的发现和开发,同时提供了通过同样好的多个特征来优化其开发成本的机会。muSignAl 的源代码可在 https://github.com/ShuklaLab/muSignAl 上免费获得。