Lin Zhaozhou, Zhang Qiao, Dai Shengyun, Gao Xiaoyan
Beijing Institute of Chinese Materia Medica, Beijing 100035, China.
School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing 10029, China.
Metabolites. 2020 Jan 13;10(1):33. doi: 10.3390/metabo10010033.
Temporal associations in longitudinal nontargeted metabolomics data are generally ignored by common pattern recognition methods such as partial least squares discriminant analysis (PLS-DA) and orthogonal partial least squares discriminant analysis (OPLS-DA). To discover temporal patterns in longitudinal metabolomics, a multitask learning (MTL) method employing structural regularization was proposed. The group regularization term of the proposed MTL method enables the selection of a small number of tentative biomarkers while maintaining high prediction accuracy. Meanwhile, the nuclear norm imposed into the regression coefficient accounts for the interrelationship of the metabolomics data obtained on consecutive time points. The effectiveness of the proposed method was demonstrated by comparison study performed on a metabolomics dataset and a simulating dataset. The results showed that a compact set of tentative biomarkers charactering the whole antipyretic process of Qingkailing injection were selected with the proposed method. In addition, the nuclear norm introduced in the new method could help the group norm to improve the method's recovery ability.
纵向非靶向代谢组学数据中的时间关联通常被诸如偏最小二乘判别分析(PLS-DA)和正交偏最小二乘判别分析(OPLS-DA)等常见模式识别方法所忽略。为了发现纵向代谢组学中的时间模式,提出了一种采用结构正则化的多任务学习(MTL)方法。所提出的MTL方法的组正则化项能够在保持高预测准确性的同时选择少量的暂定生物标志物。同时,施加到回归系数上的核范数考虑了在连续时间点获得的代谢组学数据的相互关系。通过在代谢组学数据集和模拟数据集上进行的比较研究证明了所提出方法的有效性。结果表明,使用所提出的方法选择了一组紧凑的暂定生物标志物,这些标志物表征了清开灵注射液的整个退热过程。此外,新方法中引入的核范数可以帮助组范数提高方法的恢复能力。