Date Yasuhiro, Wei Feifei, Tsuboi Yuuri, Ito Kengo, Sakata Kenji, Kikuchi Jun
RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan.
Graduate School of Medical Life Science, Yokohama City University, 1-7-29 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan.
BMC Chem. 2021 Feb 20;15(1):13. doi: 10.1186/s13065-020-00731-0.
Nuclear magnetic resonance (NMR)-based relaxometry is widely used in various fields of research because of its advantages such as simple sample preparation, easy handling, and relatively low cost compared with metabolomics approaches. However, there have been no reports on the application of the T relaxation curves in metabolomics studies involving the evaluation of metabolic mixtures, such as geographical origin determination and feature extraction by pattern recognition and data mining. In this study, we describe a data mining method for relaxometric data (i.e., relaxometric learning). This method is based on a machine learning algorithm supported by the analytical framework optimized for the relaxation curve analyses. In the analytical framework, we incorporated a variable optimization approach and bootstrap resampling-based matrixing to enhance the classification performance and balance the sample size between groups, respectively. The relaxometric learning enabled the extraction of features related to the physical properties of fish muscle and the determination of the geographical origin of the fish by improving the classification performance. Our results suggest that relaxometric learning is a powerful and versatile alternative to conventional metabolomics approaches for evaluating fleshiness of chemical mixtures in food and for other biological and chemical research requiring a nondestructive, cost-effective, and time-saving method.
基于核磁共振(NMR)的弛豫测量法因其具有诸如样品制备简单、操作便捷以及与代谢组学方法相比成本相对较低等优点,而在各个研究领域中得到广泛应用。然而,关于T弛豫曲线在代谢组学研究中的应用,尚无涉及评估代谢混合物(如地理来源确定以及通过模式识别和数据挖掘进行特征提取)的相关报道。在本研究中,我们描述了一种用于弛豫测量数据的数据挖掘方法(即弛豫测量学习)。该方法基于一种机器学习算法,该算法由针对弛豫曲线分析优化的分析框架所支持。在该分析框架中,我们分别纳入了变量优化方法和基于自助重采样的矩阵化方法,以提高分类性能并平衡组间样本量。弛豫测量学习通过提高分类性能,实现了与鱼肉物理特性相关特征的提取以及鱼的地理来源的确定。我们的结果表明,弛豫测量学习是一种强大且通用的方法,可替代传统代谢组学方法,用于评估食品中化学混合物的肉质以及其他需要无损、经济高效且省时方法的生物和化学研究。