School of Mathematics and Statistics, The University of Sydney, Sydney, NSW, 2006, Australia.
Charles Perkins Centre, The University of Sydney, Sydney, NSW, 2006, Australia.
BMC Bioinformatics. 2020 Nov 17;21(1):530. doi: 10.1186/s12859-020-03861-3.
Nutrigenomics aims at understanding the interaction between nutrition and gene information. Due to the complex interactions of nutrients and genes, their relationship exhibits non-linearity. One of the most effective and efficient methods to explore their relationship is the nutritional geometry framework which fits a response surface for the gene expression over two prespecified nutrition variables. However, when the number of nutrients involved is large, it is challenging to find combinations of informative nutrients with respect to a certain gene and to test whether the relationship is stronger than chance. Methods for identifying informative combinations are essential to understanding the relationship between nutrients and genes.
We introduce Local Consistency Nutrition to Graphics (LC-N2G), a novel approach for ranking and identifying combinations of nutrients with gene expression. In LC-N2G, we first propose a model-free quantity called Local Consistency statistic to measure whether there is non-random relationship between combinations of nutrients and gene expression measurements based on (1) the similarity between samples in the nutrient space and (2) their difference in gene expression. Then combinations with small LC are selected and a permutation test is performed to evaluate their significance. Finally, the response surfaces are generated for the subset of significant relationships. Evaluation on simulated data and real data shows the LC-N2G can accurately find combinations that are correlated with gene expression.
The LC-N2G is practically powerful for identifying the informative nutrition variables correlated with gene expression. Therefore, LC-N2G is important in the area of nutrigenomics for understanding the relationship between nutrition and gene expression information.
营养基因组学旨在理解营养与基因信息之间的相互作用。由于营养与基因的相互作用复杂,它们之间的关系呈现出非线性。探索它们之间关系的最有效和最有效的方法之一是营养几何框架,该框架适合在两个预设的营养变量上的基因表达的响应曲面。然而,当涉及的营养素有很多时,找到与特定基因有关的信息性营养组合并测试它们之间的关系是否强于偶然就具有挑战性。识别信息性组合的方法对于理解营养与基因之间的关系至关重要。
我们引入了局部一致性营养到图形(LC-N2G),这是一种用于对与基因表达相关的营养素组合进行排名和识别的新方法。在 LC-N2G 中,我们首先提出了一种无模型数量,称为局部一致性统计量,用于测量营养素组合与基因表达测量之间是否存在非随机关系,这基于(1)营养素空间中样本之间的相似性和(2)它们在基因表达上的差异。然后选择 LC 较小的组合,并进行置换检验以评估其显著性。最后,为显著关系的子集生成响应曲面。模拟数据和真实数据的评估表明,LC-N2G 可以准确地找到与基因表达相关的组合。
LC-N2G 对于识别与基因表达相关的信息性营养变量具有实际的强大功能。因此,LC-N2G 在营养基因组学领域对于理解营养与基因表达信息之间的关系非常重要。