Waaijenborg Sandra, Verselewel de Witt Hamer Philip C, Zwinderman Aeilko H
Academic Medical Center / University of Amsterdam.
Stat Appl Genet Mol Biol. 2008;7(1):Article3. doi: 10.2202/1544-6115.1329. Epub 2008 Jan 23.
Multiple changes at the DNA level are at the basis of complex diseases. Identifying the genetic networks that are influenced by these changes might help in understanding the development of these diseases. Canonical correlation analysis is used to associate gene expressions with DNA-markers and thus reveals sets of co-expressed and co-regulated genes and their associating DNA-markers. However, when the number of variables gets high, e.g. in the case of microarray studies, interpretation of these results can be difficult. By adapting the elastic net to canonical correlation analysis the number of variables reduces, and interpretation becomes easier, moreover, due to the grouping effect of the elastic net co-regulated and co-expressed genes cluster. Additionally, our adaptation works well in situations where the number of variables exceeds by far the number of subjects.
DNA 水平的多种变化是复杂疾病的基础。识别受这些变化影响的遗传网络可能有助于理解这些疾病的发展。典型相关分析用于将基因表达与 DNA 标记相关联,从而揭示共表达和共调控基因的集合及其相关的 DNA 标记。然而,当变量数量增加时,例如在微阵列研究中,解释这些结果可能会很困难。通过将弹性网络应用于典型相关分析,变量数量减少,解释变得更容易,此外,由于弹性网络的分组效应,共调控和共表达的基因会聚类。此外,我们的改进方法在变量数量远远超过样本数量的情况下也能很好地工作。