CSIRO Animal, Food and Health Sciences, Queensland Bioscience Precinct, Brisbane, QLD 4067, Australia.
J Anim Sci. 2013 Feb;91(2):530-6. doi: 10.2527/jas.2012-5780. Epub 2012 Oct 24.
The advent of economically viable high-throughput genetic and genomic techniques has equipped animal geneticists with an unprecedented ability to generate massive amounts of molecular data. As a result, large lists of genes differentially expressed in many experimental conditions of interests have been reported and, likewise, the association of an ever growing number of DNA variants with phenotypes of importance is now a routine endeavor. Although these studies have greatly improved our understanding of the genetic basis of complex phenotypes, they have also revealed the difficulty in explaining more than a fraction of the genetic variance. Inspired by this data-rich and knowledge-poor dichotomy, systems biology aims at the formal integration of seemingly disparate datasets allowing for a holistic view where key properties of the systems emerge as an intuitive feature and enable the generation of testable hypotheses. Herein, we present 2 examples of integrating molecular data anchored in the power of gene network inference. The first example is concerned with the onset of puberty in Bos indicus-influenced cows bred in Australia. Using the results from genomewide association studies across a range of phenotypes, we developed what we termed an association weight matrix to generate a gene network underlying phenotypes of puberty in cattle. The network was mined for the minimal set of transcription factor genes whose predicted target spanned the majority of the topology of the entire network. The second example deals with piebald, a pigmentation phenotype in Merino sheep. Two networks were developed: a regulatory network and an epistatic network. The former is inferred based on promoter sequence analysis of differentially expressed genes. The epistatic network is built from 2-locus models among all pairwise associated polymorphisms. At the intersection between these 2 networks, we revealed a set of genes and gene-gene interactions of validated and de novo predicted relevance to the piebald phenotype. We argue that these new approaches are holistic and therefore more appropriate than traditional approaches for investigating genetic mechanisms underlying complex phenotypes of importance in livestock species.
高通量遗传和基因组技术的出现使动物遗传学家具备了前所未有的能力,能够产生大量的分子数据。因此,已经报道了大量在许多感兴趣的实验条件下差异表达的基因列表,同样,越来越多的 DNA 变体与重要表型的关联现在也成为常规研究。尽管这些研究极大地提高了我们对复杂表型遗传基础的理解,但它们也揭示了仅能解释一小部分遗传变异的困难。受这种数据丰富但知识匮乏的二分法的启发,系统生物学旨在对看似不同的数据进行正式整合,从而形成一个整体视图,其中系统的关键特性作为直观特征出现,并能够生成可测试的假设。在此,我们介绍了 2 个基于基因网络推断将分子数据整合的例子。第一个例子涉及在澳大利亚饲养的受印度野牛影响的奶牛的青春期开始。使用跨一系列表型的全基因组关联研究的结果,我们开发了我们所谓的关联权重矩阵,以生成牛青春期表型的基因网络。该网络用于挖掘转录因子基因的最小集合,这些基因的预测靶标跨越整个网络拓扑的大部分。第二个例子涉及美利奴羊的白化病,一种色素沉着表型。开发了两个网络:一个调控网络和一个上位网络。前者是基于差异表达基因的启动子序列分析推断出来的。上位网络是由所有成对相关多态性之间的 2 个位点模型构建的。在这两个网络的交点处,我们揭示了一组基因和基因-基因相互作用,这些基因和基因-基因相互作用对白化病表型具有验证和新预测的相关性。我们认为,这些新方法是整体的,因此比传统方法更适合研究对家畜物种重要的复杂表型的遗传机制。