Zhang Tao, Jiang Min, Chen Lei, Niu Bing, Cai Yudong
Institute of Systems Biology, Shanghai University, 99 ShangDa Road, Shanghai 200444, China.
Biomed Res Int. 2013;2013:870795. doi: 10.1155/2013/870795. Epub 2013 Nov 7.
Observing what phenotype the overexpression or knockdown of gene can cause is the basic method of investigating gene functions. Many advanced biotechnologies, such as RNAi, were developed to study the gene phenotype. But there are still many limitations. Besides the time and cost, the knockdown of some gene may be lethal which makes the observation of other phenotypes impossible. Due to ethical and technological reasons, the knockdown of genes in complex species, such as mammal, is extremely difficult. Thus, we proposed a new sequence-based computational method called kNNA-based method for gene phenotypes prediction. Different to the traditional sequence-based computational method, our method regards the multiphenotype as a whole network which can rank the possible phenotypes associated with the query protein and shows a more comprehensive view of the protein's biological effects. According to the prediction result of yeast, we also find some more related features, including GO and KEGG information, which are making more contributions in identifying protein phenotypes. This method can be applied in gene phenotype prediction in other species.
观察基因的过表达或敲低会导致何种表型是研究基因功能的基本方法。许多先进的生物技术,如RNA干扰,被开发用于研究基因表型。但仍存在许多局限性。除了时间和成本外,某些基因的敲低可能是致死性的,这使得观察其他表型变得不可能。由于伦理和技术原因,在复杂物种(如哺乳动物)中敲低基因极其困难。因此,我们提出了一种新的基于序列的计算方法——基于kNNA的方法来预测基因表型。与传统的基于序列的计算方法不同,我们的方法将多表型视为一个整体网络,该网络可以对与查询蛋白相关的可能表型进行排序,并展示出该蛋白生物学效应的更全面视图。根据酵母的预测结果,我们还发现了一些更相关的特征,包括GO和KEGG信息,它们在识别蛋白表型方面发挥着更大的作用。该方法可应用于其他物种的基因表型预测。