Johns Mitrick A, Mao Long
Department of Biological Sciences, Northern Illinois University, DeKalb, IL, 60115, USA.
Funct Integr Genomics. 2007 Apr;7(2):135-51. doi: 10.1007/s10142-006-0036-1. Epub 2006 Nov 28.
Cultivated rice (Oryza sativa) is comprised of two subspecies: japonica and indica. Polymorphism levels between putative homologues were determined for genes whose japonica homologue had been classified into functional categories using the Gene Ontology (GO) system. Genes were partitioned into below-average and above-average polymorphism groups, and then the set of genes having each GO term was checked for the randomness of its distribution into these polymorphism groups using a series of False Discovery Rate (FDR) tests. The robustness of the conclusions was enhanced by employing different cutoff values and sequence samplings in the FDR tests. Significant nonrandom polymorphism distributions were found for protein-coding sequences in many GO categories. In contrast, a random distribution for nearly all GO terms was seen with intron sequences. These results were extended by measuring the nonsynonymous to synonymous codon usage ratio (dN/dS) using a permutation test, which showed that some above-average polymorphism GO categories also had a high proportion of genes with a dN/dS ratio greater than one, suggesting positive selection on these GO categories during indica-japonica differentiation. An analysis of predominant gene names in the significant GO categories divided them into four functional classes: production of defense-related compounds, cell wall, cell signaling, and transcription factors.
栽培稻(Oryza sativa)由两个亚种组成:粳稻和籼稻。对于那些其粳稻同源基因已使用基因本体论(GO)系统分类到功能类别的基因,确定了推定同源基因之间的多态性水平。基因被分为多态性低于平均水平和高于平均水平的组,然后使用一系列错误发现率(FDR)测试检查具有每个GO术语的基因集在这些多态性组中的分布随机性。通过在FDR测试中采用不同的截止值和序列抽样,增强了结论的稳健性。在许多GO类别中发现蛋白质编码序列存在显著的非随机多态性分布。相比之下,内含子序列的几乎所有GO术语都呈现随机分布。通过使用置换检验测量非同义与同义密码子使用比率(dN/dS)扩展了这些结果,该检验表明一些多态性高于平均水平的GO类别也有很大比例的基因其dN/dS比率大于1,这表明在籼粳分化过程中这些GO类别受到正选择。对显著GO类别中主要基因名称的分析将它们分为四个功能类别:防御相关化合物的产生、细胞壁、细胞信号传导和转录因子。