Mitchell Rowan A C
Sustainable Soils and Crops, Rothamsted Research, Harpenden, Hertfordshire AL5 2JQ, United Kingdom.
Bioinform Adv. 2025 Apr 7;5(1):vbaf079. doi: 10.1093/bioadv/vbaf079. eCollection 2025.
Where experiments identify sets of grass genes of unknown function, e.g. underlying a QTL or co-expressed in a transcriptome, it is useful to know which of these genes are common to all grasses (universal) and whether they are likely to have a monocot-/commelinid-/grass-specific function.
A pipeline used 16 complete grass genomes from Ensembl Plants to generate 13 312 highly conserved, universal groups of grass protein-coding genes. In validation, 98.8% of these groups also had gene matches in recently sequenced genomes from two major grass clades that were not used in the pipeline. Comparison with many non-grass genomes identified 4609 of these groups as likely to have a monocot-/commelinid-/grass-specific function. Both the grouping of genes and their specificity were defined using hidden Markov model (HMM) profiles of the groups. The HMM-based approach discriminated between test sets of known specific and non-specific genes better than simple percentage identity did. The results give novel insight into the nature of monocot-/commelinid-/grass-specific genes. Researchers can use the universal_grass_peps database to gain evidence that their experimentally identified grass genes are involved in monocot-/commelinid-/grass-specific traits.
The universal_grass_peps database is available for download at https://data.rothamsted.ac.uk/dataset/universal_grass_peps.
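As an illustration of the intended use, the sketch below counts how many genes in an experimentally derived set fall into a universal group and how many of those groups are flagged as specific. The layout of the downloaded database is not described here, so the in-memory mapping, the `GroupRecord` type, the `summarise_gene_set` function, and the gene and group identifiers are all hypothetical; only the two group counts are taken from the abstract.

```python
from typing import Dict, Iterable, NamedTuple


class GroupRecord(NamedTuple):
    """Hypothetical record for one universal group (not the database's real schema)."""
    group_id: str
    clade_specific: bool  # True if flagged monocot-/commelinid-/grass-specific


# Counts reported in the abstract.
N_UNIVERSAL_GROUPS = 13312
N_SPECIFIC_GROUPS = 4609
SPECIFIC_SHARE = N_SPECIFIC_GROUPS / N_UNIVERSAL_GROUPS  # roughly 0.35


def summarise_gene_set(
    genes: Iterable[str], gene_to_group: Dict[str, GroupRecord]
) -> Dict[str, int]:
    """Count genes that map to a universal group and to a specific group."""
    gene_list = list(genes)
    # Genes present in the mapping belong to some universal group.
    universal = [g for g in gene_list if g in gene_to_group]
    # Of those, keep the ones whose group carries the specificity flag.
    specific = [g for g in universal if gene_to_group[g].clade_specific]
    return {
        "n_genes": len(gene_list),
        "n_universal": len(universal),
        "n_specific": len(specific),
    }


if __name__ == "__main__":
    # Toy data: identifiers are invented for illustration only.
    toy_mapping = {
        "geneA": GroupRecord("group_0001", True),
        "geneB": GroupRecord("group_0002", False),
    }
    qtl_genes = ["geneA", "geneB", "geneC"]
    print(summarise_gene_set(qtl_genes, toy_mapping))
    print(f"Share of universal groups flagged specific: {SPECIFIC_SHARE:.1%}")
```

Comparing the specific fraction in a gene set with the overall share of specific groups (4609 of 13 312) gives a rough sense of whether the set is enriched for such genes; a formal enrichment test would be needed to draw any conclusion.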