Department of Clinical Sciences, College of Veterinary Medicine, Mississippi State University, MS 39762, USA.
Anim Genet. 2011 Aug;42(4):395-405. doi: 10.1111/j.1365-2052.2010.02158.x. Epub 2011 Feb 6.
The equine genome sequence enables the use of high-throughput genomic technologies in equine research, but accurate identification of expressed gene products and interpreting their biological relevance require additional structural and functional genome annotation. Here, we employ the equine genome sequence to identify predicted and known proteins using proteomics and model these proteins into biological pathways, identifying 582 proteins in normal cell-free equine bronchoalveolar lavage fluid (BALF). We improved structural and functional annotation by directly confirming the in vivo expression of 558 (96%) proteins, which were computationally predicted previously, and adding Gene Ontology (GO) annotations for 174 proteins, 108 of which lacked functional annotation. Bronchoalveolar lavage is commonly used to investigate equine respiratory disease, leading us to model the associated proteome and its biological functions. Modelling of protein functions using Ingenuity Pathway Analysis identified carbohydrate metabolism, cell-to-cell signalling, cellular function, inflammatory response, organ morphology, lipid metabolism and cellular movement as key biological processes in normal equine BALF. Comparative modelling of protein functions in normal cell-free bronchoalveolar lavage proteomes from horse, human, and mouse, performed by grouping GO terms sharing common ancestor terms, confirms conservation of functions across species. Ninety-one of 92 human GO categories and 105 of 109 mouse GO categories were conserved in the horse. Our approach confirms the utility of the equine genome sequence to characterize protein networks without antibodies or mRNA quantification, highlights the need for continued structural and functional annotation of the equine genome and provides a framework for equine researchers to aid in the annotation effort.
马的基因组序列使我们能够在马的研究中使用高通量基因组技术,但要准确识别表达的基因产物并解释其生物学相关性,还需要额外的结构和功能基因组注释。在这里,我们使用马的基因组序列来使用蛋白质组学鉴定预测和已知的蛋白质,并将这些蛋白质构建成生物途径,从而在正常的细胞游离马支气管肺泡灌洗液 (BALF) 中鉴定出 582 种蛋白质。我们通过直接确认 558 种(96%)先前计算预测的蛋白质的体内表达,从而改进了结构和功能注释,并为 174 种蛋白质添加了基因本体论 (GO) 注释,其中 108 种蛋白质缺乏功能注释。支气管肺泡灌洗通常用于研究马的呼吸疾病,因此我们对相关的蛋白质组及其生物学功能进行建模。使用 Ingenuity Pathway Analysis 对蛋白质功能进行建模,确定碳水化合物代谢、细胞间信号转导、细胞功能、炎症反应、器官形态、脂质代谢和细胞运动是正常马 BALF 中的关键生物学过程。通过对来自马、人、和鼠的正常细胞游离支气管肺泡灌洗液蛋白质组进行比较建模,将具有共同祖先术语的 GO 术语分组,对蛋白质功能进行比较建模,证实了功能在物种间的保守性。92 个人类 GO 类别中有 91 个和 109 个小鼠 GO 类别中有 105 个在马中得到了保留。我们的方法证实了马的基因组序列在没有抗体或 mRNA 定量的情况下用于描述蛋白质网络的实用性,强调了继续对马基因组进行结构和功能注释的必要性,并为马研究人员提供了一个框架来帮助注释工作。