Natale Darren A, Arighi Cecilia N, Barker Winona C, Blake Judith, Chang Ti-Cheng, Hu Zhangzhi, Liu Hongfang, Smith Barry, Wu Cathy H
Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, 3300 Whitehaven St, NW, Washington, DC 20007, USA.
BMC Bioinformatics. 2007 Nov 27;8 Suppl 9(Suppl 9):S1. doi: 10.1186/1471-2105-8-S9-S1.
Biomedical ontologies are emerging as critical tools in genomic and proteomic research, where complex data in disparate resources need to be integrated. A number of ontologies describe properties that can be attributed to proteins. For example, protein functions are described by the Gene Ontology (GO) and human diseases by SNOMED CT or ICD10. There is, however, a gap in the current set of ontologies - one that describes the protein entities themselves and their relationships. We have designed the PRotein Ontology (PRO) to facilitate protein annotation and to guide new experiments. The components of PRO extend from the classification of proteins on the basis of evolutionary relationships to the representation of the multiple protein forms of a gene (products generated by genetic variation, alternative splicing, proteolytic cleavage, and other post-translational modifications). PRO will allow the specification of relationships between PRO, GO and other ontologies in the OBO Foundry. Here we describe the initial development of PRO, illustrated using human and mouse proteins involved in the transforming growth factor-beta and bone morphogenetic protein signaling pathways.
生物医学本体论正在成为基因组和蛋白质组研究中的关键工具,在这些研究中,不同资源中的复杂数据需要整合。许多本体论描述了可归因于蛋白质的特性。例如,蛋白质功能由基因本体论(GO)描述,人类疾病由SNOMED CT或ICD10描述。然而,当前的本体论集合存在一个空白——一个描述蛋白质实体本身及其关系的空白。我们设计了蛋白质本体论(PRO)以促进蛋白质注释并指导新的实验。PRO的组件从基于进化关系的蛋白质分类扩展到基因多种蛋白质形式的表示(由基因变异、可变剪接、蛋白水解切割和其他翻译后修饰产生的产物)。PRO将允许指定PRO、GO和OBO铸造厂中其他本体论之间的关系。在这里,我们描述了PRO的初步开发,并以参与转化生长因子-β和骨形态发生蛋白信号通路的人类和小鼠蛋白质为例进行说明。