Orchard Sandra, Montecchi-Palazzi Luisa, Hermjakob Henning, Apweiler Rolf
Sequence Database Group, EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
Pac Symp Biocomput. 2005:186-96.
Controlled vocabularies provide a roadmap through complex biological data. Proteomic data is increasing in volume and is currently poorly served by public repositories because of the large number of different formats in which the data is generated and stored. The Human Proteome Organization Proteomics Standards Initiative is establishing standards for data transfer and deposition. These standards use ontologies and controlled vocabularies to describe experimental procedures and common processes such as sample preparation. This paper discusses the development of such ontologies by the user community and their current use in the fields of protein:protein interactions and mass spectrometry.