Università degli Studi di Milano, Via Comelico 39, Milan, Italy.
BMC Bioinformatics. 2009 Oct 15;10 Suppl 12(Suppl 12):S7. doi: 10.1186/1471-2105-10-S12-S7.
Today's public database infrastructure spans a very large collection of heterogeneous biological data, opening new opportunities for molecular biology, biomedical and bioinformatics research, but also raising new problems for the integration and computational processing of these data.
In this paper we survey the most interesting and novel approaches to the representation, integration and management of different kinds of biological data that exploit XML and its related recommendations. We also present new cutting-edge approaches to the management of heterogeneous biological data represented in XML.
XML has succeeded in the integration of heterogeneous biomolecular information, and has established itself as the syntactic glue for biological data sources. Nevertheless, a large variety of XML-based data formats have been proposed, which makes the effective integration of bioinformatics data schemas difficult. The adoption of a few semantically rich standard formats is urgently needed to achieve seamless integration of current biological resources.