Suppr超能文献

基于语义网技术的弗氏柠檬酸杆菌 novicida 蛋白质组学和转录组学数据集成与注释。

Francisella tularensis novicida proteomic and transcriptomic data integration and annotation based on semantic web technologies.

机构信息

Faculty of Biomedical and Life Sciences, University of Glasgow, Glasgow, G12 8QQ, UK.

出版信息

BMC Bioinformatics. 2009 Oct 1;10 Suppl 10(Suppl 10):S3. doi: 10.1186/1471-2105-10-S10-S3.

Abstract

BACKGROUND

This paper summarises the lessons and experiences gained from a case study of the application of semantic web technologies to the integration of data from the bacterial species Francisella tularensis novicida (Fn). Fn data sources are disparate and heterogeneous, as multiple laboratories across the world, using multiple technologies, perform experiments to understand the mechanism of virulence. It is hard to integrate these data sources in a flexible manner that allows new experimental data to be added and compared when required.

RESULTS

Public domain data sources were combined in RDF. Using this connected graph of database cross references, we extended the annotations of an experimental data set by superimposing onto it the annotation graph. Identifiers used in the experimental data automatically resolved and the data acquired annotations in the rest of the RDF graph. This happened without the expensive manual annotation that would normally be required to produce these links. This graph of resolved identifiers was then used to combine two experimental data sets, a proteomics experiment and a transcriptomic experiment studying the mechanism of virulence through the comparison of wildtype Fn with an avirulent mutant strain.

CONCLUSION

We produced a graph of Fn cross references which enabled the combination of two experimental datasets. Through combination of these data we are able to perform queries that compare the results of the two experiments. We found that data are easily combined in RDF and that experimental results are easily compared when the data are integrated. We conclude that semantic data integration offers a convenient, simple and flexible solution to the integration of published and unpublished experimental data.

摘要

背景

本文总结了应用语义 Web 技术整合弗氏志贺样杆菌 novicida (Fn) 数据的案例研究中的经验教训。Fn 数据源具有多样性和异质性,因为世界各地的多个实验室使用多种技术进行实验以了解毒力机制。很难以灵活的方式整合这些数据源,以便在需要时添加和比较新的实验数据。

结果

公共领域数据源在 RDF 中组合。使用此数据库交叉引用的连接图,我们通过将注释图叠加在实验数据集的注释上来扩展实验数据集的注释。实验数据中使用的标识符自动解析,并在 RDF 图的其余部分获取注释。这是在通常需要进行这些链接的昂贵的手动注释的情况下发生的。然后,使用此解析标识符图来组合两个实验数据集,一个是蛋白质组学实验,另一个是通过比较野生型 Fn 与无毒突变株来研究毒力机制的转录组学实验。

结论

我们生成了一个 Fn 交叉引用图,该图实现了两个实验数据集的组合。通过组合这些数据,我们能够执行比较两个实验结果的查询。我们发现,当数据被整合时,RDF 中很容易组合数据,并且很容易比较实验结果。我们得出结论,语义数据集成提供了一种方便、简单和灵活的解决方案,可用于整合已发表和未发表的实验数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5bd/2755824/a157d0eb1978/12859_2009_Article_3371_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验