基于语义的EMBOSS服务组合。

Semantics-based composition of EMBOSS services.

作者信息

Lamprecht Anna-Lena, Naujokat Stefan, Margaria Tiziana, Steffen Bernhard

机构信息

Chair for Programming Systems, Technical University Dortmund, Dortmund, D-44227, Germany.

出版信息

J Biomed Semantics. 2011 Mar 7;2 Suppl 1(Suppl 1):S5. doi: 10.1186/2041-1480-2-S1-S5.

DOI:10.1186/2041-1480-2-S1-S5

PMID:21388574

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3105497/

Abstract

BACKGROUND

More than in other domains the heterogeneous services world in bioinformatics demands for a methodology to classify and relate resources in a both human and machine accessible manner. The Semantic Web, which is meant to address exactly this challenge, is currently one of the most ambitious projects in computer science. Collective efforts within the community have already led to a basis of standards for semantic service descriptions and meta-information. In combination with process synthesis and planning methods, such knowledge about types and services can facilitate the automatic composition of workflows for particular research questions.

RESULTS

In this study we apply the synthesis methodology that is available in the Bio-jETI workflow management framework for the semantics-based composition of EMBOSS services. EMBOSS (European Molecular Biology Open Software Suite) is a collection of 350 tools (March 2010) for various sequence analysis tasks, and thus a rich source of services and types that imply comprehensive domain models for planning and synthesis approaches. We use and compare two different setups of our EMBOSS synthesis domain: 1) a manually defined domain setup where an intuitive, high-level, semantically meaningful nomenclature is applied to describe the input/output behavior of the single EMBOSS tools and their classifications, and 2) a domain setup where this information has been automatically derived from the EMBOSS Ajax Command Definition (ACD) files and the EMBRACE Data and Methods ontology (EDAM). Our experiments demonstrate that these domain models in combination with our synthesis methodology greatly simplify working with the large, heterogeneous, and hence manually intractable EMBOSS collection. However, they also show that with the information that can be derived from the (current) ACD files and EDAM ontology alone, some essential connections between services can not be recognized.

CONCLUSIONS

Our results show that adequate domain modeling requires to incorporate as much domain knowledge as possible, far beyond the mere technical aspects of the different types and services. Finding or defining semantically appropriate service and type descriptions is a difficult task, but the bioinformatics community appears to be on the right track towards a Life Science Semantic Web, which will eventually allow automatic service composition methods to unfold their full potential.

摘要

背景

与其他领域相比，生物信息学中异构服务的世界更需要一种方法，以便以人类和机器都能访问的方式对资源进行分类和关联。语义网旨在应对这一挑战，目前是计算机科学中最具雄心的项目之一。社区内的集体努力已经形成了语义服务描述和元信息的标准基础。结合流程合成和规划方法，这种关于类型和服务的知识可以促进针对特定研究问题的工作流自动组合。

结果

在本研究中，我们应用了Bio-jETI工作流管理框架中可用的合成方法，用于基于语义的EMBOSS服务组合。EMBOSS（欧洲分子生物学开放软件套件）是一个包含350个工具（截至2010年3月）的集合，用于各种序列分析任务，因此是丰富的服务和类型来源，为规划和合成方法暗示了全面的领域模型。我们使用并比较了EMBOSS合成领域的两种不同设置：1）手动定义的领域设置，其中应用直观、高级、语义有意义的术语来描述单个EMBOSS工具的输入/输出行为及其分类；2）领域设置，其中此信息已从EMBOSS Ajax命令定义（ACD）文件和EMBRACE数据与方法本体（EDAM）自动派生。我们的实验表明，这些领域模型与我们的合成方法相结合，极大地简化了处理庞大、异构且因此难以手动处理的EMBOSS集合的工作。然而，它们也表明，仅从（当前的）ACD文件和EDAM本体中获取的信息，无法识别服务之间的一些基本联系。

结论

我们的结果表明，适当的领域建模需要纳入尽可能多的领域知识，远远超出不同类型和服务的纯粹技术方面。找到或定义语义上合适的服务和类型描述是一项艰巨的任务，但生物信息学社区似乎正朝着生命科学语义网的正确方向前进，这最终将使自动服务组合方法充分发挥其潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bef5/3105497/ad2fa507f7dc/2041-1480-2-S1-S5-1.jpg

相似文献

Semantics-based composition of EMBOSS services.

J Biomed Semantics. 2011 Mar 7;2 Suppl 1(Suppl 1):S5. doi: 10.1186/2041-1480-2-S1-S5.

Bio-jETI: a framework for semantics-based service composition.

BMC Bioinformatics. 2009 Oct 1;10 Suppl 10(Suppl 10):S8. doi: 10.1186/1471-2105-10-S10-S8.

KBWS: an EMBOSS associated package for accessing bioinformatics web services.

Source Code Biol Med. 2011 Apr 29;6:8. doi: 10.1186/1751-0473-6-8.

GEMBASSY: an EMBOSS associated software package for comprehensive genome analyses.

Source Code Biol Med. 2013 Aug 29;8(1):17. doi: 10.1186/1751-0473-8-17.

The design of Jemboss: a graphical user interface to EMBOSS.

Bioinformatics. 2003 Sep 22;19(14):1837-43. doi: 10.1093/bioinformatics/btg251.

EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats.

Bioinformatics. 2013 May 15;29(10):1325-32. doi: 10.1093/bioinformatics/btt113. Epub 2013 Mar 11.

SemanticSCo: A platform to support the semantic composition of services for gene expression analysis.

J Biomed Inform. 2017 Feb;66:116-128. doi: 10.1016/j.jbi.2016.12.014. Epub 2017 Jan 3.

GeneFisher-P: variations of GeneFisher as processes in Bio-jETI.

BMC Bioinformatics. 2008 Apr 25;9 Suppl 4(Suppl 4):S13. doi: 10.1186/1471-2105-9-S4-S13.

BioXSD: the common data-exchange format for everyday bioinformatics web services.

Bioinformatics. 2010 Sep 15;26(18):i540-6. doi: 10.1093/bioinformatics/btq391.

Automated workflow composition in mass spectrometry-based proteomics.

Bioinformatics. 2019 Feb 15;35(4):656-664. doi: 10.1093/bioinformatics/bty646.

引用本文的文献

Systematic optimization of antimicrobial peptides against .

JAC Antimicrob Resist. 2024 Jul 3;6(4):dlae096. doi: 10.1093/jacamr/dlae096. eCollection 2024 Aug.

Insights into the phylogenetic relationships and species boundaries of the complex (Tamaricaceae) based on the complete chloroplast genome.

PeerJ. 2023 Dec 11;11:e16642. doi: 10.7717/peerj.16642. eCollection 2023.

Designing a Multi-Epitope Antigen for Serodiagnosis of Based on L3Nie.01 and IgG Immunoreactive Epitopes.

Avicenna J Med Biotechnol. 2022 Apr-Jun;14(2):114-124. doi: 10.18502/ajmb.v14i2.8886.

A broad survey of DNA sequence data simulation tools.

Brief Funct Genomics. 2020 Jan 22;19(1):49-59. doi: 10.1093/bfgp/elz033.

Community curation of bioinformatics software and data resources.

Brief Bioinform. 2020 Sep 25;21(5):1697-1705. doi: 10.1093/bib/bbz075.

Automated workflow composition in mass spectrometry-based proteomics.

Bioinformatics. 2019 Feb 15;35(4):656-664. doi: 10.1093/bioinformatics/bty646.

ReGaTE: Registration of Galaxy Tools in Elixir.

Gigascience. 2017 Jun 1;6(6):1-4. doi: 10.1093/gigascience/gix022.

EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats.

Bioinformatics. 2013 May 15;29(10):1325-32. doi: 10.1093/bioinformatics/btt113. Epub 2013 Mar 11.

Bacteriophage cocktail significantly reduces Escherichia coli O157: H7 contamination of lettuce and beef, but does not protect against recontamination.

Bacteriophage. 2012 Jul 1;2(3):178-185. doi: 10.4161/bact.22825.

Accelerating cancer systems biology research through Semantic Web technology.

Wiley Interdiscip Rev Syst Biol Med. 2013 Mar-Apr;5(2):135-51. doi: 10.1002/wsbm.1200. Epub 2012 Nov 27.

本文引用的文献

Bio-jETI: a framework for semantics-based service composition.

BMC Bioinformatics. 2009 Oct 1;10 Suppl 10(Suppl 10):S8. doi: 10.1186/1471-2105-10-S10-S8.

Knowledge-driven enhancements for task composition in bioinformatics.

BMC Bioinformatics. 2009 Oct 1;10 Suppl 10(Suppl 10):S12. doi: 10.1186/1471-2105-10-S10-S12.

Web API for biology with a workflow navigation system.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W11-6. doi: 10.1093/nar/gkp300. Epub 2009 May 5.

GeneFisher-P: variations of GeneFisher as processes in Bio-jETI.

BMC Bioinformatics. 2008 Apr 25;9 Suppl 4(Suppl 4):S13. doi: 10.1186/1471-2105-9-S4-S13.

Bio-jETI: a service integration, design, and provisioning platform for orchestrated bioinformatics processes.

BMC Bioinformatics. 2008 Apr 25;9 Suppl 4(Suppl 4):S12. doi: 10.1186/1471-2105-9-S4-S12.

A Semantic Web for bioinformatics: goals, tools, systems, applications.

BMC Bioinformatics. 2008 Apr 25;9 Suppl 4(Suppl 4):S1. doi: 10.1186/1471-2105-9-S4-S1.

Semi-automatic web service composition for the life sciences using the BioMoby semantic web framework.

J Biomed Inform. 2008 Oct;41(5):837-47. doi: 10.1016/j.jbi.2008.02.005. Epub 2008 Mar 4.

The (my)Grid ontology: bioinformatics service discovery.

Int J Bioinform Res Appl. 2007;3(3):303-25. doi: 10.1504/IJBRA.2007.015005.

The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration.

Nat Biotechnol. 2007 Nov;25(11):1251-5. doi: 10.1038/nbt1346.

REMORA: a pilot in the ocean of BioMoby web-services.

Bioinformatics. 2006 Apr 1;22(7):900-1. doi: 10.1093/bioinformatics/btl001. Epub 2006 Jan 19.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于语义的EMBOSS服务组合。

Semantics-based composition of EMBOSS services.

作者信息

Lamprecht Anna-Lena, Naujokat Stefan, Margaria Tiziana, Steffen Bernhard

机构信息

Chair for Programming Systems, Technical University Dortmund, Dortmund, D-44227, Germany.