Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies.

Affiliations

Southern Cross Plant Science, Southern Cross University, PO Box 157, Lismore, NSW 2480, Australia.

School of Biosciences, University of Nottingham, Sutton Bonington, Leicestershire, LE12 5RD, Nottingham, UK.

Publication information

Database (Oxford). 2021 May 15;2021. doi: 10.1093/database/baab028.

DOI:10.1093/database/baab028

PMID:33991093

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC8122365/

Abstract

Crop phenotypic data underpin many pre-breeding efforts to characterize variation within germplasm collections. Although there has been an increase in the global capacity for accumulating and comparing such data, a lack of consistency in the systematic description of metadata often limits integration and sharing. We therefore aimed to understand some of the challenges facing findable, accessible, interoperable and reusable (FAIR) curation and annotation of phenotypic data from minor and underutilized crops. We used bambara groundnut (Vigna subterranea) as an exemplar underutilized crop to assess the ability of the Crop Ontology system to facilitate curation of trait datasets, so that they are accessible for comparative analysis. This involved generating a controlled vocabulary Trait Dictionary of 134 terms. Systematic quantification of syntactic and semantic cohesiveness of the full set of 28 crop-specific COs identified inconsistencies between trait descriptor names, a relative lack of cross-referencing to other ontologies and a flat ontological structure for classifying traits. We also evaluated the Minimal Information About a Phenotyping Experiment and FAIR compliance of bambara trait datasets curated within the CropStoreDB schema. We discuss specifications for a more systematic and generic approach to trait controlled vocabularies, which would benefit from representation of terms that adhere to Open Biological and Biomedical Ontologies principles. In particular, we focus on the benefits of reuse of existing definitions within pre- and post-composed axioms from other domains in order to facilitate the curation and comparison of datasets from a wider range of crops. Database URL: https://www.cropstoredb.org/cs_bambara.html.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db65/8122365/81bad314e1f5/baab028f1.jpg
