BioSamples database: FAIRer samples metadata to accelerate research data management.

Affiliation

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.

Publication information

Nucleic Acids Res. 2022 Jan 7;50(D1):D1500-D1507. doi: 10.1093/nar/gkab1046.

Abstract

The BioSamples database at EMBL-EBI is the central institutional repository for sample metadata storage and connection to EMBL-EBI archives and other resources. The technical improvements to our infrastructure described in our last update have enabled us to scale and accommodate an increasing number of communities, resulting in a higher number of submissions and more heterogeneous data. The BioSamples database now has a valuable set of features and processes to improve data quality, in particular by enriching metadata content and following FAIR principles. In this manuscript, we describe how BioSamples in 2021 handles requirements from our community of users through exemplar use cases: how increased findability of samples and improved data management practices support the goals of the ReSOLUTE project, how the plant community benefits from being able to link genotypic to phenotypic information, and how, cumulatively, those improvements contribute to more complex multi-omics data integration supporting COVID-19 research. Finally, we present the underlying technical features used as pillars throughout those use cases and how they are reused for expanded engagement with communities such as FAIRplus and the Global Alliance for Genomics and Health.

Availability: The BioSamples database is freely available at http://www.ebi.ac.uk/biosamples. Content is distributed under the EMBL-EBI Terms of Use available at https://www.ebi.ac.uk/about/terms-of-use. The BioSamples code is available at https://github.com/EBIBioSamples/biosamples-v4 and distributed under the Apache 2.0 license.
