• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生物样本数据库:FAIRer 样本元数据加速研究数据管理。

BioSamples database: FAIRer samples metadata to accelerate research data management.

机构信息

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.

出版信息

Nucleic Acids Res. 2022 Jan 7;50(D1):D1500-D1507. doi: 10.1093/nar/gkab1046.

DOI:10.1093/nar/gkab1046
PMID:34747489
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8728232/
Abstract

The BioSamples database at EMBL-EBI is the central institutional repository for sample metadata storage and connection to EMBL-EBI archives and other resources. The technical improvements to our infrastructure described in our last update have enabled us to scale and accommodate an increasing number of communities, resulting in a higher number of submissions and more heterogeneous data. The BioSamples database now has a valuable set of features and processes to improve data quality in BioSamples, and in particular enriching metadata content and following FAIR principles. In this manuscript, we describe how BioSamples in 2021 handles requirements from our community of users through exemplar use cases: increased findability of samples and improved data management practices support the goals of the ReSOLUTE project, how the plant community benefits from being able to link genotypic to phenotypic information, and we highlight how cumulatively those improvements contribute to more complex multi-omics data integration supporting COVID-19 research. Finally, we present underlying technical features used as pillars throughout those use cases and how they are reused for expanded engagement with communities such as FAIRplus and the Global Alliance for Genomics and Health. Availability: The BioSamples database is freely available at http://www.ebi.ac.uk/biosamples. Content is distributed under the EMBL-EBI Terms of Use available at https://www.ebi.ac.uk/about/terms-of-use. The BioSamples code is available at https://github.com/EBIBioSamples/biosamples-v4 and distributed under the Apache 2.0 license.

摘要

EMBL-EBI 的 BioSamples 数据库是样本元数据存储的中央机构存储库,可连接到 EMBL-EBI 档案和其他资源。我们在上一次更新中描述的基础设施技术改进使我们能够扩展并适应越来越多的社区,从而导致提交的数量增加,数据也更加多样化。BioSamples 数据库现在具有一系列有价值的功能和流程,可以提高 BioSamples 中的数据质量,特别是丰富元数据内容并遵循 FAIR 原则。在本文中,我们通过示例用例描述了 2021 年的 BioSamples 如何满足我们用户社区的需求:提高样本的可发现性和改进数据管理实践支持 ReSOLUTE 项目的目标,植物社区如何受益于能够将基因型与表型信息联系起来,我们还强调了这些改进如何有助于更复杂的多组学数据集成,从而支持 COVID-19 研究。最后,我们介绍了贯穿这些用例的基础技术功能,以及它们如何被重新用于与 FAIRplus 和全球基因组与健康联盟等社区的扩展参与。

可用性

BioSamples 数据库可在 http://www.ebi.ac.uk/biosamples 上免费获得。内容根据可在 https://www.ebi.ac.uk/about/terms-of-use 上获得的 EMBL-EBI 使用条款进行分发。BioSamples 代码可在 https://github.com/EBIBioSamples/biosamples-v4 上获得,并根据 Apache 2.0 许可证进行分发。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b58/8728232/80cd70ecc7dc/gkab1046fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b58/8728232/b09d65f579c9/gkab1046fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b58/8728232/80cd70ecc7dc/gkab1046fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b58/8728232/b09d65f579c9/gkab1046fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5b58/8728232/80cd70ecc7dc/gkab1046fig2.jpg

相似文献

1
BioSamples database: FAIRer samples metadata to accelerate research data management.生物样本数据库:FAIRer 样本元数据加速研究数据管理。
Nucleic Acids Res. 2022 Jan 7;50(D1):D1500-D1507. doi: 10.1093/nar/gkab1046.
2
BioSamples database: an updated sample metadata hub.BioSamples 数据库:更新的样本元数据中心。
Nucleic Acids Res. 2019 Jan 8;47(D1):D1172-D1178. doi: 10.1093/nar/gky1061.
3
MetaboLights: open data repository for metabolomics.MetaboLights:代谢组学开放数据知识库。
Nucleic Acids Res. 2024 Jan 5;52(D1):D640-D646. doi: 10.1093/nar/gkad1045.
4
The European Bioinformatics Institute: empowering cooperation in response to a global health crisis.欧洲生物信息学研究所:应对全球健康危机,助力合作。
Nucleic Acids Res. 2021 Jan 8;49(D1):D29-D37. doi: 10.1093/nar/gkaa1077.
5
MetaboLights: a resource evolving in response to the needs of its scientific community.代谢组学文献共享资源库(MetaboLights):一个响应其科研群体需求而不断发展的资源库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D440-D444. doi: 10.1093/nar/gkz1019.
6
ELIXIR biovalidator for semantic validation of life science metadata.ELIXIR 生物验证器,用于生命科学元数据的语义验证。
Bioinformatics. 2022 May 26;38(11):3141-3142. doi: 10.1093/bioinformatics/btac195.
7
The European Bioinformatics Institute (EMBL-EBI) in 2021.2021 年的欧洲生物信息学研究所(EMBL-EBI)。
Nucleic Acids Res. 2022 Jan 7;50(D1):D11-D19. doi: 10.1093/nar/gkab1127.
8
The sample locator: A federated search tool for biosamples and associated data in Europe using HL7 FHIR.样本定位器:一种使用 HL7 FHIR 的欧洲生物样本和相关数据的联合搜索工具。
Comput Biol Med. 2024 Sep;180:108941. doi: 10.1016/j.compbiomed.2024.108941. Epub 2024 Aug 5.
9
From ArrayExpress to BioStudies.从 ArrayExpress 到 BioStudies。
Nucleic Acids Res. 2021 Jan 8;49(D1):D1502-D1506. doi: 10.1093/nar/gkaa1062.
10
The BioSample Database (BioSD) at the European Bioinformatics Institute.欧洲生物信息学研究所的生物样本数据库 (BioSD)。
Nucleic Acids Res. 2012 Jan;40(Database issue):D64-70. doi: 10.1093/nar/gkr937. Epub 2011 Nov 16.

引用本文的文献

1
First release of the European marine omics biodiversity observation network (EMO BON) shotgun metagenomics data from water and sediment samples.欧洲海洋组学生物多样性观测网络(EMO BON)首次发布来自水和沉积物样本的鸟枪法宏基因组学数据。
Biodivers Data J. 2025 Mar 12;13:e143585. doi: 10.3897/BDJ.13.e143585. eCollection 2025.
2
A Call for Action: Lessons Learned From a Pilot to Share a Complex, Linked COVID-19 Cohort Dataset for Open Science.行动呼吁:从一个试点项目中汲取的经验教训,该项目旨在为开放科学共享一个复杂的、相互关联的COVID-19队列数据集。
JMIR Public Health Surveill. 2025 Feb 11;11:e63996. doi: 10.2196/63996.
3
HoloFood Data Portal: holo-omic datasets for analysing host-microbiota interactions in animal production.

本文引用的文献

1
The European Variation Archive: a FAIR resource of genomic variation for all species.欧洲变异档案库:一个面向所有物种的基因组变异的 FAIR 资源。
Nucleic Acids Res. 2022 Jan 7;50(D1):D1216-D1220. doi: 10.1093/nar/gkab960.
2
ISA API: An open platform for interoperable life science experimental metadata.ISA API:一个用于可互操作的生命科学实验元数据的开放平台。
Gigascience. 2021 Sep 16;10(9). doi: 10.1093/gigascience/giab060.
3
Mapping the human genetic architecture of COVID-19.绘制人类 COVID-19 遗传结构图谱。
全食物数据门户:用于分析动物生产中宿主-微生物群相互作用的全组学数据集。
Database (Oxford). 2025 Jan 11;2025. doi: 10.1093/database/baae112.
4
Biobank Digitalization: From Data Acquisition to Efficient Use.生物样本库数字化:从数据采集到高效利用
Biology (Basel). 2024 Nov 22;13(12):957. doi: 10.3390/biology13120957.
5
The PRIDE database at 20 years: 2025 update.20年的PRIDE数据库:2025年更新
Nucleic Acids Res. 2025 Jan 6;53(D1):D543-D553. doi: 10.1093/nar/gkae1011.
6
COPO - Managing sample metadata for biodiversity: considerations from the Darwin Tree of Life project.COPO - 管理生物多样性的样本元数据:来自达尔文生命之树项目的考量
Wellcome Open Res. 2024 Jun 10;7:279. doi: 10.12688/wellcomeopenres.18499.2. eCollection 2022.
7
Multiple graphical views for automatically generating SQL for the MycoDiversity DB; making fungal biodiversity studies accessible.用于为真菌多样性数据库自动生成SQL的多个图形视图;使真菌生物多样性研究变得容易进行。
Biodivers Data J. 2024 Jun 18;12:e119660. doi: 10.3897/BDJ.12.e119660. eCollection 2024.
8
A Standardized Nomenclature Design for Systematic Referencing and Identification of Animal Cellular Material.用于动物细胞材料系统引用和识别的标准化命名设计。
Animals (Basel). 2024 May 23;14(11):1541. doi: 10.3390/ani14111541.
9
Creation of Standardized Common Data Elements for Diagnostic Tests in Infectious Disease Studies: Semantic and Syntactic Mapping.创建传染病研究中诊断测试的标准化通用数据元素:语义和句法映射。
J Med Internet Res. 2024 Jun 10;26:e50049. doi: 10.2196/50049.
10
Global soil metagenomics reveals distribution and predominance of Deltaproteobacteria in nitrogen-fixing microbiome.全球土壤宏基因组学揭示固氮微生物组中δ变形菌的分布和优势地位。
Microbiome. 2024 May 24;12(1):95. doi: 10.1186/s40168-024-01812-1.
Nature. 2021 Dec;600(7889):472-477. doi: 10.1038/s41586-021-03767-x. Epub 2021 Jul 8.
4
The COVID-19 Data Portal: accelerating SARS-CoV-2 and COVID-19 research through rapid open access data sharing.COVID-19 数据门户:通过快速开放获取数据共享加速 SARS-CoV-2 和 COVID-19 研究。
Nucleic Acids Res. 2021 Jul 2;49(W1):W619-W623. doi: 10.1093/nar/gkab417.
5
Applying FAIR Principles to Plant Phenotypic Data Management in GnpIS.将公平原则应用于全球植物表型信息系统中的植物表型数据管理。
Plant Phenomics. 2019 Apr 30;2019:1671403. doi: 10.34133/2019/1671403. eCollection 2019.
6
The European Nucleotide Archive in 2020.2020 年的欧洲核苷酸档案库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D82-D85. doi: 10.1093/nar/gkaa1028.
7
The international nucleotide sequence database collaboration.国际核苷酸序列数据库合作组织。
Nucleic Acids Res. 2021 Jan 8;49(D1):D121-D124. doi: 10.1093/nar/gkaa967.
8
NCBI Taxonomy: a comprehensive update on curation, resources and tools.NCBI 分类学:在管理、资源和工具方面的全面更新。
Database (Oxford). 2020 Jan 1;2020. doi: 10.1093/database/baaa062.
9
The challenges in data integration - heterogeneity and complexity in clinical trials and patient registries of Systemic Lupus Erythematosus.数据集成面临的挑战 - 系统性红斑狼疮临床试验和患者登记处的异质性和复杂性。
BMC Med Res Methodol. 2020 Jun 24;20(1):164. doi: 10.1186/s12874-020-01057-0.
10
Tara Oceans: towards global ocean ecosystems biology.塔拉海洋:走向全球海洋生态系统生物学。
Nat Rev Microbiol. 2020 Aug;18(8):428-445. doi: 10.1038/s41579-020-0364-5. Epub 2020 May 12.