• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

发布符合FAIR原则的数据:一种利用PHI数据库的范例方法

Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base.

作者信息

Rodríguez-Iglesias Alejandro, Rodríguez-González Alejandro, Irvine Alistair G, Sesma Ane, Urban Martin, Hammond-Kosack Kim E, Wilkinson Mark D

机构信息

Center for Plant Biotechnology and Genomics, Universidad Politécnica de Madrid Madrid, Spain.

ETS de Ingenieros Informáticos, Universidad Politécnica de Madrid Madrid, Spain.

出版信息

Front Plant Sci. 2016 May 12;7:641. doi: 10.3389/fpls.2016.00641. eCollection 2016.

DOI:10.3389/fpls.2016.00641
PMID:27433158
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4922217/
Abstract

Pathogen-Host interaction data is core to our understanding of disease processes and their molecular/genetic bases. Facile access to such core data is particularly important for the plant sciences, where individual genetic and phenotypic observations have the added complexity of being dispersed over a wide diversity of plant species vs. the relatively fewer host species of interest to biomedical researchers. Recently, an international initiative interested in scholarly data publishing proposed that all scientific data should be "FAIR"-Findable, Accessible, Interoperable, and Reusable. In this work, we describe the process of migrating a database of notable relevance to the plant sciences-the Pathogen-Host Interaction Database (PHI-base)-to a form that conforms to each of the FAIR Principles. We discuss the technical and architectural decisions, and the migration pathway, including observations of the difficulty and/or fidelity of each step. We examine how multiple FAIR principles can be addressed simultaneously through careful design decisions, including making data FAIR for both humans and machines with minimal duplication of effort. We note how FAIR data publishing involves more than data reformatting, requiring features beyond those exhibited by most life science Semantic Web or Linked Data resources. We explore the value-added by completing this FAIR data transformation, and then test the result through integrative questions that could not easily be asked over traditional Web-based data resources. Finally, we demonstrate the utility of providing explicit and reliable access to provenance information, which we argue enhances citation rates by encouraging and facilitating transparent scholarly reuse of these valuable data holdings.

摘要

病原体-宿主相互作用数据是我们理解疾病过程及其分子/遗传基础的核心。轻松获取此类核心数据对植物科学尤为重要,因为与生物医学研究人员感兴趣的宿主物种相对较少相比,个体遗传和表型观察在众多植物物种中更为分散。最近,一个关注学术数据发布的国际倡议提出,所有科学数据都应“FAIR”——即可查找、可访问、可互操作和可重用。在这项工作中,我们描述了将一个与植物科学显著相关的数据库——病原体-宿主相互作用数据库(PHI-base)迁移到符合每个FAIR原则的形式的过程。我们讨论了技术和架构决策以及迁移路径,包括对每个步骤的难度和/或保真度的观察。我们研究了如何通过精心的设计决策同时满足多个FAIR原则,包括以最小的工作量使数据对人类和机器都具有FAIR性。我们注意到FAIR数据发布不仅仅涉及数据重新格式化,还需要超越大多数生命科学语义网或关联数据资源所具备的功能。我们探讨了完成这种FAIR数据转换所带来的附加值,然后通过基于传统网络数据资源难以提出的综合问题来测试结果。最后,我们展示了提供明确且可靠的出处信息的效用,我们认为这通过鼓励和促进对这些宝贵数据资产的透明学术重用提高了引用率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ab0/4922217/2b1048348bba/fpls-07-00641-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ab0/4922217/2b1048348bba/fpls-07-00641-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ab0/4922217/2b1048348bba/fpls-07-00641-g0001.jpg

相似文献

1
Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base.发布符合FAIR原则的数据:一种利用PHI数据库的范例方法
Front Plant Sci. 2016 May 12;7:641. doi: 10.3389/fpls.2016.00641. eCollection 2016.
2
A framework for community curation of interspecies interactions literature.物种间相互作用文献的社区策展框架。
Elife. 2023 Jul 4;12:e84658. doi: 10.7554/eLife.84658.
3
A Data Transformation Methodology to Create Findable, Accessible, Interoperable, and Reusable Health Data: Software Design, Development, and Evaluation Study.一种创建可发现、可访问、可互操作和可重用健康数据的数据转换方法:软件设计、开发和评估研究。
J Med Internet Res. 2023 Mar 8;25:e42822. doi: 10.2196/42822.
4
Initiatives, Concepts, and Implementation Practices of the Findable, Accessible, Interoperable, and Reusable Data Principles in Health Data Stewardship: Scoping Review.健康数据治理中可发现性、可访问性、互操作性和可重用性数据原则的举措、概念和实施实践:范围综述。
J Med Internet Res. 2023 Aug 28;25:e45013. doi: 10.2196/45013.
5
: Advancing Information Search, Sharing and Reuse on Pharmacovigilance Signals via FAIR Principles and Semantic Web Technologies.通过FAIR原则和语义网技术推进药物警戒信号的信息搜索、共享和重用。
Front Pharmacol. 2018 Jun 26;9:609. doi: 10.3389/fphar.2018.00609. eCollection 2018.
6
Initiatives, Concepts, and Implementation Practices of FAIR (Findable, Accessible, Interoperable, and Reusable) Data Principles in Health Data Stewardship Practice: Protocol for a Scoping Review.健康数据管理实践中FAIR(可查找、可访问、可互操作和可重用)数据原则的倡议、概念及实施实践:一项范围综述方案
JMIR Res Protoc. 2021 Feb 2;10(2):e22505. doi: 10.2196/22505.
7
SOCCOMAS: a FAIR web content management system that uses knowledge graphs and that is based on semantic programming.SOCCOMAS:一个使用知识图谱且基于语义编程的 FAIR 网络内容管理系统。
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz067.
8
FAIR data station for lightweight metadata management and validation of omics studies.用于轻量级元数据管理和验证组学研究的 FAIR 数据站。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad014. Epub 2023 Mar 6.
9
Applying the FAIR principles to data in a hospital: challenges and opportunities in a pandemic.在医院的数据中应用 FAIR 原则:大流行中的挑战和机遇。
J Biomed Semantics. 2022 Apr 25;13(1):12. doi: 10.1186/s13326-022-00263-7.
10
Making Metadata Machine-Readable as the First Step to Providing Findable, Accessible, Interoperable, and Reusable Population Health Data: Framework Development and Implementation Study.将元数据转化为机器可读形式作为提供可查找、可访问、可互操作和可重用的人群健康数据的第一步:框架开发与实施研究
Online J Public Health Inform. 2024 Aug 1;16:e56237. doi: 10.2196/56237.

引用本文的文献

1
A framework for community curation of interspecies interactions literature.物种间相互作用文献的社区策展框架。
Elife. 2023 Jul 4;12:e84658. doi: 10.7554/eLife.84658.
2
A new framework for host-pathogen interaction research.宿主-病原体相互作用研究的新框架。
Front Immunol. 2022 Dec 15;13:1066733. doi: 10.3389/fimmu.2022.1066733. eCollection 2022.
3
medna-metadata: an open-source data management system for tracking environmental DNA samples and metadata.medna-metadata:一个用于跟踪环境 DNA 样本和元数据的开源数据管理系统。

本文引用的文献

1
The health care and life sciences community profile for dataset descriptions.数据集描述的医疗保健和生命科学领域概况。
PeerJ. 2016 Aug 16;4:e2331. doi: 10.7717/peerj.2331. eCollection 2016.
2
The FAIR Guiding Principles for scientific data management and stewardship.科学数据管理和保存的 FAIR 指导原则。
Sci Data. 2016 Mar 15;3:160018. doi: 10.1038/sdata.2016.18.
3
Public Data Archiving in Ecology and Evolution: How Well Are We Doing?生态学与进化领域的公共数据存档:我们做得如何?
Bioinformatics. 2022 Sep 30;38(19):4589-4597. doi: 10.1093/bioinformatics/btac556.
4
Systematic review of the status of veterinary epidemiological research in two species regarding the FAIR guiding principles.关于 FAIR 指导原则,对两种物种兽医流行病学研究现状的系统评价。
BMC Vet Res. 2021 Aug 11;17(1):270. doi: 10.1186/s12917-021-02971-1.
5
Challenges for FAIR-compliant description and comparison of crop phenotype data with standardized controlled vocabularies.符合 FAIR 原则的作物表型数据描述和标准化控制词汇比较面临的挑战。
Database (Oxford). 2021 May 15;2021. doi: 10.1093/database/baab028.
6
SCALEUS-FD: A FAIR Data Tool for Biomedical Applications.SCALEUS-FD:用于生物医学应用的公平数据工具。
Biomed Res Int. 2020 Aug 26;2020:3041498. doi: 10.1155/2020/3041498. eCollection 2020.
7
BioHackathon 2015: Semantics of data for life sciences and reproducible research.2015 年生物黑客马拉松:生命科学和可重复研究的数据语义学。
F1000Res. 2020 Feb 24;9:136. doi: 10.12688/f1000research.18236.1. eCollection 2020.
8
It's Hard to Avoid Avoidance: Uncoupling the Evolutionary Connection between Plant Growth, Productivity and Stress "Tolerance".难以回避的问题:植物生长、生产力和压力“耐受”之间的进化联系被割裂。
Int J Mol Sci. 2018 Nov 20;19(11):3671. doi: 10.3390/ijms19113671.
9
Establishing a distributed national research infrastructure providing bioinformatics support to life science researchers in Australia.建立一个分布式的国家研究基础设施,为澳大利亚的生命科学研究人员提供生物信息学支持。
Brief Bioinform. 2019 Mar 22;20(2):384-389. doi: 10.1093/bib/bbx071.
10
PHI-base: a new interface and further additions for the multi-species pathogen-host interactions database.PHI 数据库:多物种病原体 - 宿主相互作用数据库的新界面及更多补充内容
Nucleic Acids Res. 2017 Jan 4;45(D1):D604-D610. doi: 10.1093/nar/gkw1089. Epub 2016 Dec 3.
PLoS Biol. 2015 Nov 10;13(11):e1002295. doi: 10.1371/journal.pbio.1002295. eCollection 2015.
4
The 2015 Nucleic Acids Research Database Issue and molecular biology database collection.《核酸研究》2015年数据库专刊及分子生物学数据库合集。
Nucleic Acids Res. 2015 Jan;43(Database issue):D1-5. doi: 10.1093/nar/gku1241.
5
The Pathogen-Host Interactions database (PHI-base): additions and future developments.病原体-宿主相互作用数据库(PHI-base):新增内容及未来发展
Nucleic Acids Res. 2015 Jan;43(Database issue):D645-55. doi: 10.1093/nar/gku1165. Epub 2014 Nov 20.
6
Araport: the Arabidopsis information portal.Araport:拟南芥信息门户。
Nucleic Acids Res. 2015 Jan;43(Database issue):D1003-9. doi: 10.1093/nar/gku1200. Epub 2014 Nov 20.
7
BioBenchmark Toyama 2012: an evaluation of the performance of triple stores on biological data.2012年富山生物基准测试:三元组存储在生物数据上的性能评估
J Biomed Semantics. 2014 Jul 10;5:32. doi: 10.1186/2041-1480-5-32. eCollection 2014.
8
The Semanticscience Integrated Ontology (SIO) for biomedical research and knowledge discovery.用于生物医学研究和知识发现的语义科学集成本体(SIO)。
J Biomed Semantics. 2014 Mar 6;5(1):14. doi: 10.1186/2041-1480-5-14.
9
EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats.EDAM:生物信息学操作、数据和标识符类型、主题和格式的本体论。
Bioinformatics. 2013 May 15;29(10):1325-32. doi: 10.1093/bioinformatics/btt113. Epub 2013 Mar 11.
10
RNAcentral: A vision for an international database of RNA sequences.RNAcentral:建立国际 RNA 序列数据库的愿景。
RNA. 2011 Nov;17(11):1941-6. doi: 10.1261/rna.2750811. Epub 2011 Sep 22.