• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用自定义管道在云端进行FAIR生物医学数据分析的基础。

Fundamentals of FAIR biomedical data analyses in the cloud using custom pipelines.

作者信息

Berke Seth R, Kanchan Kanika, Marazita Mary L, Tobin Eric, Ruczinski Ingo

机构信息

Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America.

Division of Allergy and Clinical Immunology, Johns Hopkins School of Medicine, Baltimore, Maryland, United States of America.

出版信息

PLoS Comput Biol. 2025 Jul 2;21(7):e1013215. doi: 10.1371/journal.pcbi.1013215. eCollection 2025 Jul.

DOI:10.1371/journal.pcbi.1013215
PMID:40601758
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12221167/
Abstract

As the biomedical data ecosystem increasingly embraces the findable, accessible, interoperable, and reusable (FAIR) data principles to publish multimodal datasets to the cloud, opportunities for cloud-based research continue to expand. Besides the potential for accelerated and diverse biomedical discovery that comes from a harmonized data ecosystem, the cloud also presents a shift away from the standard practice of duplicating data to computational clusters or local computers for analysis. However, despite these benefits, researcher migration to the cloud has lagged, in part due to insufficient educational resources to train biomedical scientists on cloud infrastructure. There exists a conceptual lack especially around the crafting of custom analytic pipelines that require software not pre-installed by cloud analysis platforms. We here present three fundamental concepts necessary for custom pipeline creation in the cloud. These overarching concepts are workflow and cloud provider agnostic, extending the utility of this education to serve as a foundation for any computational analysis running any dataset in any biomedical cloud platform. We illustrate these concepts using one of our own custom analyses, a study using the case-parent trio design to detect sex-specific genetic effects on orofacial cleft (OFC) risk, which we crafted in the biomedical cloud analysis platform CAVATICA.

摘要

随着生物医学数据生态系统越来越多地采用可查找、可访问、可互操作和可重用(FAIR)的数据原则,将多模态数据集发布到云端,基于云的研究机会也在不断扩大。除了来自统一数据生态系统的加速和多样化生物医学发现的潜力之外,云还带来了一种转变,即从将数据复制到计算集群或本地计算机进行分析的标准做法中脱离出来。然而,尽管有这些好处,但研究人员向云端的迁移却滞后了,部分原因是缺乏足够的教育资源来培训生物医学科学家使用云基础设施。特别是在创建需要云分析平台未预先安装的软件的自定义分析管道方面,存在概念上的不足。我们在此介绍在云端创建自定义管道所需的三个基本概念。这些总体概念与工作流程和云提供商无关,扩展了这种教育的实用性,使其成为在任何生物医学云平台上运行任何数据集的任何计算分析的基础。我们使用我们自己的一项自定义分析来阐述这些概念,该分析是一项使用病例-父母三联体设计来检测性别特异性基因对口腔颌面裂(OFC)风险影响的研究,我们在生物医学云分析平台CAVATICA中完成了这项分析。

相似文献

1
Fundamentals of FAIR biomedical data analyses in the cloud using custom pipelines.使用自定义管道在云端进行FAIR生物医学数据分析的基础。
PLoS Comput Biol. 2025 Jul 2;21(7):e1013215. doi: 10.1371/journal.pcbi.1013215. eCollection 2025 Jul.
2
Antidepressants for pain management in adults with chronic pain: a network meta-analysis.抗抑郁药治疗成人慢性疼痛的疼痛管理:一项网络荟萃分析。
Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.
3
GRAPEVNE - Graphical Analytical Pipeline Development Environment for Infectious Diseases.GRAPEVNE - 传染病图形分析管道开发环境
Wellcome Open Res. 2025 May 27;10:279. doi: 10.12688/wellcomeopenres.23824.1. eCollection 2025.
4
Factors that influence parents' and informal caregivers' views and practices regarding routine childhood vaccination: a qualitative evidence synthesis.影响父母和非正式照顾者对常规儿童疫苗接种看法和做法的因素:定性证据综合分析。
Cochrane Database Syst Rev. 2021 Oct 27;10(10):CD013265. doi: 10.1002/14651858.CD013265.pub2.
5
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
6
Interventions for central serous chorioretinopathy: a network meta-analysis.中心性浆液性脉络膜视网膜病变的干预措施:一项网状Meta分析
Cochrane Database Syst Rev. 2025 Jun 16;6(6):CD011841. doi: 10.1002/14651858.CD011841.pub3.
7
Surgical interventions for treating extracapsular hip fractures in older adults: a network meta-analysis.老年人髋关节囊外骨折的手术干预:一项网络荟萃分析。
Cochrane Database Syst Rev. 2022 Feb 10;2(2):CD013405. doi: 10.1002/14651858.CD013405.pub2.
8
Nivolumab for adults with Hodgkin's lymphoma (a rapid review using the software RobotReviewer).纳武单抗用于成人霍奇金淋巴瘤(使用RobotReviewer软件进行的快速综述)
Cochrane Database Syst Rev. 2018 Jul 12;7(7):CD012556. doi: 10.1002/14651858.CD012556.pub2.
9
Drugs for preventing postoperative nausea and vomiting in adults after general anaesthesia: a network meta-analysis.成人全身麻醉后预防术后恶心呕吐的药物:网状Meta分析
Cochrane Database Syst Rev. 2020 Oct 19;10(10):CD012859. doi: 10.1002/14651858.CD012859.pub2.
10
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

本文引用的文献

1
Research for all: building a diverse researcher community for the All of Us Research Program.为所有人开展研究:为“我们所有人研究计划”建立多元化的研究人员群体。
J Am Med Inform Assoc. 2025 Jan 1;32(1):38-50. doi: 10.1093/jamia/ocae270.
2
The Galaxy platform for accessible, reproducible, and collaborative data analyses: 2024 update.Galaxy 平台,用于可访问、可重现和协作的数据分析:2024 年更新。
Nucleic Acids Res. 2024 Jul 5;52(W1):W83-W94. doi: 10.1093/nar/gkae410.
3
Packaging and containerization of computational methods.计算方法的封装和容器化。
Nat Protoc. 2024 Sep;19(9):2529-2539. doi: 10.1038/s41596-024-00986-0. Epub 2024 Apr 2.
4
Using existing pediatric cancer data from the Gabriella Miller Kids First Data Resource Program.利用 Gabriella Miller 儿童第一数据资源计划现有的儿科癌症数据。
JNCI Cancer Spectr. 2023 Oct 31;7(6). doi: 10.1093/jncics/pkad079.
5
FAIR in action - a flexible framework to guide FAIRification.实践中的 FAIR 原则 - 一个灵活的框架来指导 FAIR 化。
Sci Data. 2023 May 19;10(1):291. doi: 10.1038/s41597-023-02167-2.
6
Introducing the FAIR Principles for research software.提出研究软件的 FAIR 原则。
Sci Data. 2022 Oct 14;9(1):622. doi: 10.1038/s41597-022-01710-x.
7
From biomedical cloud platforms to microservices: next steps in FAIR data and analysis.从生物医学云平台到微服务:FAIR 数据和分析的下一步。
Sci Data. 2022 Sep 8;9(1):553. doi: 10.1038/s41597-022-01619-5.
8
The minimum information required for a glycomics experiment (MIRAGE): reporting guidelines for capillary electrophoresis.糖组学实验的最低信息要求 (MIRAGE):毛细管电泳报告指南。
Glycobiology. 2022 Jun 13;32(7):580-587. doi: 10.1093/glycob/cwac021.
9
International federation of genomic medicine databases using GA4GH standards.使用全球基因组与健康联盟(GA4GH)标准的国际基因组医学数据库联合会。
Cell Genom. 2021 Nov 10;1(2). doi: 10.1016/j.xgen.2021.100032.
10
Terra takes the pain out of 'omics' computing in the cloud.Terra消除了云端“组学”计算的痛苦。
Nature. 2022 Jan;601(7891):154-155. doi: 10.1038/d41586-021-03822-7.