• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从生物库和数据孤岛到数据共享:汇聚以支持转化医学。

From biobank and data silos into a data commons: convergence to support translational medicine.

机构信息

Department of Molecular Oncology, BC Cancer Research Centre, 675 West 10th Avenue, Vancouver, BC, V5Z 1L3, Canada.

BC Children's Hospital Research Institute, 938 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada.

出版信息

J Transl Med. 2021 Dec 4;19(1):493. doi: 10.1186/s12967-021-03147-z.

DOI:10.1186/s12967-021-03147-z
PMID:34863191
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8645144/
Abstract

BACKGROUND

To drive translational medicine, modern day biobanks need to integrate with other sources of data (clinical, genomics) to support novel data-intensive research. Currently, vast amounts of research and clinical data remain in silos, held and managed by individual researchers, operating under different standards and governance structures; a framework that impedes sharing and effective use of data. In this article, we describe the journey of British Columbia's Gynecological Cancer Research Program (OVCARE) in moving a traditional tumour biobank, outcomes unit, and a collection of data silos, into an integrated data commons to support data standardization and resource sharing under collaborative governance, as a means of providing the gynecologic cancer research community in British Columbia access to tissue samples and associated clinical and molecular data from thousands of patients.

RESULTS

Through several engagements with stakeholders from various research institutions within our research community, we identified priorities and assessed infrastructure needs required to optimize and support data collections, storage and sharing, under three main research domains: (1) biospecimen collections, (2) molecular and genomics data, and (3) clinical data. We further built a governance model and a resource portal to implement protocols and standard operating procedures for seamless collections, management and governance of interoperable data, making genomic, and clinical data available to the broader research community.

CONCLUSIONS

Proper infrastructures for data collection, sharing and governance is a translational research imperative. We have consolidated our data holdings into a data commons, along with standardized operating procedures to meet research and ethics requirements of the gynecologic cancer community in British Columbia. The developed infrastructure brings together, diverse data, computing frameworks, as well as tools and applications for managing, analyzing, and sharing data. Our data commons bridges data access gaps and barriers to precision medicine and approaches for diagnostics, treatment and prevention of gynecological cancers, by providing access to large datasets required for data-intensive science.

摘要

背景

为推动转化医学的发展,现代生物银行需要与其他数据源(临床、基因组学)集成,以支持新的数据密集型研究。目前,大量的研究和临床数据仍然存在于各个领域,由个体研究人员持有和管理,他们遵循不同的标准和治理结构;这种框架阻碍了数据的共享和有效利用。在本文中,我们描述了不列颠哥伦比亚妇科癌症研究计划(OVCARE)的历程,该计划将传统的肿瘤生物银行、结果单位和一系列数据孤岛整合到一个集成的数据公共设施中,以支持数据标准化和资源共享,并采用协作治理的方式,为不列颠哥伦比亚的妇科癌症研究界提供数千名患者的组织样本以及相关的临床和分子数据。

结果

通过与我们研究社区内各研究机构的利益相关者进行多次接触,我们确定了优先事项,并评估了优化和支持数据收集、存储和共享所需的基础设施需求,主要涉及三个研究领域:(1)生物样本收集,(2)分子和基因组数据,以及(3)临床数据。我们进一步构建了一个治理模型和资源门户,以实施协议和标准操作程序,实现可互操作数据的无缝收集、管理和治理,使基因组和临床数据能够为更广泛的研究社区所利用。

结论

适当的数据收集、共享和治理基础设施是转化研究的当务之急。我们已经将数据保存到一个数据公共设施中,并制定了标准化的操作程序,以满足不列颠哥伦比亚妇科癌症社区的研究和伦理要求。所开发的基础设施汇集了多样化的数据、计算框架以及用于管理、分析和共享数据的工具和应用程序。我们的数据公共设施弥合了数据访问差距和障碍,为精准医学以及诊断、治疗和预防妇科癌症的方法提供了支持,为数据密集型科学提供了所需的大型数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec3/8645144/240b65c4dda9/12967_2021_3147_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec3/8645144/d1325c38fb83/12967_2021_3147_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec3/8645144/4694ed1b78ce/12967_2021_3147_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec3/8645144/883d43af5393/12967_2021_3147_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec3/8645144/240b65c4dda9/12967_2021_3147_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec3/8645144/d1325c38fb83/12967_2021_3147_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec3/8645144/4694ed1b78ce/12967_2021_3147_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec3/8645144/883d43af5393/12967_2021_3147_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec3/8645144/240b65c4dda9/12967_2021_3147_Fig4_HTML.jpg

相似文献

1
From biobank and data silos into a data commons: convergence to support translational medicine.从生物库和数据孤岛到数据共享:汇聚以支持转化医学。
J Transl Med. 2021 Dec 4;19(1):493. doi: 10.1186/s12967-021-03147-z.
2
Biobank for Translational Medicine: Standard Operating Procedures for Optimal Sample Management.转化医学生物样本库:最佳样本管理的标准操作规程。
J Vis Exp. 2022 Nov 30(189). doi: 10.3791/63950.
3
Balancing the local and the universal in maintaining ethical access to a genomics biobank.在维持对基因组生物样本库的伦理获取方面平衡地方与普遍因素。
BMC Med Ethics. 2017 Dec 28;18(1):80. doi: 10.1186/s12910-017-0240-7.
4
The State of Cloud-Based Biospecimen and Biobank Data Management Tools.基于云的生物样本和生物样本库数据管理工具的现状。
Biopreserv Biobank. 2017 Apr;15(2):169-172. doi: 10.1089/bio.2017.0019. Epub 2017 Mar 16.
5
Certification for biobanks: the program developed by the Canadian Tumour Repository Network (CTRNet).生物样本库认证:由加拿大肿瘤样本库网络(CTRNet)开发的项目。
Biopreserv Biobank. 2012 Oct;10(5):426-32. doi: 10.1089/bio.2012.0026.
6
The biobank for the molecular classification of kidney disease: research translation and precision medicine in nephrology.肾脏疾病分子分类生物样本库:肾脏病学中的研究转化与精准医学
BMC Nephrol. 2017 Jul 26;18(1):252. doi: 10.1186/s12882-017-0669-4.
7
"It's all about trust": reflections of researchers on the complexity and controversy surrounding biobanking in South Africa.“一切都关乎信任”:研究人员对南非生物样本库相关复杂性与争议的反思
BMC Med Ethics. 2016 Oct 10;17(1):57. doi: 10.1186/s12910-016-0140-2.
8
The iTHRIV Commons: a cross-institution information and health research data sharing architecture and web application.iTHRIV Commons:一个跨机构的信息和健康研究数据共享架构和网络应用程序。
J Am Med Inform Assoc. 2022 Mar 15;29(4):631-642. doi: 10.1093/jamia/ocab262.
9
Biorepositories and Databanks for the Development of Novel Biomarkers for Genitourinary Cancer Prevention and Management.用于开发新型生物标志物以预防和管理泌尿生殖系统癌症的生物库和数据库。
Eur Urol Focus. 2021 May;7(3):513-521. doi: 10.1016/j.euf.2021.06.002. Epub 2021 Jun 22.
10
A strategic plan for the second phase (2013-2015) of the Korea biobank project.韩国生物样本库项目第二阶段(2013 - 2015年)战略计划。
Osong Public Health Res Perspect. 2013 Apr;4(2):107-16. doi: 10.1016/j.phrp.2013.03.006.

引用本文的文献

1
Current Bioinformatics Tools in Precision Oncology.精准肿瘤学中的当前生物信息学工具
MedComm (2020). 2025 Jul 9;6(7):e70243. doi: 10.1002/mco2.70243. eCollection 2025 Jul.
2
Status and opportunities of machine learning applications in obstructive sleep apnea: A narrative review.机器学习在阻塞性睡眠呼吸暂停中的应用现状与机遇:一项叙述性综述
Comput Struct Biotechnol J. 2025 Apr 25;28:167-174. doi: 10.1016/j.csbj.2025.04.033. eCollection 2025.
3
Simple controls exceed best deep learning algorithms and reveal foundation model effectiveness for predicting genetic perturbations.

本文引用的文献

1
Translational Research in the Era of Precision Medicine: Where We Are and Where We Will Go.精准医学时代的转化研究:我们所处的位置与前行的方向。
J Pers Med. 2021 Mar 18;11(3):216. doi: 10.3390/jpm11030216.
2
Comprehensive genomic analysis of dietary habits in UK Biobank identifies hundreds of genetic associations.英国生物银行饮食习惯的综合基因组分析确定了数百种遗传关联。
Nat Commun. 2020 Mar 19;11(1):1467. doi: 10.1038/s41467-020-15193-0.
3
Identifying cross-disease components of genetic risk across hospital data in the UK Biobank.
简单的控制方法超越了最佳深度学习算法,并揭示了基础模型在预测基因扰动方面的有效性。
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf317.
4
Status and Opportunities of Machine Learning Applications in Obstructive Sleep Apnea: A Narrative Review.机器学习在阻塞性睡眠呼吸暂停中的应用现状与机遇:一篇叙述性综述
medRxiv. 2025 May 10:2025.02.27.25322950. doi: 10.1101/2025.02.27.25322950.
5
Federated Learning: Breaking Down Barriers in Global Genomic Research.联邦学习:打破全球基因组研究中的障碍。
Genes (Basel). 2024 Dec 22;15(12):1650. doi: 10.3390/genes15121650.
6
Biobank Digitalization: From Data Acquisition to Efficient Use.生物样本库数字化:从数据采集到高效利用
Biology (Basel). 2024 Nov 22;13(12):957. doi: 10.3390/biology13120957.
7
Building Flexible, Scalable, and Machine Learning-Ready Multimodal Oncology Datasets.构建灵活、可扩展且可适应机器学习的多模态肿瘤学数据集。
Sensors (Basel). 2024 Mar 2;24(5):1634. doi: 10.3390/s24051634.
8
Data Liberation and Crowdsourcing in Medical Research: The Intersection of Collective and Artificial Intelligence.数据解放与医学研究中的众包:集体智慧与人工智能的交集。
Radiol Artif Intell. 2024 Jan;6(1):e230006. doi: 10.1148/ryai.230006.
9
A data-driven approach to decode metabolic dysfunction-associated steatotic liver disease.一种基于数据驱动的方法来解码代谢功能障碍相关脂肪性肝病。
Ann Hepatol. 2024 Mar-Apr;29(2):101278. doi: 10.1016/j.aohep.2023.101278. Epub 2023 Dec 20.
10
An integrated gene-to-outcome multimodal database for metabolic dysfunction-associated steatotic liver disease.代谢功能障碍相关脂肪性肝病的基因-表型多模态综合数据库。
Nat Med. 2023 Nov;29(11):2939-2953. doi: 10.1038/s41591-023-02602-2. Epub 2023 Oct 30.
在英国生物库的医院数据中识别跨疾病的遗传风险成分。
Nat Genet. 2020 Jan;52(1):126-134. doi: 10.1038/s41588-019-0550-4. Epub 2019 Dec 23.
4
Artificial intelligence for precision medicine in neurodevelopmental disorders.用于神经发育障碍精准医学的人工智能
NPJ Digit Med. 2019 Nov 21;2:112. doi: 10.1038/s41746-019-0191-0. eCollection 2019.
5
Identification of novel mutations of ovarian cancer-related genes from RNA-sequencing data for Japanese epithelial ovarian cancer patients.从日本上皮性卵巢癌患者的 RNA 测序数据中鉴定卵巢癌相关基因的新突变。
Endocr J. 2020 Feb 28;67(2):219-229. doi: 10.1507/endocrj.EJ19-0283. Epub 2019 Nov 19.
6
Biobanking in health care: evolution and future directions.生物银行在医疗保健中的应用:发展与未来方向。
J Transl Med. 2019 May 22;17(1):172. doi: 10.1186/s12967-019-1922-3.
7
The REDCap consortium: Building an international community of software platform partners.REDCap 联盟:构建软件平台合作伙伴的国际社区。
J Biomed Inform. 2019 Jul;95:103208. doi: 10.1016/j.jbi.2019.103208. Epub 2019 May 9.
8
Merging heterogeneous clinical data to enable knowledge discovery.合并异类临床数据以实现知识发现。
Pac Symp Biocomput. 2019;24:439-443.
9
Integrative analysis of the cancer genome atlas and cancer cell lines encyclopedia large-scale genomic databases: MUC4/MUC16/MUC20 signature is associated with poor survival in human carcinomas.癌症基因组图谱和癌症细胞系百科全书大规模基因组数据库的综合分析:MUC4/MUC16/MUC20 标志物与人类癌中的不良预后相关。
J Transl Med. 2018 Sep 20;16(1):259. doi: 10.1186/s12967-018-1632-2.
10
Systematic analysis of gene expression alterations and clinical outcomes of STAT3 in cancer.癌症中STAT3基因表达改变与临床结局的系统分析。
Oncotarget. 2017 Dec 14;9(3):3198-3213. doi: 10.18632/oncotarget.23226. eCollection 2018 Jan 9.