• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一组经过精心整理的生物医学数据和临床病例报告元数据的参考集。

A reference set of curated biomedical data and metadata from clinical case reports.

机构信息

The NIH BD2K Center of Excellence in Biomedical Computing, University of California at Los Angeles, Los Angeles, CA 90095, USA.

Department of Physiology, University of California at Los Angeles, Los Angeles, CA 90095, USA.

出版信息

Sci Data. 2018 Nov 20;5:180258. doi: 10.1038/sdata.2018.258.

DOI:10.1038/sdata.2018.258
PMID:30457569
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6244181/
Abstract

Clinical case reports (CCRs) provide an important means of sharing clinical experiences about atypical disease phenotypes and new therapies. However, published case reports contain largely unstructured and heterogeneous clinical data, posing a challenge to mining relevant information. Current indexing approaches generally concern document-level features and have not been specifically designed for CCRs. To address this disparity, we developed a standardized metadata template and identified text corresponding to medical concepts within 3,100 curated CCRs spanning 15 disease groups and more than 750 reports of rare diseases. We also prepared a subset of metadata on reports on selected mitochondrial diseases and assigned ICD-10 diagnostic codes to each. The resulting resource, Metadata Acquired from Clinical Case Reports (MACCRs), contains text associated with high-level clinical concepts, including demographics, disease presentation, treatments, and outcomes for each report. Our template and MACCR set render CCRs more findable, accessible, interoperable, and reusable (FAIR) while serving as valuable resources for key user groups, including researchers, physician investigators, clinicians, data scientists, and those shaping government policies for clinical trials.

摘要

临床病例报告(CCR)为分享关于非典型疾病表型和新疗法的临床经验提供了重要手段。然而,已发表的病例报告包含大量非结构化和异质的临床数据,这给挖掘相关信息带来了挑战。目前的索引方法通常关注文档级别的特征,而不是专门为 CCR 设计的。为了解决这一差异,我们开发了一个标准化的元数据模板,并在 15 个疾病组的 3100 个经过策展的 CCR 中确定了与医疗概念相对应的文本,这些 CCR 涵盖了超过 750 份罕见疾病报告。我们还为选定的线粒体疾病报告准备了一部分元数据,并为每个报告分配了 ICD-10 诊断代码。由此产生的资源,从临床病例报告中获取的元数据(MACCR),包含与高级临床概念相关的文本,包括每个报告的人口统计学、疾病表现、治疗和结果。我们的模板和 MACCR 集使 CCR 更易于发现、访问、互操作和重用(FAIR),同时也为关键用户群体(包括研究人员、医师调查员、临床医生、数据科学家以及制定临床试验政府政策的人员)提供了有价值的资源。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d935/6244181/5541cd6fa676/sdata2018258-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d935/6244181/90500098a898/sdata2018258-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d935/6244181/1be1540221f5/sdata2018258-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d935/6244181/5541cd6fa676/sdata2018258-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d935/6244181/90500098a898/sdata2018258-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d935/6244181/1be1540221f5/sdata2018258-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d935/6244181/5541cd6fa676/sdata2018258-f3.jpg

相似文献

1
A reference set of curated biomedical data and metadata from clinical case reports.一组经过精心整理的生物医学数据和临床病例报告元数据的参考集。
Sci Data. 2018 Nov 20;5:180258. doi: 10.1038/sdata.2018.258.
2
A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts.一种用于临床病例报告的元数据提取方法,以促进对生物医学概念的深入理解。
J Vis Exp. 2018 Sep 20(139):58392. doi: 10.3791/58392.
3
BioSharing: curated and crowd-sourced metadata standards, databases and data policies in the life sciences.生物数据共享:生命科学领域经整理和众包的元数据标准、数据库及数据政策。
Database (Oxford). 2016 May 17;2016. doi: 10.1093/database/baw075. Print 2016.
4
FAANG, establishing metadata standards, validation and best practices for the farmed and companion animal community.FAANG正在为养殖动物和伴侣动物群体建立元数据标准、验证方法和最佳实践。
Anim Genet. 2018 Dec;49(6):520-526. doi: 10.1111/age.12736. Epub 2018 Oct 12.
5
Maximizing the reusability of gene expression data by predicting missing metadata.通过预测缺失的元数据来最大化基因表达数据的可重用性。
PLoS Comput Biol. 2020 Nov 6;16(11):e1007450. doi: 10.1371/journal.pcbi.1007450. eCollection 2020 Nov.
6
FAIRifying Clinical Studies Metadata: A Registry for the Biomedical Research.临床研究元数据的 FAIR 化:生物医学研究的注册中心。
Stud Health Technol Inform. 2021 May 27;281:779-783. doi: 10.3233/SHTI210281.
7
Improving the Utility of the Tox21 Dataset by Deep Metadata Annotations and Constructing Reusable Benchmarked Chemical Reference Signatures.通过深度元数据注释和构建可重复使用的基准化学参考特征来提高 Tox21 数据集的实用性。
Molecules. 2019 Apr 23;24(8):1604. doi: 10.3390/molecules24081604.
8
Sustainable data and metadata management at the BD2K-LINCS Data Coordination and Integration Center.BD2K-LINCS 数据协调与整合中心的数据和元数据的可持续管理。
Sci Data. 2018 Jun 19;5:180117. doi: 10.1038/sdata.2018.117.
9
The RD-Connect Registry & Biobank Finder: a tool for sharing aggregated data and metadata among rare disease researchers.RD-Connect 注册中心和生物样本库查找器:一个用于在罕见病研究人员之间共享汇总数据和元数据的工具。
Eur J Hum Genet. 2018 May;26(5):631-643. doi: 10.1038/s41431-017-0085-z. Epub 2018 Feb 2.
10
Development of an information retrieval tool for biomedical patents.生物医学专利信息检索工具的开发。
Comput Methods Programs Biomed. 2018 Jun;159:125-134. doi: 10.1016/j.cmpb.2018.03.012. Epub 2018 Mar 14.

引用本文的文献

1
Systematic Review and Meta-Analysis of Risk Factors for Dehydration and the Development of a Predictive Scoring System.脱水危险因素的系统评价与Meta分析及预测评分系统的开发
Healthcare (Basel). 2025 Aug 12;13(16):1974. doi: 10.3390/healthcare13161974.
2
Initiatives, Concepts, and Implementation Practices of the Findable, Accessible, Interoperable, and Reusable Data Principles in Health Data Stewardship: Scoping Review.健康数据治理中可发现性、可访问性、互操作性和可重用性数据原则的举措、概念和实施实践:范围综述。
J Med Internet Res. 2023 Aug 28;25:e45013. doi: 10.2196/45013.
3
Effects of Probiotic Supplementation on Exercise and the Underlying Mechanisms.

本文引用的文献

1
Auto-generated materials database of Curie and Néel temperatures via semi-supervised relationship extraction.通过半监督关系抽取技术生成居里温度和奈尔温度的自动材料数据库。
Sci Data. 2018 Jun 19;5:180111. doi: 10.1038/sdata.2018.111.
2
Immune-centric network of cytokines and cells in disease context identified by computational mining of PubMed.通过对 PubMed 进行计算挖掘,确定了疾病背景下细胞因子和细胞的免疫中心网络。
Nat Biotechnol. 2018 Aug;36(7):651-659. doi: 10.1038/nbt.4152. Epub 2018 Jun 18.
3
PubMed Phrases, an open set of coherent phrases for searching biomedical literature.
补充益生菌对运动的影响及其潜在机制。
Foods. 2023 Apr 25;12(9):1787. doi: 10.3390/foods12091787.
4
The h-ANN Model: Comprehensive Colonoscopy Concept Compilation Using Combined Contextual Embeddings.h-ANN模型:使用组合上下文嵌入的结肠镜检查综合概念汇编。
Biomed Eng Syst Technol Int Jt Conf BIOSTEC Revis Sel Pap. 2022 Feb;5:189-200. doi: 10.5220/0010903300003123.
5
Bacteriocins: Properties and potential use as antimicrobials.细菌素:特性及作为抗菌剂的潜在用途。
J Clin Lab Anal. 2022 Jan;36(1):e24093. doi: 10.1002/jcla.24093. Epub 2021 Dec 1.
6
Furin and the adaptive mutation of SARS-COV2: a computational framework.弗林蛋白酶与新型冠状病毒2的适应性突变:一个计算框架
Model Earth Syst Environ. 2022;8(2):2827-2836. doi: 10.1007/s40808-021-01260-y. Epub 2021 Aug 26.
7
Chiropractic case reports: a review and bibliometric analysis.整脊病例报告:回顾与文献计量分析。
Chiropr Man Therap. 2021 Apr 28;29(1):17. doi: 10.1186/s12998-021-00374-5.
8
Cardiovascular informatics: building a bridge to data harmony.心血管信息学:构建通向数据和谐的桥梁。
Cardiovasc Res. 2022 Feb 21;118(3):732-745. doi: 10.1093/cvr/cvab067.
9
A Second Look at FAIR in Proteomic Investigations.重新审视蛋白质组学研究中的 FAIR 原则。
J Proteome Res. 2021 May 7;20(5):2182-2186. doi: 10.1021/acs.jproteome.1c00177. Epub 2021 Mar 13.
10
TAZ encodes tafazzin, a transacylase essential for cardiolipin formation and central to the etiology of Barth syndrome.TAZ 编码tafazzin,这是一种转酰基酶,对于心磷脂的形成至关重要,也是 Barth 综合征发病机制的核心。
Gene. 2020 Feb 5;726:144148. doi: 10.1016/j.gene.2019.144148. Epub 2019 Oct 21.
PubMed 词组,一组用于搜索生物医学文献的开放式连贯词组。
Sci Data. 2018 Jun 12;5:180104. doi: 10.1038/sdata.2018.104.
4
Identifying Suicide Ideation and Suicidal Attempts in a Psychiatric Clinical Research Database using Natural Language Processing.使用自然语言处理技术在精神科临床研究数据库中识别自杀意念和自杀企图。
Sci Rep. 2018 May 9;8(1):7426. doi: 10.1038/s41598-018-25773-2.
5
SciRide Finder: a citation-based paradigm in biomedical literature search.SciRide 查找器:基于引文的生物医学文献搜索范例。
Sci Rep. 2018 Apr 18;8(1):6193. doi: 10.1038/s41598-018-24571-0.
6
Opportunities and obstacles for deep learning in biology and medicine.深度学习在生物学和医学中的机遇与挑战。
J R Soc Interface. 2018 Apr;15(141). doi: 10.1098/rsif.2017.0387.
7
A dataset of 200 structured product labels annotated for adverse drug reactions.一个标注了 200 个结构产品标签的药物不良反应数据集。
Sci Data. 2018 Jan 30;5:180001. doi: 10.1038/sdata.2018.1.
8
CLAMP - a toolkit for efficiently building customized clinical natural language processing pipelines.CLAMP - 一个用于高效构建定制化临床自然语言处理管道的工具包。
J Am Med Inform Assoc. 2018 Mar 1;25(3):331-336. doi: 10.1093/jamia/ocx132.
9
The Reactome Pathway Knowledgebase.Reactome 通路知识库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D649-D655. doi: 10.1093/nar/gkx1132.
10
HMDB 4.0: the human metabolome database for 2018.HMDB 4.0:2018 年人类代谢组数据库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D608-D617. doi: 10.1093/nar/gkx1089.