• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探索罕见病及其致病基因发现的资源。

A resource to explore the discovery of rare diseases and their causative genes.

机构信息

Department of Bioinformatics - BiGCaT, NUTRIM School of Nutrition and Translational Research in Metabolism, Maastricht University, Maastricht, The Netherlands.

Governor Kremers Centre - Rett Expertise Centre, Maastricht University Medical Center, Maastricht, The Netherlands.

出版信息

Sci Data. 2021 May 4;8(1):124. doi: 10.1038/s41597-021-00905-y.

DOI:10.1038/s41597-021-00905-y
PMID:33947870
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8096966/
Abstract

Here, we describe a dataset with information about monogenic, rare diseases with a known genetic background, supplemented with manually extracted provenance for the disease itself and the discovery of the underlying genetic cause. We assembled a collection of 4166 rare monogenic diseases and linked them to 3163 causative genes, annotated with OMIM and Ensembl identifiers and HGNC symbols. The PubMed identifiers of the scientific publications, which for the first time described the rare diseases, and the publications, which found the genes causing the diseases were added using information from OMIM, PubMed, Wikipedia, whonamedit.com, and Google Scholar. The data are available under CC0 license as spreadsheet and as RDF in a semantic model modified from DisGeNET, and was added to Wikidata. This dataset relies on publicly available data and publications with a PubMed identifier, but by our effort to make the data interoperable and linked, we can now analyse this data. Our analysis revealed the timeline of rare disease and causative gene discovery and links them to developments in methods.

摘要

在这里,我们描述了一个数据集,其中包含有关具有已知遗传背景的单基因、罕见疾病的信息,并补充了有关疾病本身和潜在遗传原因发现的手动提取来源。我们收集了 4166 种罕见的单基因疾病,并将它们与 3163 个致病基因联系起来,这些基因都被标注了 OMIM 和 Ensembl 标识符以及 HGNC 符号。首次描述这些罕见疾病的科学出版物的 PubMed 标识符以及发现导致这些疾病的基因的出版物,是通过从 OMIM、PubMed、维基百科、whonamedit.com 和 Google Scholar 中获取信息添加的。这些数据在 CC0 许可下以电子表格的形式提供,以从 DisGeNET 修改后的语义模型中的 RDF 形式提供,并已添加到 Wikidata 中。该数据集依赖于具有 PubMed 标识符的公开可用数据和出版物,但通过我们努力使数据具有互操作性和链接性,我们现在可以分析这些数据。我们的分析揭示了罕见疾病和致病基因发现的时间线,并将它们与方法的发展联系起来。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1e9/8096966/05c9c76a3b84/41597_2021_905_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1e9/8096966/3ff06c5a7739/41597_2021_905_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1e9/8096966/11ff3b1db0f6/41597_2021_905_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1e9/8096966/05c9c76a3b84/41597_2021_905_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1e9/8096966/3ff06c5a7739/41597_2021_905_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1e9/8096966/11ff3b1db0f6/41597_2021_905_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1e9/8096966/05c9c76a3b84/41597_2021_905_Fig3_HTML.jpg

相似文献

1
A resource to explore the discovery of rare diseases and their causative genes.探索罕见病及其致病基因发现的资源。
Sci Data. 2021 May 4;8(1):124. doi: 10.1038/s41597-021-00905-y.
2
RDFIO: extending Semantic MediaWiki for interoperable biomedical data management.RDFIO:扩展语义媒体维基以实现可互操作的生物医学数据管理。
J Biomed Semantics. 2017 Sep 4;8(1):35. doi: 10.1186/s13326-017-0136-y.
3
Rare diseases knowledge management: the contribution of proximity measurements in OntoOrpha and OMIM.罕见病知识管理:OntoOrpha和OMIM中邻近度测量的贡献。
Stud Health Technol Inform. 2012;180:88-92.
4
PhenoDis: a comprehensive database for phenotypic characterization of rare cardiac diseases.PhenoDis:一个用于罕见心脏疾病表型特征描述的综合数据库。
Orphanet J Rare Dis. 2018 Jan 25;13(1):22. doi: 10.1186/s13023-018-0765-y.
5
FORGE Canada Consortium: outcomes of a 2-year national rare-disease gene-discovery project.加拿大 FORGE 联盟:一项为期两年的全国罕见病基因发现项目的结果。
Am J Hum Genet. 2014 Jun 5;94(6):809-17. doi: 10.1016/j.ajhg.2014.05.003.
6
DisGeNET-RDF: harnessing the innovative power of the Semantic Web to explore the genetic basis of diseases.DisGeNET-RDF:利用语义网的创新力量探索疾病的遗传基础。
Bioinformatics. 2016 Jul 15;32(14):2236-8. doi: 10.1093/bioinformatics/btw214. Epub 2016 Apr 22.
7
An innovative portal for rare genetic diseases research: the semantic Diseasecard.用于罕见遗传疾病研究的创新门户:语义疾病卡片。
J Biomed Inform. 2013 Dec;46(6):1108-15. doi: 10.1016/j.jbi.2013.08.006. Epub 2013 Aug 21.
8
The unsolved rare genetic disease atlas? An analysis of the unexplained phenotypic descriptions in OMIM®.未解的罕见遗传病图谱?对 OMIM® 中未解释的表型描述的分析。
Am J Med Genet C Semin Med Genet. 2018 Dec;178(4):458-463. doi: 10.1002/ajmg.c.31662.
9
The expanding spectrum of rare monogenic autoinflammatory diseases.罕见单基因自身炎症性疾病谱的不断扩大。
Orphanet J Rare Dis. 2013 Oct 16;8:162. doi: 10.1186/1750-1172-8-162.
10
DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants.DisGeNET:一个整合人类疾病相关基因和变异信息的综合平台。
Nucleic Acids Res. 2017 Jan 4;45(D1):D833-D839. doi: 10.1093/nar/gkw943. Epub 2016 Oct 19.

引用本文的文献

1
Domain-specific embeddings uncover latent genetics knowledge.特定领域的嵌入揭示潜在的遗传学知识。
bioRxiv. 2025 Mar 19:2025.03.17.643817. doi: 10.1101/2025.03.17.643817.
2
Quantifying hope: an EU perspective of rare disease therapeutic space and market dynamics.量化希望:欧盟对罕见病治疗领域及市场动态的视角
Front Public Health. 2025 Feb 3;13:1520467. doi: 10.3389/fpubh.2025.1520467. eCollection 2025.
3
Leaving no patient behind! Expert recommendation in the use of innovative technologies for diagnosing rare diseases.

本文引用的文献

1
A framework for the investigation of rare genetic disorders in neuropsychiatry.神经精神医学中罕见遗传疾病研究的框架。
Nat Med. 2019 Oct;25(10):1477-1487. doi: 10.1038/s41591-019-0581-5. Epub 2019 Sep 23.
2
CyTargetLinker app update: A flexible solution for network extension in Cytoscape.CyTargetLinker应用程序更新:一种用于Cytoscape中网络扩展的灵活解决方案。
F1000Res. 2018 Jun 14;7. doi: 10.12688/f1000research.14613.2. eCollection 2018.
3
Beyond Pathway Analysis: Identification of Active Subnetworks in Rett Syndrome.超越通路分析:雷特综合征中活性子网的识别
一个都不能少!诊断罕见病的创新技术应用专家建议。
Orphanet J Rare Dis. 2024 Sep 27;19(1):357. doi: 10.1186/s13023-024-03361-0.
4
PSAP-Genomic-Regions: A Method Leveraging Population Data to Prioritize Coding and Non-Coding Variants in Whole Genome Sequencing for Rare Disease Diagnosis.PSAP基因组区域:一种利用群体数据对全基因组测序中的编码和非编码变异进行优先级排序以用于罕见病诊断的方法。
Genet Epidemiol. 2025 Jan;49(1):e22593. doi: 10.1002/gepi.22593. Epub 2024 Sep 24.
5
Rapid Whole-Genome Sequencing and Clinical Management in the PICU: A Multicenter Cohort, 2016-2023.快速全基因组测序与 PICU 的临床管理:一项多中心队列研究,2016-2023 年。
Pediatr Crit Care Med. 2024 Aug 1;25(8):699-709. doi: 10.1097/PCC.0000000000003522. Epub 2024 Apr 26.
6
Heterozygous loss-of-function variants in DOCK4 cause neurodevelopmental delay and microcephaly.DOCK4基因的杂合功能丧失变异导致神经发育迟缓及小头畸形。
Hum Genet. 2024 Mar;143(3):455-469. doi: 10.1007/s00439-024-02655-4. Epub 2024 Mar 25.
7
GPAD: a natural language processing-based application to extract the gene-disease association discovery information from OMIM.GPAD:一种基于自然语言处理的应用程序,用于从 OMIM 中提取基因-疾病关联发现信息。
BMC Bioinformatics. 2024 Feb 27;25(1):84. doi: 10.1186/s12859-024-05693-x.
8
Potential of Artificial Intelligence to Accelerate Drug Development for Rare Diseases.人工智能加速罕见病药物研发的潜力。
Pharmaceut Med. 2024 Mar;38(2):79-86. doi: 10.1007/s40290-023-00504-9. Epub 2024 Feb 5.
9
The Dangers of Acetaminophen for Neurodevelopment Outweigh Scant Evidence for Long-Term Benefits.对乙酰氨基酚对神经发育的危害超过了其长期益处的微弱证据。
Children (Basel). 2023 Dec 29;11(1):44. doi: 10.3390/children11010044.
10
Methylation analysis by targeted bisulfite sequencing in large for gestational age (LGA) newborns: the LARGAN cohort.通过针对大于胎龄儿(LGA)新生儿的靶向亚硫酸氢盐测序进行甲基化分析:LARGAN 队列。
Clin Epigenetics. 2023 Dec 13;15(1):191. doi: 10.1186/s13148-023-01612-8.
Front Genet. 2019 Feb 21;10:59. doi: 10.3389/fgene.2019.00059. eCollection 2019.
4
MECP2 variation in Rett syndrome-An overview of current coverage of genetic and phenotype data within existing databases.MECP2 变异在雷特综合征中的作用——现有数据库中遗传和表型数据的现有覆盖范围概述。
Hum Mutat. 2018 Jul;39(7):914-924. doi: 10.1002/humu.23542. Epub 2018 May 21.
5
From SNPs to pathways: Biological interpretation of type 2 diabetes (T2DM) genome wide association study (GWAS) results.从单核苷酸多态性到通路:2 型糖尿病(T2DM)全基因组关联研究(GWAS)结果的生物学解读。
PLoS One. 2018 Apr 4;13(4):e0193515. doi: 10.1371/journal.pone.0193515. eCollection 2018.
6
Ensembl 2018.Ensembl 2018.
Nucleic Acids Res. 2018 Jan 4;46(D1):D754-D761. doi: 10.1093/nar/gkx1098.
7
DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants.DisGeNET:一个整合人类疾病相关基因和变异信息的综合平台。
Nucleic Acids Res. 2017 Jan 4;45(D1):D833-D839. doi: 10.1093/nar/gkw943. Epub 2016 Oct 19.
8
Genenames.org: the HGNC and VGNC resources in 2017.Genenames.org:2017年的HGNC和VGNC资源。
Nucleic Acids Res. 2017 Jan 4;45(D1):D619-D625. doi: 10.1093/nar/gkw1033. Epub 2016 Oct 30.
9
Transcriptome analysis of human brain tissue identifies reduced expression of complement complex C1Q Genes in Rett syndrome.人类脑组织的转录组分析表明,雷特综合征中补体复合物C1Q基因的表达降低。
BMC Genomics. 2016 Jun 6;17:427. doi: 10.1186/s12864-016-2746-7.
10
Wikidata as a semantic framework for the Gene Wiki initiative.维基数据作为基因维基计划的语义框架。
Database (Oxford). 2016 Mar 17;2016. doi: 10.1093/database/baw015. Print 2016.