• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从文献中挖掘罕见病变异体:以肌酸转运缺陷为重点评估差距。

Rare disease variant curation from literature: assessing gaps with creatine transport deficiency in focus.

机构信息

Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research, Frederick, MD, 21702, USA.

Division of Preclinical Innovation, Therapeutic Development Branch, Therapeutics for Rare and Neglected Diseases (TRND) Program, National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, MD, 20892, USA.

出版信息

BMC Genomics. 2023 Aug 16;24(1):460. doi: 10.1186/s12864-023-09561-5.

DOI:10.1186/s12864-023-09561-5
PMID:37587458
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10433598/
Abstract

BACKGROUND

Approximately 4-8% of the world suffers from a rare disease. Rare diseases are often difficult to diagnose, and many do not have approved therapies. Genetic sequencing has the potential to shorten the current diagnostic process, increase mechanistic understanding, and facilitate research on therapeutic approaches but is limited by the difficulty of novel variant pathogenicity interpretation and the communication of known causative variants. It is unknown how many published rare disease variants are currently accessible in the public domain.

RESULTS

This study investigated the translation of knowledge of variants reported in published manuscripts to publicly accessible variant databases. Variants, symptoms, biochemical assay results, and protein function from literature on the SLC6A8 gene associated with X-linked Creatine Transporter Deficiency (CTD) were curated and reported as a highly annotated dataset of variants with clinical context and functional details. Variants were harmonized, their availability in existing variant databases was analyzed and pathogenicity assignments were compared with impact algorithm predictions. 24% of the pathogenic variants found in PubMed articles were not captured in any database used in this analysis while only 65% of the published variants received an accurate pathogenicity prediction from at least one impact prediction algorithm.

CONCLUSIONS

Despite being published in the literature, pathogenicity data on patient variants may remain inaccessible for genetic diagnosis, therapeutic target identification, mechanistic understanding, or hypothesis generation. Clinical and functional details presented in the literature are important to make pathogenicity assessments. Impact predictions remain imperfect but are improving, especially for single nucleotide exonic variants, however such predictions are less accurate or unavailable for intronic and multi-nucleotide variants. Developing text mining workflows that use natural language processing for identifying diseases, genes and variants, along with impact prediction algorithms and integrating with details on clinical phenotypes and functional assessments might be a promising approach to scale literature mining of variants and assigning correct pathogenicity. The curated variants list created by this effort includes context details to improve any such efforts on variant curation for rare diseases.

摘要

背景

大约有 4-8%的世界人口患有罕见病。罕见病通常难以诊断,并且许多疾病没有批准的治疗方法。基因测序有可能缩短当前的诊断过程,增加对发病机制的理解,并促进治疗方法的研究,但受到新型变异致病性解释的困难以及已知致病变异的传播的限制。目前尚不清楚有多少已发表的罕见病变异可在公共领域获得。

结果

本研究调查了将发表的手稿中报告的变异知识转化为可公开访问的变异数据库的情况。对与 X 连锁肌酸转运蛋白缺乏症(CTD)相关的 SLC6A8 基因的文献中的变异、症状、生化测定结果和蛋白质功能进行了整理,并以具有临床背景和功能细节的高度注释的变异数据集进行了报道。对变异进行了协调,分析了它们在现有变异数据库中的可用性,并将致病性预测与影响算法预测进行了比较。在 PubMed 文章中发现的致病性变异中有 24%未在本分析中使用的任何数据库中捕获,而只有 65%的已发表变异至少从一种影响预测算法中获得了准确的致病性预测。

结论

尽管已在文献中发表,但患者变异的致病性数据可能仍无法用于遗传诊断、治疗靶点识别、发病机制理解或假说生成。文献中提供的临床和功能细节对于进行致病性评估很重要。影响预测仍然不完美,但正在改进,尤其是对于单核苷酸外显子变异,但对于内含子和多核苷酸变异,此类预测的准确性较低或不可用。开发使用自然语言处理识别疾病、基因和变异的文本挖掘工作流程,以及影响预测算法,并结合临床表型和功能评估的详细信息,可能是一种有前途的方法,可以大规模进行文献挖掘变异并正确分配致病性。本研究创建的经过整理的变异列表包含上下文详细信息,可用于改进罕见病变异的整理工作。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/c8df3dbeb04f/12864_2023_9561_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/668165aa66f4/12864_2023_9561_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/263bf9b5ec44/12864_2023_9561_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/0dc57ab96ffb/12864_2023_9561_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/ffc6db5fbd45/12864_2023_9561_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/25ec811e5c89/12864_2023_9561_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/a5fe831e5d26/12864_2023_9561_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/c8df3dbeb04f/12864_2023_9561_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/668165aa66f4/12864_2023_9561_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/263bf9b5ec44/12864_2023_9561_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/0dc57ab96ffb/12864_2023_9561_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/ffc6db5fbd45/12864_2023_9561_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/25ec811e5c89/12864_2023_9561_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/a5fe831e5d26/12864_2023_9561_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e43/10433598/c8df3dbeb04f/12864_2023_9561_Fig2_HTML.jpg

相似文献

1
Rare disease variant curation from literature: assessing gaps with creatine transport deficiency in focus.从文献中挖掘罕见病变异体:以肌酸转运缺陷为重点评估差距。
BMC Genomics. 2023 Aug 16;24(1):460. doi: 10.1186/s12864-023-09561-5.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Cohort-driven variant burden analysis and pathogenicity identification in monogenic autoinflammatory disorders.队列驱动的单基因自身炎症性疾病变异负担分析和致病性鉴定。
J Allergy Clin Immunol. 2023 Aug;152(2):517-527. doi: 10.1016/j.jaci.2023.03.028. Epub 2023 Apr 7.
4
Spectrum of DDC variants causing aromatic l-amino acid decarboxylase (AADC) deficiency and pathogenicity interpretation using ACMG-AMP/ACGS recommendations.导致芳香族 l-氨基酸脱羧酶(AADC)缺乏的 DDC 变异体谱及使用 ACMG-AMP/ACGS 推荐的致病性解读。
Mol Genet Metab. 2022 Dec;137(4):359-381. doi: 10.1016/j.ymgme.2022.11.003. Epub 2022 Nov 12.
5
Characterization of novel SLC6A8 variants with the use of splice-site analysis tools and implementation of a newly developed LOVD database.使用剪接位点分析工具对新型 SLC6A8 变体进行鉴定,并实施新开发的 LOVD 数据库。
Eur J Hum Genet. 2011 Jan;19(1):56-63. doi: 10.1038/ejhg.2010.134. Epub 2010 Aug 18.
6
Estimated carrier frequency of creatine transporter deficiency in females in the general population using functional characterization of novel missense variants in the SLC6A8 gene.利用SLC6A8基因中新错义变异的功能特征估计普通人群中女性肌酸转运体缺乏症的携带者频率。
Gene. 2015 Jul 10;565(2):187-91. doi: 10.1016/j.gene.2015.04.011. Epub 2015 Apr 8.
7
Explicable prioritization of genetic variants by integration of rule-based and machine learning algorithms for diagnosis of rare Mendelian disorders.基于规则和机器学习算法的遗传变异可解释优先级排序,用于罕见孟德尔疾病的诊断。
Hum Genomics. 2024 Mar 21;18(1):28. doi: 10.1186/s40246-024-00595-8.
8
X-linked creatine transporter (SLC6A8) deficiency in females: Difficult to recognize, but a potentially treatable disease.女性X连锁肌酸转运体(SLC6A8)缺乏症:难以识别,但可能是一种可治疗的疾病。
Mol Genet Metab. 2023 Nov;140(3):107694. doi: 10.1016/j.ymgme.2023.107694. Epub 2023 Aug 30.
9
tmVar 2.0: integrating genomic variant information from literature with dbSNP and ClinVar for precision medicine.tmVar 2.0:整合文献中的基因组变异信息与 dbSNP 和 ClinVar,以用于精准医学。
Bioinformatics. 2018 Jan 1;34(1):80-87. doi: 10.1093/bioinformatics/btx541.
10
Diagnosis and Treatment of X-Linked Creatine Transporter Deficiency: Case Report and Literature Review.X连锁肌酸转运体缺乏症的诊断与治疗:病例报告及文献综述
Brain Sci. 2023 Sep 28;13(10):1382. doi: 10.3390/brainsci13101382.

引用本文的文献

1
Comparison of literature mining tools for variant classification: Through the lens of 50 RYR1 variants.比较用于变异分类的文献挖掘工具:以 50 个 RYR1 变异体为例。
Genet Med. 2024 Apr;26(4):101083. doi: 10.1016/j.gim.2024.101083. Epub 2024 Jan 26.