• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基因致病性工具“GenePy”在英国十万人基因组计划中识别出被遗漏的双等位基因诊断结果。

A gene pathogenicity tool 'GenePy' identifies missed biallelic diagnoses in the 100,000 Genomes Project.

作者信息

Seaby Eleanor G, Leggatt Gary, Cheng Guo, Thomas N Simon, Ashton James J, Stafford Imogen, Baralle Diana, Rehm Heidi L, O'Donnell-Luria Anne, Ennis Sarah

机构信息

Human Development and Health, Faculty of Medicine, University Hospital Southampton, Southampton, Hampshire, SO16 6YD, UK.

Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.

出版信息

medRxiv. 2023 Mar 30:2023.03.21.23287545. doi: 10.1101/2023.03.21.23287545.

DOI:10.1101/2023.03.21.23287545
PMID:37034701
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10081430/
Abstract

The 100,000 Genomes Project (100KGP) diagnosed a quarter of recruited affected participants, but 26% of diagnoses were in genes not on the chosen gene panel(s); with many being variants of high impact. However, assessing biallelic variants without a gene panel is challenging, due to the number of variants requiring scrutiny. We sought to identify potential missed biallelic diagnoses independent of the gene panel applied using GenePy - a whole gene pathogenicity metric. GenePy scores all variants called in a given individual, incorporating allele frequency, zygosity, and a user-defined deleterious metric (CADD v1.6 applied herein). GenePy then combines all variant scores for individual genes, generating an aggregate score per gene, per participant. We calculated GenePy scores for 2862 recessive disease genes in 78,216 individuals in 100KGP. For each gene, we ranked participant GenePy scores for that gene, and scrutinised affected individuals without a diagnosis whose scores ranked amongst the top-5 for each gene. We assessed these participants' phenotypes for overlap with the disease gene associated phenotype for which they were highly ranked. Where phenotypes overlapped, we extracted rare variants in the gene of interest and applied phase, ClinVar and ACMG classification looking for putative causal biallelic variants. 3184 affected individuals without a molecular diagnosis had a top-5 ranked GenePy gene score and 682/3184 (21%) had phenotypes overlapping with one of the top-ranking genes. After removing 13 withdrawn participants, in 122/669 (18%) of the phenotype-matched cases, we identified a putative missed diagnosis in a top-ranked gene supported by phasing, ClinVar and ACMG classification. A further 334/669 (50%) of cases have a possible missed diagnosis but require functional validation. Applying GenePy at scale has identified potential diagnoses for 456/3183 (14%) of undiagnosed participants who had a top-5 ranked GenePy score in a recessive disease gene, whilst adding only 1.2 additional variants (per individual) for assessment.

摘要

“十万人基因组计划”(100KGP)诊断出了四分之一的招募到的患病参与者,但26%的诊断结果是在所选基因panel之外的基因中;其中许多是具有高影响的变异。然而,由于需要仔细审查的变异数量众多,在没有基因panel的情况下评估双等位基因变异具有挑战性。我们试图使用GenePy(一种全基因致病性指标)来识别与所应用的基因panel无关的潜在遗漏双等位基因诊断。GenePy对给定个体中检测到的所有变异进行评分,纳入等位基因频率、纯合度和用户定义的有害性指标(本文应用CADD v1.6)。然后,GenePy将单个基因的所有变异分数进行合并,为每个基因、每个参与者生成一个综合分数。我们计算了100KGP中78216名个体中2862个隐性疾病基因的GenePy分数。对于每个基因,我们对该基因的参与者GenePy分数进行排名,并仔细审查那些在每个基因中分数排名前5但未得到诊断的患病个体。我们评估了这些参与者的表型与他们排名靠前的疾病基因相关表型的重叠情况。当表型重叠时,我们提取感兴趣基因中的罕见变异,并应用相位分析、ClinVar和ACMG分类来寻找假定的致病双等位基因变异。3184名未得到分子诊断的患病个体具有排名前5的GenePy基因分数,其中682/3184(21%)的表型与排名靠前的基因之一重叠。在剔除13名退出的参与者后,在122/669(18%)的表型匹配病例中,我们在排名靠前的基因中识别出了一个得到相位分析、ClinVar和ACMG分类支持的假定遗漏诊断。另外334/669(50%)的病例可能存在遗漏诊断,但需要功能验证。大规模应用GenePy为456/3183(14%)在隐性疾病基因中具有排名前5的GenePy分数的未诊断参与者识别出了潜在诊断,同时每个个体仅增加1.2个额外变异用于评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5572/10081430/cb59b3eb9ef1/nihpp-2023.03.21.23287545v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5572/10081430/ee27aba1195a/nihpp-2023.03.21.23287545v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5572/10081430/b1df225162ac/nihpp-2023.03.21.23287545v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5572/10081430/cb59b3eb9ef1/nihpp-2023.03.21.23287545v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5572/10081430/ee27aba1195a/nihpp-2023.03.21.23287545v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5572/10081430/b1df225162ac/nihpp-2023.03.21.23287545v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5572/10081430/cb59b3eb9ef1/nihpp-2023.03.21.23287545v2-f0003.jpg

相似文献

1
A gene pathogenicity tool 'GenePy' identifies missed biallelic diagnoses in the 100,000 Genomes Project.一种基因致病性工具“GenePy”在英国十万人基因组计划中识别出被遗漏的双等位基因诊断结果。
medRxiv. 2023 Mar 30:2023.03.21.23287545. doi: 10.1101/2023.03.21.23287545.
2
A gene pathogenicity tool "GenePy" identifies missed biallelic diagnoses in the 100,000 Genomes Project.一种基因致病性工具“GenePy”在 10 万基因组计划中发现了遗漏的双等位基因诊断。
Genet Med. 2024 Apr;26(4):101073. doi: 10.1016/j.gim.2024.101073. Epub 2024 Jan 18.
3
GenePy - a score for estimating gene pathogenicity in individuals using next-generation sequencing data.GenePy - 一种使用下一代测序数据评估个体基因致病性的评分。
BMC Bioinformatics. 2019 May 16;20(1):254. doi: 10.1186/s12859-019-2877-3.
4
Targeting de novo loss-of-function variants in constrained disease genes improves diagnostic rates in the 100,000 Genomes Project.针对受限疾病基因中的新生功能丧失变异可提高 10 万基因组计划中的诊断率。
Hum Genet. 2023 Mar;142(3):351-362. doi: 10.1007/s00439-022-02509-x. Epub 2022 Dec 7.
5
Critical assessment of variant prioritization methods for rare disease diagnosis within the rare genomes project.对罕见基因组项目中罕见病诊断的变异优先级方法的批判性评估。
Hum Genomics. 2024 Apr 29;18(1):44. doi: 10.1186/s40246-024-00604-w.
6
Use of genome sequencing to hunt for cryptic second-hit variants: analysis of 31 cases recruited to the 100 000 Genomes Project.利用基因组测序寻找隐匿性二次打击变异:对参与 10 万基因组计划的 31 例病例的分析。
J Med Genet. 2023 Nov 27;60(12):1235-1244. doi: 10.1136/jmg-2023-109362.
7
A flexible computational pipeline for research analyses of unsolved clinical exome cases.一种用于未解决临床外显子病例研究分析的灵活计算流程。
NPJ Genom Med. 2020 Dec 10;5(1):54. doi: 10.1038/s41525-020-00161-w.
8
Good performance of the criteria of American College of Medical Genetics and Genomics/Association for Molecular Pathology in prediction of pathogenicity of genetic variants causing thoracic aortic aneurysms and dissections.美国医学遗传学与基因组学学院/分子病理学协会标准在预测导致胸主动脉瘤和夹层的遗传变异致病性方面表现良好。
J Transl Med. 2022 Jan 25;20(1):42. doi: 10.1186/s12967-022-03251-8.
9
A Panel-Agnostic Strategy 'HiPPo' Improves Diagnostic Efficiency in the UK Genomic Medicine Service.一种与面板无关的策略“HiPPo”提高了英国基因组医学服务中的诊断效率。
Healthcare (Basel). 2023 Dec 15;11(24):3179. doi: 10.3390/healthcare11243179.
10
PSAP-Genomic-Regions: A Method Leveraging Population Data to Prioritize Coding and Non-Coding Variants in Whole Genome Sequencing for Rare Disease Diagnosis.PSAP基因组区域:一种利用群体数据对全基因组测序中的编码和非编码变异进行优先级排序以用于罕见病诊断的方法。
Genet Epidemiol. 2025 Jan;49(1):e22593. doi: 10.1002/gepi.22593. Epub 2024 Sep 24.