Suppr超能文献

一种由基因型和表型驱动的基于人工智能的方法,以提高遗传疾病的诊断率。

An AI-based approach driven by genotypes and phenotypes to uplift the diagnostic yield of genetic diseases.

作者信息

Zucca S, Nicora G, De Paoli F, Carta M G, Bellazzi R, Magni P, Rizzo E, Limongelli I

机构信息

enGenome Srl, 27100, Pavia, Italy.

Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy.

出版信息

Hum Genet. 2025 Mar;144(2-3):159-171. doi: 10.1007/s00439-023-02638-x. Epub 2024 Mar 23.

Abstract

Identifying disease-causing variants in Rare Disease patients' genome is a challenging problem. To accomplish this task, we describe a machine learning framework, that we called "Suggested Diagnosis", whose aim is to prioritize genetic variants in an exome/genome based on the probability of being disease-causing. To do so, our method leverages standard guidelines for germline variant interpretation as defined by the American College of Human Genomics (ACMG) and the Association for Molecular Pathology (AMP), inheritance information, phenotypic similarity, and variant quality. Starting from (1) the VCF file containing proband's variants, (2) the list of proband's phenotypes encoded in Human Phenotype Ontology terms, and optionally (3) the information about family members (if available), the "Suggested Diagnosis" ranks all the variants according to their machine learning prediction. This method significantly reduces the number of variants that need to be evaluated by geneticists by pinpointing causative variants in the very first positions of the prioritized list. Most importantly, our approach proved to be among the top performers within the CAGI6 Rare Genome Project Challenge, where it was able to rank the true causative variant among the first positions and, uniquely among all the challenge participants, increased the diagnostic yield of 12.5% by solving 2 undiagnosed cases.

摘要

识别罕见病患者基因组中的致病变异是一个具有挑战性的问题。为完成这项任务,我们描述了一个机器学习框架,我们称之为“建议诊断”,其目的是根据致病可能性对全外显子组/基因组中的基因变异进行优先级排序。为此,我们的方法利用了美国人类基因组学学院(ACMG)和分子病理学协会(AMP)定义的种系变异解释标准指南、遗传信息、表型相似性和变异质量。从(1)包含先证者变异的VCF文件、(2)用人类表型本体术语编码的先证者表型列表,以及可选的(3)家庭成员信息(如果可用)开始,“建议诊断”根据机器学习预测对所有变异进行排序。这种方法通过在优先级列表的首位精确找出致病变异,显著减少了遗传学家需要评估的变异数量。最重要的是,我们的方法在CAGI6罕见基因组计划挑战赛中被证明是表现最佳的方法之一,它能够将真正的致病变异排在首位,并且在所有挑战赛参与者中独树一帜,通过解决2例未确诊病例,将诊断率提高了12.5%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8c4/11976766/e873aa7c63eb/439_2023_2638_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验