Ted Rogers Centre for Heart Research, Cardiac Genome Clinic, The Hospital for Sick Children, Toronto, ON, Canada.
Department of Electrical Engineering and Computer Science, York University, Toronto, ON, Canada.
BMC Med Genomics. 2022 Feb 18;15(1):31. doi: 10.1186/s12920-022-01166-3.
Variant interpretation is the main bottleneck in medical genomic sequencing efforts. This usually involves genome analysts manually searching through a multitude of independent databases, often with the aid of several, mostly independent, computational tools. To streamline variant interpretation, we developed the GeneTerpret platform which collates data from current interpretation tools and databases, and applies a phenotype-driven query to categorize the variants identified in the genome(s). The platform assigns quantitative validity scores to genes by query and assembly of the genotype-phenotype data, sequence homology, molecular interactions, expression data, and animal models. It also uses the American College of Medical Genetics and Genomics (ACMG) criteria to categorize variants into five tiers of pathogenicity. The final output is a prioritized list of potentially causal variants/genes.
We tested GeneTerpret by comparing its performance to expert-curated genes (ClinGen's gene-validity database) and variant pathogenicity reports (DECIPHER database). Output from GeneTerpret was 97.2% and 83.5% concordant with the expert-curated sources, respectively. Additionally, similar concordance was observed when GeneTerpret's performance was compared with our internal expert-interpreted clinical datasets.
GeneTerpret is a flexible platform designed to streamline the genome interpretation process, through a unique interface, with improved ease, speed and accuracy. This modular and customizable system allows the user to tailor the component-programs in the analysis process to their preference. GeneTerpret is available online at https://geneterpret.com .
变异解释是医学基因组测序工作的主要瓶颈。这通常涉及基因组分析师手动搜索大量独立的数据库,通常借助于几个,主要是独立的,计算工具。为了简化变异解释,我们开发了 GeneTerpret 平台,该平台汇集了来自当前解释工具和数据库的数据,并应用表型驱动的查询对基因组中识别出的变体进行分类。该平台通过查询和组装基因型-表型数据、序列同源性、分子相互作用、表达数据和动物模型,为基因分配定量有效性分数。它还使用美国医学遗传学与基因组学学院 (ACMG) 的标准将变体分为致病性的五个层次。最终的输出是潜在因果变异/基因的优先级列表。
我们通过将 GeneTerpret 的性能与专家 curated 的基因(ClinGen 的基因有效性数据库)和变体致病性报告(DECIPHER 数据库)进行比较来测试 GeneTerpret。GeneTerpret 的输出分别与专家 curated 的来源的一致性为 97.2%和 83.5%。此外,当将 GeneTerpret 的性能与我们内部专家解释的临床数据集进行比较时,也观察到了类似的一致性。
GeneTerpret 是一个灵活的平台,旨在通过独特的界面,提高易用性、速度和准确性,简化基因组解释过程。该模块化和可定制的系统允许用户根据自己的偏好调整分析过程中的组件程序。GeneTerpret 可在 https://geneterpret.com 上在线使用。