Wellcome Centre for Human Genetics, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK.
Cambridge Genomics Laboratory, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK.
J Med Genet. 2023 Nov 27;60(12):1235-1244. doi: 10.1136/jmg-2023-109362.
Current clinical testing methods used to uncover the genetic basis of rare disease have inherent limitations, which can lead to causative pathogenic variants being missed. Within the rare disease arm of the 100 000 Genomes Project (100kGP), families were recruited under the clinical indication 'single autosomal recessive mutation in rare disease'. These participants presented with strong clinical suspicion for a specific autosomal recessive disorder, but only one suspected pathogenic variant had been identified through standard-of-care testing. Whole genome sequencing (WGS) aimed to identify cryptic 'second-hit' variants.
To investigate the 31 families with available data that remained unsolved following formal review within the 100kGP, SVRare was used to aggregate structural variants present in <1% of 100kGP participants. Small variants were assessed using population allele frequency data and SpliceAI. Literature searches and publicly available online tools were used for further annotation of pathogenicity.
Using these strategies, 8/31 cases were solved, increasing the overall diagnostic yield of this cohort from 10/41 (24.4%) to 18/41 (43.9%). Exemplar cases include a patient with cystic fibrosis harbouring a novel exonic LINE1 insertion in and a patient with generalised arterial calcification of infancy with complex interlinked duplications involving exons 2-6 of . Although ambiguous by short-read WGS, the variant structure was resolved using optical genome mapping and RNA analysis.
Systematic examination of cryptic variants across a multi-disease cohort successfully identifies additional pathogenic variants. WGS data analysis in autosomal recessive rare disease should consider complex structural and small intronic variants as potentially pathogenic second hits.
当前用于揭示罕见病遗传基础的临床检测方法存在固有局限性,这可能导致致病变异被遗漏。在 10 万基因组计划(100kGP)的罕见病分支中,根据“罕见病中单一常染色体隐性突变”的临床指征招募了家庭。这些参与者表现出对特定常染色体隐性疾病的强烈临床怀疑,但通过标准护理测试只鉴定出一个疑似致病变体。全基因组测序(WGS)旨在识别隐匿的“二次打击”变体。
为了研究在 100kGP 中经过正式审查后仍未解决的 31 个具有可用数据的家庭,使用 SVRare 来汇总 100kGP 参与者中 <1%存在的结构变体。使用群体等位基因频率数据和 SpliceAI 评估小变体。进一步注释致病性时使用文献搜索和公共在线工具。
使用这些策略,解决了 8/31 例病例,使该队列的整体诊断率从 10/41(24.4%)提高到 18/41(43.9%)。典型病例包括一名囊性纤维化患者,其 中存在新的外显子 LINE1 插入,以及一名婴儿期全身性动脉钙化患者,其 2-6 号外显子涉及复杂的相互关联的重复。尽管短读 WGS 结果不明确,但通过光学基因组图谱和 RNA 分析解决了 变体结构问题。
对多疾病队列中的隐匿变体进行系统检查可成功鉴定出其他致病变体。常染色体隐性罕见病的 WGS 数据分析应考虑复杂的结构和小内含子变体作为潜在的致病性二次打击。