Lecoquierre François, Quenez Olivier, Fourneaux Steeve, Coutant Sophie, Vezain Myriam, Rolain Marion, Drouot Nathalie, Boland Anne, Olaso Robert, Meyer Vincent, Deleuze Jean-François, Dabbagh Dana, Gilles Isabelle, Gayet Claire, Saugier-Veber Pascale, Goldenberg Alice, Guerrot Anne-Marie, Nicolas Gaël
Univ Rouen Normandie, Inserm U12045 and CHU Rouen, Department of Genetics and Reference Center for Developmental Disorders, FHU-G4 Génomique, F-76000, Rouen, France.
Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), 91057, Evry, France.
Hum Genet. 2023 Jun;142(6):773-783. doi: 10.1007/s00439-023-02553-1. Epub 2023 Apr 19.
Exome sequencing (ES) has become the method of choice for diagnosing rare diseases, while the availability of short-read genome sequencing (SR-GS) in a medical setting is increasing. In addition, new sequencing technologies, such as long-read genome sequencing (LR-GS) and transcriptome sequencing, are being increasingly used. However, the contribution of these techniques compared to widely used ES is not well established, particularly in regards to the analysis of non-coding regions. In a pilot study of five probands affected by an undiagnosed neurodevelopmental disorder, we performed trio-based short-read GS and long-read GS as well as case-only peripheral blood transcriptome sequencing. We identified three new genetic diagnoses, none of which affected the coding regions. More specifically, LR-GS identified a balanced inversion in NSD1, highlighting a rare mechanism of Sotos syndrome. SR-GS identified a homozygous deep intronic variant of KLHL7 resulting in a neoexon inclusion, and a de novo mosaic intronic 22-bp deletion in KMT2D, leading to the diagnosis of Perching and Kabuki syndromes, respectively. All three variants had a significant effect on the transcriptome, which showed decreased gene expression, mono-allelic expression and splicing defects, respectively, further validating the effect of these variants. Overall, in undiagnosed patients, the combination of short and long read GS allowed the detection of cryptic variations not or barely detectable by ES, making it a highly sensitive method at the cost of more complex bioinformatics approaches. Transcriptome sequencing is a valuable complement for the functional validation of variations, particularly in the non-coding genome.
外显子组测序(ES)已成为诊断罕见病的首选方法,同时医学环境中短读长基因组测序(SR-GS)的可用性也在增加。此外,新的测序技术,如长读长基因组测序(LR-GS)和转录组测序,也越来越多地被使用。然而,与广泛使用的ES相比,这些技术的贡献尚未得到充分证实,特别是在非编码区的分析方面。在一项对五名患有未确诊神经发育障碍的先证者的试点研究中,我们进行了基于三联体的短读长GS和长读长GS以及仅病例的外周血转录组测序。我们确定了三个新的基因诊断,其中没有一个影响编码区。更具体地说,LR-GS确定了NSD1中的一个平衡倒位,突出了一种罕见的索托斯综合征机制。SR-GS确定了KLHL7的一个纯合深度内含子变异,导致一个新外显子包含,以及KMT2D中的一个新生嵌合内含子22bp缺失,分别导致了佩尔钦综合征和歌舞伎综合征的诊断。所有这三个变异对转录组都有显著影响,分别表现为基因表达降低、单等位基因表达和剪接缺陷,进一步验证了这些变异的作用。总体而言,在未确诊的患者中,短读长和长读长GS的结合能够检测到ES无法或几乎无法检测到的隐匿变异,使其成为一种高度敏感的方法,但代价是生物信息学方法更为复杂。转录组测序是变异功能验证的宝贵补充,特别是在非编码基因组中。