Telethon Institute of Genetics and Medicine (TIGEM), Via Campi Flegrei, 34, 80078, Pozzuoli, Italy.
Dipartimento di Scienze e Tecnologie Ambientali, Biologiche e Farmaceutiche, Program in Molecular Life Science, University of Campania "Luigi Vanvitelli", Via Vivaldi, 43, 81100, Caserta, Italy.
BMC Genomics. 2023 Apr 18;24(1):206. doi: 10.1186/s12864-023-09300-w.
Inherited retinal diseases (IRD) are genetically heterogeneous disorders that cause the dysfunction or loss of photoreceptor cells and ultimately lead to blindness. To date, next-generation sequencing procedures fail to detect pathogenic sequence variants in coding regions of known IRD disease genes in about 30-40% of patients. One of the possible explanations for this missing heritability is the presence of yet unidentified transcripts of known IRD genes. Here, we aimed to define the transcript composition of IRD genes in the human retina by a meta-analysis of publicly available RNA-seq datasets using an ad-hoc designed pipeline.
We analysed 218 IRD genes and identified 5,054 transcripts, 3,367 of which were not previously reported. We assessed their putative expression levels and focused our attention on 435 transcripts predicted to account for at least 5% of the expression of the corresponding gene. We looked at the possible impact of the newly identified transcripts at the protein level and experimentally validated a subset of them.
This study provides an unprecedented, detailed overview of the complexity of the human retinal transcriptome that can be instrumental in contributing to the resolution of some cases of missing heritability in IRD patients.
遗传性视网膜疾病(IRD)是一种遗传异质性疾病,可导致光感受器细胞的功能障碍或丧失,最终导致失明。迄今为止,下一代测序程序无法在约 30-40%的患者的已知 IRD 疾病基因的编码区域中检测到致病序列变异。这种遗传缺失的一个可能解释是已知 IRD 基因的尚未识别的转录本的存在。在这里,我们旨在通过使用专门设计的管道对公开可用的 RNA-seq 数据集进行荟萃分析,来定义人类视网膜中 IRD 基因的转录本组成。
我们分析了 218 个 IRD 基因,鉴定了 5054 个转录本,其中 3367 个以前没有报道过。我们评估了它们的可能表达水平,并将注意力集中在 435 个预测至少占相应基因表达 5%的转录本上。我们研究了新鉴定的转录本在蛋白质水平上的可能影响,并对其中的一部分进行了实验验证。
本研究提供了对人类视网膜转录组复杂性的前所未有的详细概述,这对于解决一些 IRD 患者遗传缺失的问题可能有帮助。