Institute of Neurology, University College London (UCL), London, UK.
NIHR Great Ormond Street Hospital Biomedical Research Centre, University College London, London, UK.
Sci Adv. 2020 Jun 10;6(24). doi: 10.1126/sciadv.aay8299. Print 2020 Jun.
Growing evidence suggests that human gene annotation remains incomplete; however, it is unclear how this affects different tissues and our understanding of different disorders. Here, we detect previously unannotated transcription from Genotype-Tissue Expression RNA sequencing data across 41 human tissues. We connect this unannotated transcription to known genes, confirming that human gene annotation remains incomplete, even among well-studied genes including 63% of the Online Mendelian Inheritance in Man-morbid catalog and 317 neurodegeneration-associated genes. We find the greatest abundance of unannotated transcription in brain and genes highly expressed in brain are more likely to be reannotated. We explore examples of reannotated disease genes, such as , for which we experimentally validate a previously unidentified, brain-specific, potentially protein-coding exon. We release all tissue-specific transcriptomes through vizER: http://rytenlab.com/browser/app/vizER We anticipate that this resource will facilitate more accurate genetic analysis, with the greatest impact on our understanding of Mendelian and complex neurogenetic disorders.
越来越多的证据表明,人类基因注释仍然不完整;然而,目前尚不清楚这对不同组织以及我们对不同疾病的理解有何影响。在这里,我们通过 41 个人体组织的基因型组织表达 RNA 测序数据检测到以前未注释的转录本。我们将这些未注释的转录本与已知基因联系起来,证实即使在包括在线孟德尔遗传人类疾病目录中的 63%和 317 个神经退行性疾病相关基因在内的研究充分的基因中,人类基因注释仍然不完整。我们发现未注释转录本在大脑中的丰度最高,在大脑中高表达的基因更有可能被重新注释。我们探索了重新注释疾病基因的例子,例如,我们通过实验验证了一个以前未被识别的、大脑特异性的、可能具有编码蛋白功能的外显子。我们通过 vizER 发布了所有组织特异性转录组:http://rytenlab.com/browser/app/vizER 我们预计该资源将促进更准确的遗传分析,对我们理解孟德尔和复杂神经遗传疾病的影响最大。