Mohn Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, Bergen, Norway.
Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway.
PLoS One. 2024 Apr 18;19(4):e0300350. doi: 10.1371/journal.pone.0300350. eCollection 2024.
Monogenic diabetes is characterized as a group of diseases caused by rare variants in single genes. Like for other rare diseases, multiple genes have been linked to monogenic diabetes with different measures of pathogenicity, but the information on the genes and variants is not unified among different resources, making it challenging to process them informatically. We have developed an automated pipeline for collecting and harmonizing data on genetic variants linked to monogenic diabetes. Furthermore, we have translated variant genetic sequences into protein sequences accounting for all protein isoforms and their variants. This allows researchers to consolidate information on variant genes and proteins linked to monogenic diabetes and facilitates their study using proteomics or structural biology. Our open and flexible implementation using Jupyter notebooks enables tailoring and modifying the pipeline and its application to other rare diseases.
单基因糖尿病的特征是由单个基因中的罕见变异引起的一组疾病。与其他罕见疾病一样,多个基因与单基因糖尿病相关,其致病性的衡量标准也不同,但不同资源之间的基因和变异信息并不统一,这使得对其进行信息处理具有挑战性。我们开发了一个自动化的管道,用于收集和协调与单基因糖尿病相关的遗传变异数据。此外,我们还将变异基因序列翻译成包含所有蛋白质同工型及其变体的蛋白质序列。这使得研究人员能够整合与单基因糖尿病相关的变异基因和蛋白质的信息,并使用蛋白质组学或结构生物学来促进对其的研究。我们使用 Jupyter 笔记本的开放和灵活的实现方式,使得对管道及其在其他罕见疾病中的应用进行定制和修改成为可能。