Bioinformatics and Biostatistics Core, Centre of Genomic and Precision Medicine, National Taiwan University, Taipei, Taiwan.
Department of Biomedical Engineering, College of Biomedical Engineering, China Medical University, Taichung, Taiwan.
J Biomed Inform. 2023 Jul;143:104423. doi: 10.1016/j.jbi.2023.104423. Epub 2023 Jun 10.
Genotype imputation is a commonly used technique that infers un-typed variants into a study's genotype data, allowing better identification of causal variants in disease studies. However, due to overrepresentation of Caucasian studies, there's a lack of understanding of genetic basis of health-outcomes in other ethnic populations. Therefore, facilitating imputation of missing key-predictor-variants that can potentially improve a risk health-outcome prediction model, specifically for Asian ancestry, is of utmost relevance.
We aimed to construct an imputation and analysis web-platform, that primarily facilitates, but is not limited to genotype imputation on East-Asians. The goal is to provide a collaborative imputation platform for researchers in the public domain towards rapidly and efficiently conducting accurate genotype imputation.
We present an online genotype imputation platform, Multi-ethnic Imputation System (MI-System) (https://misystem.cgm.ntu.edu.tw/), that offers users 3 established pipelines, SHAPEIT2-IMPUTE2, SHAPEIT4-IMPUTE5, and Beagle5.1 for conducting imputation analyses. In addition to 1000 Genomes and Hapmap3, a new customized Taiwan Biobank (TWB) reference panel, specifically created for Taiwanese-Chinese ancestry is provided. MI-System further offers functions to create customized reference panels to be used for imputation, conduct quality control, split whole genome data into chromosomes, and convert genome builds.
Users can upload their genotype data and perform imputation with minimum effort and resources. The utility functions further can be utilized to preprocess user uploaded data with easy clicks. MI-System potentially contributes to Asian-population genetics research, while eliminating the requirement for high performing computational resources and bioinformatics expertise. It will enable an increased pace of research and provide a knowledge-base for genetic carriers of complex diseases, therefore greatly enhancing patient-driven research.
Multi-ethnic Imputation System (MI-System), primarily facilitates, but is not limited to, imputation on East-Asians, through 3 established prephasing-imputation pipelines, SHAPEIT2-IMPUTE2, SHAPEIT4-IMPUTE5, and Beagle5.1, where users can upload their genotype data and perform imputation and other utility functions with minimum effort and resources. A new customized Taiwan Biobank (TWB) reference panel, specifically created for Taiwanese-Chinese ancestry is provided. Utility functions include (a) create customized reference panels, (b) conduct quality control, (c) split whole genome data into chromosomes, and (d) convert genome builds. Users can also combine 2 reference panels using the system and use combined panels as reference to conduct imputation using MI-System.
基因分型是一种常用的技术,它可以将未分型的变体推断到研究的基因型数据中,从而更好地识别疾病研究中的因果变体。然而,由于高加索人群研究的代表性过高,对于其他种族群体的健康结果的遗传基础缺乏了解。因此,促进缺失关键预测因子变体的推断,从而有可能改善风险健康结果预测模型,特别是对于亚洲血统,是极其重要的。
我们旨在构建一个基于 imputation 和分析的网络平台,该平台主要促进但不限于东亚人的基因分型。目标是为公共领域的研究人员提供一个协作的 imputation 平台,以快速有效地进行准确的基因分型。
我们展示了一个在线基因分型 imputation 平台,即多民族 imputation 系统(MI-System)(https://misystem.cgm.ntu.edu.tw/),它为用户提供了 3 个已建立的管道,即 SHAPEIT2-IMPUTE2、SHAPEIT4-IMPUTE5 和 Beagle5.1,用于进行 imputation 分析。除了 1000 基因组和 Hapmap3 之外,还提供了一个新的定制的台湾生物银行(TWB)参考面板,专门为台湾汉族血统创建。MI-System 还提供了创建自定义参考面板的功能,用于 imputation、进行质量控制、将全基因组数据拆分为染色体以及转换基因组构建。
用户可以上传他们的基因型数据,并以最小的努力和资源进行 imputation。实用功能还可以通过轻松点击来利用用户上传的数据进行预处理。MI-System 有助于亚洲人群遗传学研究,同时消除了对高性能计算资源和生物信息学专业知识的需求。它将加快研究步伐,并为复杂疾病的遗传携带者提供知识库,从而极大地促进以患者为中心的研究。
多民族 imputation 系统(MI-System)主要通过 3 个已建立的预成 imputation 管道,即 SHAPEIT2-IMPUTE2、SHAPEIT4-IMPUTE5 和 Beagle5.1,促进但不限于东亚人的 imputation,用户可以上传他们的基因型数据,并以最小的努力和资源进行 imputation 和其他实用功能。提供了一个新的定制的台湾生物银行(TWB)参考面板,专门为台湾汉族血统创建。实用功能包括(a)创建自定义参考面板,(b)进行质量控制,(c)将全基因组数据拆分为染色体,以及(d)转换基因组构建。用户还可以使用系统组合 2 个参考面板,并使用组合面板作为参考,使用 MI-System 进行 imputation。