Suppr超能文献

来自阿拉伯联合酋长国人群的特定人群主要等位基因参考基因组。

A Population-Specific Major Allele Reference Genome From The United Arab Emirates Population.

作者信息

Daw Elbait Gihan, Henschel Andreas, Tay Guan K, Al Safar Habiba S

机构信息

Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.

Department of Electrical Engineering and Computer Science, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.

出版信息

Front Genet. 2021 Apr 23;12:660428. doi: 10.3389/fgene.2021.660428. eCollection 2021.

Abstract

The ethnic composition of the population of a country contributes to the uniqueness of each national DNA sequencing project and, ideally, individual reference genomes are required to reduce the confounding nature of ethnic bias. This work represents a representative Whole Genome Sequencing effort of an understudied population. Specifically, high coverage consensus sequences from 120 whole genomes and 33 whole exomes were used to construct the first ever population specific major allele reference genome for the United Arab Emirates (UAE). When this was applied and compared to the archetype hg19 reference, assembly of local Emirati genomes was reduced by ∼19% (i.e., some 1 million fewer calls). In compiling the United Arab Emirates Reference Genome (UAERG), sets of annotated 23,038,090 short (novel: 1,790,171) and 137,713 structural (novel: 8,462) variants; their allele frequencies (AFs) and distribution across the genome were identified. Population-specific genetic characteristics including loss-of-function variants, admixture, and ancestral haplogroup distribution were identified and reported here. We also detect a strong correlation between and admixture components in the UAE. This baseline study was conceived to establish a high-quality reference genome and a genetic variations resource to enable the development of regional population specific initiatives and thus inform the application of population studies and precision medicine in the UAE.

摘要

一个国家的人口种族构成造就了每个国家DNA测序项目的独特性,理想情况下,需要个体参考基因组来减少种族偏见带来的混杂影响。这项工作代表了对一个研究较少的人群进行的具有代表性的全基因组测序工作。具体而言,利用来自120个全基因组和33个全外显子组的高覆盖度一致序列,构建了有史以来第一个针对阿拉伯联合酋长国(阿联酋)人群的主要等位基因参考基因组。当将其应用并与原型hg19参考基因组进行比较时,阿联酋本地基因组的组装减少了约19%(即约少了100万个位点)。在编制阿联酋参考基因组(UAERG)时,确定了23,038,090个短变异(新变异:1,790,171个)和137,713个结构变异(新变异:8,462个)的注释集;确定了它们的等位基因频率(AFs)及其在基因组中的分布。本文确定并报告了包括功能丧失变异、混合和祖先单倍群分布在内的人群特异性遗传特征。我们还在阿联酋检测到了 与混合成分之间的强相关性。这项基线研究旨在建立一个高质量的参考基因组和一个遗传变异资源,以推动制定针对该地区人群的具体计划,从而为阿联酋的人群研究和精准医学应用提供信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60fc/8102833/3b2b1e7f6b84/fgene-12-660428-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验