Department of Molecular Biology, Instituto Nacional de Cardiología Ignacio Chávez, Mexico City, Mexico.
Department of Endocrinology, Instituto Nacional de Cardiología Ignacio Chávez, Mexico City, Mexico.
Life Sci. 2020 Nov 1;260:118313. doi: 10.1016/j.lfs.2020.118313. Epub 2020 Aug 21.
The prevalence and mortality of the outbreak of the COVID-19 pandemic show marked geographic variation. The presence of several subtypes of the coronavirus and the genetic differences in the populations could condition that variation. Thus, the objective of this study was to propose variants in genes that encode proteins related to the SARS-CoV-2 entry into the host cells as possible targets for genetic associations studies.
The allelic frequencies of the polymorphisms in the ACE2, TMPRSS2, TMPRSS11A, cathepsin L (CTSL), and elastase (ELANE) genes were obtained in four populations from the American, African, European, and Asian continents reported in the 1000 Genome Project. Moreover, we evaluated the potential biological effect of these variants using different web-based tools.
In the coding sequences of these genes, we detected one probably-damaging polymorphism located in the TMPRSS2 gene (rs12329760) that produces a change of amino acid. Furthermore, forty-eight polymorphisms with possible functional consequences were detected in the non-coding sequences of the following genes: three in ACE2, seventeen in TMPRSS2, ten in TMPRSS11A, twelve in ELANE, and six in CTSL. These polymorphisms produce binding sites for transcription factors and microRNAs. The minor allele frequencies of these polymorphisms vary in each community; indeed, some of them are high in specific populations.
In summary, using data of the 1000 Genome Project and web-based tools, we propose some polymorphisms, which, depending on the population, could be used for genetic association studies.
COVID-19 大流行的爆发率和死亡率呈现出明显的地域差异。冠状病毒的几个亚型以及人群中的遗传差异可能会导致这种变异。因此,本研究的目的是提出与 SARS-CoV-2 进入宿主细胞相关的编码蛋白的基因变异作为遗传关联研究的可能靶点。
从 1000 基因组计划中报告的来自美洲、非洲、欧洲和亚洲的四个人群中获得 ACE2、TMPRSS2、TMPRSS11A、组织蛋白酶 L(CTSL)和弹性蛋白酶(ELANE)基因中多态性的等位基因频率。此外,我们使用不同的基于网络的工具评估这些变体的潜在生物学效应。
在这些基因的编码序列中,我们检测到一个位于 TMPRSS2 基因(rs12329760)中的可能有害多态性,该多态性导致氨基酸发生变化。此外,在 ACE2 中的三个、TMPRSS2 中的十七个、TMPRSS11A 中的十个、ELANE 中的十二个和 CTSL 中的六个非编码序列中检测到四十八个具有可能功能后果的多态性。这些多态性产生转录因子和 microRNA 的结合位点。这些多态性的次要等位基因频率在每个社区中都有所不同;事实上,其中一些在特定人群中很高。
总之,使用 1000 基因组计划的数据和基于网络的工具,我们提出了一些多态性,这些多态性可能根据人群用于遗传关联研究。