• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大规模基因组测序时代的哈迪-温伯格平衡

Hardy-Weinberg Equilibrium in the Large Scale Genomic Sequencing Era.

作者信息

Abramovs Nikita, Brass Andrew, Tassabehji May

机构信息

School of Computer Science, University of Manchester, Manchester, United Kingdom.

Faculty of Biology, Medicine and Health, School of Biological Sciences, University of Manchester, Manchester, United Kingdom.

出版信息

Front Genet. 2020 Mar 13;11:210. doi: 10.3389/fgene.2020.00210. eCollection 2020.

DOI:10.3389/fgene.2020.00210
PMID:32231685
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7083100/
Abstract

Hardy-Weinberg Equilibrium (HWE) is used to estimate the number of homozygous and heterozygous variant carriers based on its allele frequency in populations that are not evolving. Deviations from HWE in large population databases have been used to detect genotyping errors, which can result in extreme heterozygote excess (HetExc). However, HetExc might also be a sign of natural selection since recessive disease causing variants should occur less frequently in a homozygous state in the population, but may reach high allele frequency in a heterozygous state, especially if they are advantageous. We developed a filtering strategy to detect these variants and applied it on genome data from 137,842 individuals. The main limitations of this approach were quality of genotype calls and insufficient population sizes, whereas population structure and inbreeding can reduce sensitivity, but not precision, in certain populations. Nevertheless, we identified 161 HetExc variants in 149 genes, most of which were specific to African/African American populations (∼79.5%). Although the majority of them were not associated with known diseases, or were classified as clinically "benign," they were enriched in genes associated with autosomal recessive diseases. The resulting dataset also contained two known recessive disease causing variants with evidence of heterozygote advantage in the sickle-cell anemia and cystic fibrosis . Finally, we provide supporting evidence of a novel heterozygote advantageous variant in the chromodomain helicase DNA binding protein 6 gene (; involved in influenza virus replication). We anticipate that our approach will aid the detection of rare recessive disease causing variants in the future.

摘要

哈迪-温伯格平衡(HWE)用于根据非进化群体中的等位基因频率来估计纯合子和杂合子变异携带者的数量。在大型群体数据库中,与HWE的偏差已被用于检测基因分型错误,这可能导致极端杂合子过剩(HetExc)。然而,HetExc也可能是自然选择的一个迹象,因为导致隐性疾病的变异在群体中以纯合状态出现的频率应该较低,但在杂合状态下可能达到较高的等位基因频率,特别是如果它们具有优势。我们开发了一种过滤策略来检测这些变异,并将其应用于来自137842个人的基因组数据。这种方法的主要局限性是基因分型调用的质量和群体规模不足,而群体结构和近亲繁殖在某些群体中会降低敏感性,但不会降低精度。尽管如此,我们在149个基因中鉴定出161个HetExc变异,其中大多数是非洲/非裔美国人特有的(约79.5%)。虽然其中大多数与已知疾病无关,或被归类为临床“良性”,但它们在与常染色体隐性疾病相关的基因中富集。所得数据集还包含两个已知的导致隐性疾病的变异,在镰状细胞贫血和囊性纤维化中有杂合子优势的证据。最后,我们提供了染色质结构域解旋酶DNA结合蛋白6基因(参与流感病毒复制)中一个新的杂合子优势变异的支持证据。我们预计我们的方法将有助于未来检测导致罕见隐性疾病的变异。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9141/7083100/ca800a657da6/fgene-11-00210-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9141/7083100/b3ce6fd5409b/fgene-11-00210-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9141/7083100/65f8b659258c/fgene-11-00210-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9141/7083100/399bf0fdab44/fgene-11-00210-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9141/7083100/ca800a657da6/fgene-11-00210-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9141/7083100/b3ce6fd5409b/fgene-11-00210-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9141/7083100/65f8b659258c/fgene-11-00210-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9141/7083100/399bf0fdab44/fgene-11-00210-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9141/7083100/ca800a657da6/fgene-11-00210-g004.jpg

相似文献

1
Hardy-Weinberg Equilibrium in the Large Scale Genomic Sequencing Era.大规模基因组测序时代的哈迪-温伯格平衡
Front Genet. 2020 Mar 13;11:210. doi: 10.3389/fgene.2020.00210. eCollection 2020.
2
Hardy-Weinberg Equilibrium in Meta-Analysis Studies and Large-Scale Genomic Sequencing Era.荟萃分析研究和大规模基因组测序时代的哈迪-温伯格平衡。
Asian Pac J Cancer Prev. 2024 Jul 1;25(7):2229-2235. doi: 10.31557/APJCP.2024.25.7.2229.
3
Departure from Hardy Weinberg Equilibrium and Genotyping Error.偏离哈迪-温伯格平衡与基因分型错误。
Front Genet. 2017 Oct 31;8:167. doi: 10.3389/fgene.2017.00167. eCollection 2017.
4
The global prevalence and ethnic heterogeneity of primary ciliary dyskinesia gene variants: a genetic database analysis.原发性纤毛运动障碍基因变异的全球流行率和种族异质性:遗传数据库分析。
Lancet Respir Med. 2022 May;10(5):459-468. doi: 10.1016/S2213-2600(21)00453-7. Epub 2022 Jan 17.
5
Deviations from hardy-weinberg equilibrium in parental and unaffected sibling genotype data.亲本及未患病同胞基因型数据偏离哈迪-温伯格平衡。
Hum Hered. 2009;67(2):104-15. doi: 10.1159/000179558. Epub 2008 Dec 12.
6
Detection of genotyping errors and pseudo-SNPs via deviations from Hardy-Weinberg equilibrium.通过偏离哈迪-温伯格平衡检测基因分型错误和假单核苷酸多态性
Genet Epidemiol. 2005 Nov;29(3):204-14. doi: 10.1002/gepi.20086.
7
Testing for Hardy-Weinberg equilibrium in structured populations using genotype or low-depth next generation sequencing data.使用基因型或低深度下一代测序数据检测结构群体中的哈迪-温伯格平衡。
Mol Ecol Resour. 2019 Sep;19(5):1144-1152. doi: 10.1111/1755-0998.13019. Epub 2019 Jun 12.
8
Recursive Test of Hardy-Weinberg Equilibrium in Tetraploids.四倍体中 Hardy-Weinberg 平衡的递归检验。
Trends Genet. 2021 Jun;37(6):504-513. doi: 10.1016/j.tig.2020.11.006. Epub 2020 Dec 16.
9
Testing Departure from Hardy-Weinberg Proportions.检验偏离哈迪-温伯格平衡比例的情况。
Methods Mol Biol. 2017;1666:83-115. doi: 10.1007/978-1-4939-7274-6_6.
10
Commonly used Hardy-Weinberg equilibrium filtering schemes impact population structure inferences using RADseq data.常用的 Hardy-Weinberg 平衡过滤方案会影响使用 RADseq 数据进行的群体结构推断。
Mol Ecol Resour. 2022 Oct;22(7):2599-2613. doi: 10.1111/1755-0998.13646. Epub 2022 Jun 5.

引用本文的文献

1
Whole-Genome Resequencing Provides Novel Insights Into the Genetic Diversity, Population Structure, and Patterns of Runs of Homozygosity in Mud Crab ().全基因组重测序为青蟹的遗传多样性、种群结构和纯合子连续区域模式提供了新见解。
Evol Appl. 2025 Sep 3;18(9):e70153. doi: 10.1111/eva.70153. eCollection 2025 Sep.
2
RASEL: An Ensemble Model for Selection of Core SNPs and Its Application for Identification and Classification of Cattle Breeds.RASEL:一种用于选择核心单核苷酸多态性的集成模型及其在牛品种鉴定和分类中的应用
Biochem Genet. 2025 Aug 22. doi: 10.1007/s10528-025-11230-z.
3
Effect of HLA restriction on racial and ethnic disparities in access to immune therapies for advanced synovial sarcoma.

本文引用的文献

1
SciPy 1.0: fundamental algorithms for scientific computing in Python.SciPy 1.0:Python 中的科学计算基础算法。
Nat Methods. 2020 Mar;17(3):261-272. doi: 10.1038/s41592-019-0686-2. Epub 2020 Feb 3.
2
GeVIR is a continuous gene-level metric that uses variant distribution patterns to prioritize disease candidate genes.GeVIR 是一种连续的基因水平度量标准,它利用变异分布模式来优先考虑疾病候选基因。
Nat Genet. 2020 Jan;52(1):35-39. doi: 10.1038/s41588-019-0560-2. Epub 2019 Dec 23.
3
Gene discovery informatics toolkit defines candidate genes for unexplained infertility and prenatal or infantile mortality.
HLA限制对晚期滑膜肉瘤免疫治疗可及性方面种族和民族差异的影响。
Oncologist. 2025 Jul 4;30(7). doi: 10.1093/oncolo/oyaf193.
4
Genome wide locus-specific ancestry analysis revealed adaptive admixtures in crossbred cattle of India.全基因组位点特异性祖先分析揭示了印度杂交牛中的适应性混合。
Sci Rep. 2025 May 16;15(1):17069. doi: 10.1038/s41598-025-01971-7.
5
Polygenic Risk and Nutrient Intake Interactions on Obesity Outcomes: A Systematic Review and Meta-Analysis of Observational Studies.多基因风险与营养摄入对肥胖结局的相互作用:观察性研究的系统评价与荟萃分析
Obes Rev. 2025 Sep;26(9):e13941. doi: 10.1111/obr.13941. Epub 2025 May 16.
6
Potential protective association of the AA genotype and a allele of CXCR4 rs2228014 polymorphism with COVID-19 severity in adult egyptians.埃及成年人中 CXCR4 rs2228014 多态性的 AA 基因型和 a 等位基因与 COVID-19 严重程度的潜在保护关联。
BMC Infect Dis. 2024 Oct 15;24(1):1158. doi: 10.1186/s12879-024-09602-8.
7
Semi-supervised machine learning method for predicting homogeneous ancestry groups to assess Hardy-Weinberg equilibrium in diverse whole-genome sequencing studies.用于预测同源祖先群体的半监督机器学习方法,以评估多样化全基因组测序研究中的 Hardy-Weinberg 平衡。
Am J Hum Genet. 2024 Oct 3;111(10):2129-2138. doi: 10.1016/j.ajhg.2024.08.018. Epub 2024 Sep 12.
8
Associations of ACE I/D and AGTR1 rs5182 polymorphisms with diabetes and their effects on lipids in an elderly Chinese population.ACE I/D 和 AGTR1 rs5182 多态性与老年中国人群糖尿病的关联及其对血脂的影响。
Lipids Health Dis. 2024 Jul 30;23(1):231. doi: 10.1186/s12944-024-02222-w.
9
Hardy-Weinberg Equilibrium in Meta-Analysis Studies and Large-Scale Genomic Sequencing Era.荟萃分析研究和大规模基因组测序时代的哈迪-温伯格平衡。
Asian Pac J Cancer Prev. 2024 Jul 1;25(7):2229-2235. doi: 10.31557/APJCP.2024.25.7.2229.
10
Patterns of Genetic Diversity and Gene Flow Associated With an Aridity Gradient in Populations of Common Mole-rats, Cryptomys hottentotus hottentotus.种群遗传多样性及基因流模式与干旱梯度相关联:以臀足鼩形田鼠为例
Genome Biol Evol. 2024 Jul 3;16(7). doi: 10.1093/gbe/evae144.
基因发现信息学工具包可确定不明原因不孕以及产前或婴儿死亡的候选基因。
NPJ Genom Med. 2019 Apr 15;4:8. doi: 10.1038/s41525-019-0081-z. eCollection 2019.
4
The UCSC Genome Browser database: 2019 update.UCSC 基因组浏览器数据库:2019 年更新。
Nucleic Acids Res. 2019 Jan 8;47(D1):D853-D858. doi: 10.1093/nar/gky1095.
5
GENCODE reference annotation for the human and mouse genomes.GENCODE 人类和小鼠基因组参考注释。
Nucleic Acids Res. 2019 Jan 8;47(D1):D766-D773. doi: 10.1093/nar/gky955.
6
Allele balance bias identifies systematic genotyping errors and false disease associations.等位基因平衡偏倚可识别系统的基因分型错误和虚假的疾病关联。
Hum Mutat. 2019 Jan;40(1):115-126. doi: 10.1002/humu.23674. Epub 2018 Nov 23.
7
Genetic diseases conferring resistance to infectious diseases.赋予对传染病抵抗力的遗传疾病。
Genes Dis. 2015 Feb 25;2(3):247-254. doi: 10.1016/j.gendis.2015.02.008. eCollection 2015 Sep.
8
Departure from Hardy Weinberg Equilibrium and Genotyping Error.偏离哈迪-温伯格平衡与基因分型错误。
Front Genet. 2017 Oct 31;8:167. doi: 10.3389/fgene.2017.00167. eCollection 2017.
9
Cystic fibrosis carriership and tuberculosis: hints toward an evolutionary selective advantage based on data from the Brazilian territory.囊性纤维化携带者与结核病:基于巴西地区数据对进化选择优势的提示
BMC Infect Dis. 2017 May 12;17(1):340. doi: 10.1186/s12879-017-2448-z.
10
Assessment of the ExAC data set for the presence of individuals with pathogenic genotypes implicated in severe Mendelian pediatric disorders.评估 ExAC 数据集是否存在与严重孟德尔儿科疾病相关的致病性基因型个体。
Genet Med. 2017 Dec;19(12):1300-1308. doi: 10.1038/gim.2017.50. Epub 2017 May 4.