• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于单核苷酸多态性标记的杂合度估计是依赖于背景的,而且往往是错误的。

Estimates of heterozygosity from single nucleotide polymorphism markers are context-dependent and often wrong.

机构信息

School of Biological Sciences, University of Western Australia, Crawley, Western Australia, Australia.

出版信息

Mol Ecol Resour. 2024 May;24(4):e13947. doi: 10.1111/1755-0998.13947. Epub 2024 Mar 3.

DOI:10.1111/1755-0998.13947
PMID:38433491
Abstract

Genetic diversity is frequently described using heterozygosity, particularly in a conservation context. Often, it is estimated using single nucleotide polymorphisms (SNPs); however, it has been shown that heterozygosity values calculated from SNPs can be biased by both study design and filtering parameters. Though solutions have been proposed to address these issues, our own work has found them to be inadequate in some circumstances. Here, we aimed to improve the reliability and comparability of heterozygosity estimates, specifically by investigating how sample size and missing data thresholds influenced the calculation of autosomal heterozygosity (heterozygosity calculated from across the genome, i.e. fixed and variable sites). We also explored how the standard practice of tri- and tetra-allelic site exclusion could bias heterozygosity estimates and influence eventual conclusions relating to genetic diversity. Across three distinct taxa (a frog, Litoria rubella; a tree, Eucalyptus microcarpa; and a grasshopper, Keyacris scurra), we found heterozygosity estimates to be meaningfully affected by sample size and missing data thresholds, partly due to the exclusion of tri- and tetra-allelic sites. These biases were inconsistent both between species and populations, with more diverse populations tending to have their estimates more severely affected, thus having potential to dramatically alter interpretations of genetic diversity. We propose a modified framework for calculating heterozygosity that reduces bias and improves the utility of heterozygosity as a measure of genetic diversity, whilst also highlighting the need for existing population genetic pipelines to be adjusted such that tri- and tetra-allelic sites be included in calculations.

摘要

遗传多样性通常使用杂合度来描述,尤其是在保护生物学中。通常使用单核苷酸多态性(SNP)来估计杂合度;然而,已经表明,从 SNP 计算得出的杂合度值可能会受到研究设计和过滤参数的影响。尽管已经提出了解决这些问题的方法,但我们自己的工作发现,在某些情况下,这些方法并不足够。在这里,我们旨在提高杂合度估计的可靠性和可比性,具体方法是研究样本量和缺失数据阈值如何影响常染色体杂合度(从整个基因组计算得出的杂合度,即固定和可变位点)的计算。我们还探讨了三等位和四等位位点排除的标准实践如何偏倚杂合度估计值,并影响与遗传多样性相关的最终结论。通过对三个不同的分类群(一种青蛙,Litoria rubella;一种树,Eucalyptus microcarpa;和一种蝗虫,Keyacris scurra)进行研究,我们发现样本量和缺失数据阈值会对杂合度估计产生显著影响,部分原因是三等位和四等位位点的排除。这些偏差在物种和种群之间不一致,多样性较高的种群的估计值受到的影响更为严重,因此有可能极大地改变对遗传多样性的解释。我们提出了一种改进的杂合度计算框架,该框架可以减少偏倚,提高杂合度作为遗传多样性衡量标准的实用性,同时也强调需要调整现有的群体遗传分析管道,以便将三等位和四等位位点纳入计算。

相似文献

1
Estimates of heterozygosity from single nucleotide polymorphism markers are context-dependent and often wrong.基于单核苷酸多态性标记的杂合度估计是依赖于背景的,而且往往是错误的。
Mol Ecol Resour. 2024 May;24(4):e13947. doi: 10.1111/1755-0998.13947. Epub 2024 Mar 3.
2
A compilation of tri-allelic SNPs from 1000 Genomes and use of the most polymorphic loci for a large-scale human identification panel.从 1000 基因组中编译三等位基因单核苷酸多态性,并使用多态性最高的基因座构建大规模人类识别面板。
Forensic Sci Int Genet. 2020 May;46:102232. doi: 10.1016/j.fsigen.2020.102232. Epub 2020 Jan 17.
3
Estimating genomic diversity and population differentiation - an empirical comparison of microsatellite and SNP variation in Arabidopsis halleri.估计基因组多样性和种群分化——拟南芥微卫星和单核苷酸多态性变异的实证比较
BMC Genomics. 2017 Jan 11;18(1):69. doi: 10.1186/s12864-016-3459-7.
4
Estimating genome-wide heterozygosity: effects of demographic history and marker type.估算全基因组杂合率:人口历史和标记类型的影响。
Heredity (Edinb). 2014 Mar;112(3):240-7. doi: 10.1038/hdy.2013.99. Epub 2013 Oct 23.
5
Population genetic inferences using immune gene SNPs mirror patterns inferred by microsatellites.使用免疫基因单核苷酸多态性进行的群体遗传学推断反映了微卫星所推断的模式。
Mol Ecol Resour. 2017 May;17(3):481-491. doi: 10.1111/1755-0998.12591. Epub 2016 Aug 29.
6
Forensic genetic informativeness of an SNP panel consisting of 19 multi-allelic SNPs.19 个多位点 SNP 组成的 SNP 面板的法医学遗传信息量。
Forensic Sci Int Genet. 2018 May;34:49-56. doi: 10.1016/j.fsigen.2018.01.006. Epub 2018 Jan 31.
7
Ascertainment bias in estimates of average heterozygosity.平均杂合度估计中的确定偏倚。
Am J Hum Genet. 1996 May;58(5):1033-41.
8
A novel approach to estimating heterozygosity from low-coverage genome sequence.一种从低覆盖度基因组序列估算杂合度的新方法。
Genetics. 2013 Oct;195(2):553-61. doi: 10.1534/genetics.113.154500. Epub 2013 Aug 9.
9
A comparison of biallelic markers and microsatellites for the estimation of population and conservation genetic parameters in Atlantic salmon (Salmo salar).用于估计大西洋鲑(Salmo salar)种群和保护遗传参数的双等位基因标记与微卫星的比较
J Hered. 2007 Nov-Dec;98(7):692-704. doi: 10.1093/jhered/esm093. Epub 2007 Nov 5.
10
Genetic identities and local inbreeding in pure diploid clones with homoplasic markers: SNPs may be misleading.具有同质性标记的纯二倍体克隆中的遗传身份和局部近亲繁殖:单核苷酸多态性可能具有误导性。
Infect Genet Evol. 2015 Jul;33:227-32. doi: 10.1016/j.meegid.2015.05.008. Epub 2015 May 8.

引用本文的文献

1
Conservation Genomics for Threatened New Zealand (Gentianaceae) and Implications for Vulnerable Limestone Ecosystems.新西兰濒危龙胆科植物的保护基因组学及其对脆弱石灰岩生态系统的影响
Ecol Evol. 2025 Jun 17;15(6):e71596. doi: 10.1002/ece3.71596. eCollection 2025 Jun.
2
K-mer-based Approaches to Bridging Pangenomics and Population Genetics.基于K-mer的泛基因组学与群体遗传学关联方法。
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf047.
3
Simulating Genetic Mixing in Strongly Structured Populations of the Threatened Southern Brown Bandicoot ().
模拟濒危南方袋狸()高度结构化种群中的基因混合
Evol Appl. 2024 Dec 5;17(12):e70050. doi: 10.1111/eva.70050. eCollection 2024 Dec.
4
Haplotype-resolved genome and population genomics of the threatened garden dormouse in Europe.欧洲濒危花园睡鼠的单体型 resolved 基因组和群体基因组学研究。
Genome Res. 2024 Nov 20;34(11):2094-2107. doi: 10.1101/gr.279066.124.
5
Immediate Genetic Augmentation and Enhanced Habitat Connectivity Are Required to Secure the Future of an Iconic Endangered Freshwater Fish Population.为确保一种标志性濒危淡水鱼种群的未来,需要立即进行基因增强并改善栖息地连通性。
Evol Appl. 2024 Oct 12;17(10):e70019. doi: 10.1111/eva.70019. eCollection 2024 Oct.
6
Next-generation data filtering in the genomics era.基因组学时代的下一代数据过滤。
Nat Rev Genet. 2024 Nov;25(11):750-767. doi: 10.1038/s41576-024-00738-6. Epub 2024 Jun 14.
7
Previously unmeasured genetic diversity explains part of Lewontin's paradox in a -mer-based meta-analysis of 112 plant species.在一项基于112种植物物种的-mer元分析中,先前未测量的遗传多样性解释了部分莱翁汀悖论。
bioRxiv. 2024 Sep 8:2024.05.17.594778. doi: 10.1101/2024.05.17.594778.