• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

突变亚型对等位基因频率谱和群体遗传学推断的影响。

The effect of mutation subtypes on the allele frequency spectrum and population genetics inference.

机构信息

Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA.

Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712, USA.

出版信息

G3 (Bethesda). 2023 Apr 11;13(4). doi: 10.1093/g3journal/jkad035.

DOI:10.1093/g3journal/jkad035
PMID:36759699
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10085755/
Abstract

Population genetics has adapted as technological advances in next-generation sequencing have resulted in an exponential increase of genetic data. A common approach to efficiently analyze genetic variation present in large sequencing data is through the allele frequency spectrum, defined as the distribution of allele frequencies in a sample. While the frequency spectrum serves to summarize patterns of genetic variation, it implicitly assumes mutation types (A→C vs C→T) as interchangeable. However, mutations of different types arise and spread due to spatial and temporal variation in forces such as mutation rate and biased gene conversion that result in heterogeneity in the distribution of allele frequencies across sites. In this work, we explore the impact of this simplification on multiple aspects of population genetic modeling. As a site's mutation rate is strongly affected by flanking nucleotides, we defined a mutation subtype by the base pair change and adjacent nucleotides (e.g. AAA→ATA) and systematically assessed the heterogeneity in the frequency spectrum across 96 distinct 3-mer mutation subtypes using n = 3556 whole-genome sequenced individuals of European ancestry. We observed substantial variation across the subtype-specific frequency spectra, with some of the variation being influenced by molecular factors previously identified for single base mutation types. Estimates of model parameters from demographic inference performed for each mutation subtype's AFS individually varied drastically across the 96 subtypes. In local patterns of variation, a combination of regional subtype composition and local genomic factors shaped the regional frequency spectrum across genomic regions. Our results illustrate how treating variants in large sequencing samples as interchangeable may confound population genetic frameworks and encourages us to consider the unique evolutionary mechanisms of analyzed polymorphisms.

摘要

群体遗传学已经适应了下一代测序技术的进步,这些进步导致遗传数据呈指数级增长。一种分析大型测序数据中遗传变异的常用方法是通过等位基因频率谱,它定义为样本中等位基因频率的分布。虽然频谱有助于总结遗传变异的模式,但它隐含地假设突变类型(A→C 与 C→T)是可互换的。然而,由于突变率和偏向基因转换等因素在空间和时间上的变化,不同类型的突变会产生并传播,从而导致等位基因频率在不同位点的分布产生异质性。在这项工作中,我们探讨了这种简化对群体遗传建模多个方面的影响。由于一个位点的突变率受到侧翼核苷酸的强烈影响,我们通过碱基对变化和相邻核苷酸来定义突变亚型(例如 AAA→ATA),并使用 n = 3556 名具有欧洲血统的全基因组测序个体系统地评估了 96 种不同 3-碱基突变亚型的频谱在频率上的异质性。我们观察到在特定于亚型的频谱中存在大量的变异,其中一些变异受到先前为单碱基突变类型确定的分子因素的影响。为每个突变亚型的 AFS 单独进行的人口推断模型参数的估计在 96 个亚型之间变化很大。在局部变异模式中,区域亚型组成和局部基因组因素的组合塑造了整个基因组区域的区域频谱。我们的结果说明了将大型测序样本中的变体视为可互换可能会混淆群体遗传框架,并鼓励我们考虑所分析多态性的独特进化机制。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/8900d3841584/jkad035f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/86ecdcd211d6/jkad035f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/8a739d5aa1d1/jkad035f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/9d1192512b70/jkad035f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/5a6571c10891/jkad035f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/8409e45788ee/jkad035f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/8900d3841584/jkad035f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/86ecdcd211d6/jkad035f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/8a739d5aa1d1/jkad035f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/9d1192512b70/jkad035f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/5a6571c10891/jkad035f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/8409e45788ee/jkad035f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ac/10085755/8900d3841584/jkad035f6.jpg

相似文献

1
The effect of mutation subtypes on the allele frequency spectrum and population genetics inference.突变亚型对等位基因频率谱和群体遗传学推断的影响。
G3 (Bethesda). 2023 Apr 11;13(4). doi: 10.1093/g3journal/jkad035.
2
The influence of genomic context on mutation patterns in the human genome inferred from rare variants.从稀有变异推断人类基因组中基因组背景对突变模式的影响。
Genome Res. 2013 Dec;23(12):1974-84. doi: 10.1101/gr.154971.113. Epub 2013 Aug 29.
3
Inductive determination of allele frequency spectrum probabilities in structured populations.结构化群体中等位基因频率谱概率的归纳确定。
Theor Popul Biol. 2019 Oct;129:148-159. doi: 10.1016/j.tpb.2018.10.004. Epub 2019 Jan 11.
4
Genotype-free estimation of allele frequencies reduces bias and improves demographic inference from RADSeq data.无基因型估计等位基因频率可减少偏差并提高 RADSeq 数据的种群遗传推断准确性。
Mol Ecol Resour. 2019 May;19(3):586-596. doi: 10.1111/1755-0998.12990. Epub 2019 Apr 17.
5
Genomic inference using diffusion models and the allele frequency spectrum.基于扩散模型和等位基因频率谱的基因组推断。
Curr Opin Genet Dev. 2018 Dec;53:140-147. doi: 10.1016/j.gde.2018.10.001. Epub 2018 Oct 23.
6
Limited role of generation time changes in driving the evolution of the mutation spectrum in humans.世代时间变化在驱动人类突变谱进化中的有限作用。
Elife. 2023 Feb 13;12:e81188. doi: 10.7554/eLife.81188.
7
Nonparametric coalescent inference of mutation spectrum history and demography.非参数合并推断突变谱历史和人口统计学。
Proc Natl Acad Sci U S A. 2021 May 25;118(21). doi: 10.1073/pnas.2013798118.
8
Efficient inference of population size histories and locus-specific mutation rates from large-sample genomic variation data.从大样本基因组变异数据中高效推断种群大小历史和基因座特异性突变率。
Genome Res. 2015 Feb;25(2):268-79. doi: 10.1101/gr.178756.114. Epub 2015 Jan 6.
9
Relative mutation rates of each nucleotide for another estimated from allele frequency spectra at human gene loci.通过人类基因座的等位基因频率谱估计的另一种情况下每个核苷酸的相对突变率。
Genet Res (Camb). 2009 Aug;91(4):293-303. doi: 10.1017/S0016672309990164.
10
General triallelic frequency spectrum under demographic models with variable population size.人口规模可变的人口统计模型下的一般三等位基因频率谱。
Genetics. 2014 Jan;196(1):295-311. doi: 10.1534/genetics.113.158584. Epub 2013 Nov 8.

本文引用的文献

1
Population sequencing data reveal a compendium of mutational processes in the human germ line.人群测序数据揭示了人类种系中一系列突变过程。
Science. 2021 Aug 27;373(6558):1030-1035. doi: 10.1126/science.aba7408. Epub 2021 Aug 12.
2
fastsimcoal2: demographic inference under complex evolutionary scenarios.fastsimcoal2:复杂进化场景下的人口推断。
Bioinformatics. 2021 Dec 11;37(24):4882-4885. doi: 10.1093/bioinformatics/btab468.
3
Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.美国国立卫生研究院生物医学高级研究与发展局(NHLBI)TOPMed 项目中对 53831 个不同基因组进行测序。
Nature. 2021 Feb;590(7845):290-299. doi: 10.1038/s41586-021-03205-y. Epub 2021 Feb 10.
4
Efficiently inferring the demographic history of many populations with allele count data.利用等位基因计数数据高效推断多个群体的人口历史。
J Am Stat Assoc. 2020;115(531):1472-1487. doi: 10.1080/01621459.2019.1635482. Epub 2019 Jul 22.
5
Insights into the Link between the Organization of DNA Replication and the Mutational Landscape.深入了解 DNA 复制的组织与突变景观之间的联系。
Genes (Basel). 2019 Mar 27;10(4):252. doi: 10.3390/genes10040252.
6
Signals of Variation in Human Mutation Rate at Multiple Levels of Sequence Context.人类突变率在多种序列背景下的变化信号。
Mol Biol Evol. 2019 May 1;36(5):955-965. doi: 10.1093/molbev/msz023.
7
Extremely rare variants reveal patterns of germline mutation rate heterogeneity in humans.极罕见的变异揭示了人类种系突变率异质性的模式。
Nat Commun. 2018 Sep 14;9(1):3753. doi: 10.1038/s41467-018-05936-5.
8
Background selection and biased gene conversion affect more than 95% of the human genome and bias demographic inferences.背景选择和有偏基因转换影响了超过 95%的人类基因组,并偏向人口统计学推断。
Elife. 2018 Aug 23;7:e36317. doi: 10.7554/eLife.36317.
9
Efficient computation of the joint sample frequency spectra for multiple populations.多群体联合样本频率谱的高效计算。
J Comput Graph Stat. 2017;26(1):182-194. doi: 10.1080/10618600.2016.1159212. Epub 2017 Feb 16.
10
Mutation Rate Variation is a Primary Determinant of the Distribution of Allele Frequencies in Humans.突变率变异是人类等位基因频率分布的主要决定因素。
PLoS Genet. 2016 Dec 15;12(12):e1006489. doi: 10.1371/journal.pgen.1006489. eCollection 2016 Dec.