• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对低通量基因组测序中的偏差进行建模,以实现准确的群体遗传推断。

Modeling biases from low-pass genome sequencing to enable accurate population genetic inferences.

作者信息

Fonseca Emanuel M, Tran Linh N, Mendoza Hannah, Gutenkunst Ryan N

机构信息

Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA.

出版信息

bioRxiv. 2024 Jul 23:2024.07.19.604366. doi: 10.1101/2024.07.19.604366.

DOI:10.1101/2024.07.19.604366
PMID:39091836
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11291017/
Abstract

Low-pass genome sequencing is cost-effective and enables analysis of large cohorts. However, it introduces biases by reducing heterozygous genotypes and low-frequency alleles, impacting subsequent analyses such as demographic history inference. We developed a probabilistic model of low-pass biases from the Genome Analysis Toolkit (GATK) multi-sample calling pipeline, and we implemented it in the population genomic inference software dadi. We evaluated the model using simulated low-pass datasets and found that it alleviated low-pass biases in inferred demographic parameters. We further validated the model by downsampling 1000 Genomes Project data, demonstrating its effectiveness on real data. Our model is widely applicable and substantially improves model-based inferences from low-pass population genomic data.

摘要

低覆盖度基因组测序具有成本效益,能够对大规模队列进行分析。然而,它通过减少杂合基因型和低频等位基因引入偏差,影响后续分析,如人口历史推断。我们从基因组分析工具包(GATK)多样本调用流程中开发了一个低覆盖度偏差的概率模型,并将其应用于群体基因组推断软件dadi中。我们使用模拟的低覆盖度数据集评估了该模型,发现它减轻了推断人口参数中的低覆盖度偏差。我们通过对千人基因组计划数据进行下采样进一步验证了该模型,证明了其在真实数据上的有效性。我们的模型具有广泛的适用性,显著改进了基于低覆盖度群体基因组数据的模型推断。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec8/11291017/c96bc0b525af/nihpp-2024.07.19.604366v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec8/11291017/1548b284be44/nihpp-2024.07.19.604366v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec8/11291017/8f800e6d4947/nihpp-2024.07.19.604366v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec8/11291017/827b9555f4f1/nihpp-2024.07.19.604366v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec8/11291017/c96bc0b525af/nihpp-2024.07.19.604366v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec8/11291017/1548b284be44/nihpp-2024.07.19.604366v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec8/11291017/8f800e6d4947/nihpp-2024.07.19.604366v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec8/11291017/827b9555f4f1/nihpp-2024.07.19.604366v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec8/11291017/c96bc0b525af/nihpp-2024.07.19.604366v1-f0004.jpg

相似文献

1
Modeling biases from low-pass genome sequencing to enable accurate population genetic inferences.对低通量基因组测序中的偏差进行建模,以实现准确的群体遗传推断。
bioRxiv. 2024 Jul 23:2024.07.19.604366. doi: 10.1101/2024.07.19.604366.
2
Modeling Biases from Low-Pass Genome Sequencing to Enable Accurate Population Genetic Inferences.对低通量基因组测序中的偏差进行建模以实现准确的群体遗传推断。
Mol Biol Evol. 2025 Jan 6;42(1). doi: 10.1093/molbev/msaf002.
3
Genotype-free estimation of allele frequencies reduces bias and improves demographic inference from RADSeq data.无基因型估计等位基因频率可减少偏差并提高 RADSeq 数据的种群遗传推断准确性。
Mol Ecol Resour. 2019 May;19(3):586-596. doi: 10.1111/1755-0998.12990. Epub 2019 Apr 17.
4
Characterizing bias in population genetic inferences from low-coverage sequencing data.从低覆盖测序数据中推断群体遗传时的偏差特征分析。
Mol Biol Evol. 2014 Mar;31(3):723-35. doi: 10.1093/molbev/mst229. Epub 2013 Nov 27.
5
The size and composition of haplotype reference panels impact the accuracy of imputation from low-pass sequencing in cattle.单体型参考面板的大小和组成会影响牛低深度测序数据的准确性。
Genet Sel Evol. 2023 May 11;55(1):33. doi: 10.1186/s12711-023-00809-y.
6
From reference genomes to population genomics: comparing three reference-aligned reduced-representation sequencing pipelines in two wildlife species.从参考基因组到群体基因组学:比较两种野生动物中三种基于参考的简化代表性测序方法。
BMC Genomics. 2019 Jun 3;20(1):453. doi: 10.1186/s12864-019-5806-y.
7
Fast and accurate DNASeq variant calling workflow composed of LUSH toolkit.由 LUSH 工具包组成的快速准确的 DNA 测序变异调用工作流程。
Hum Genomics. 2024 Oct 10;18(1):114. doi: 10.1186/s40246-024-00666-w.
8
Quality control and integration of genotypes from two calling pipelines for whole genome sequence data in the Alzheimer's disease sequencing project.全基因组序列数据阿尔茨海默病测序项目中两种调用管道基因型的质量控制和整合。
Genomics. 2019 Jul;111(4):808-818. doi: 10.1016/j.ygeno.2018.05.004. Epub 2018 May 29.
9
Characterization of a haplotype-reference panel for genotyping by low-pass sequencing in Swiss Large White pigs.用于瑞士大白猪低通量测序基因分型的单倍型参考面板的特征分析
BMC Genomics. 2021 Apr 21;22(1):290. doi: 10.1186/s12864-021-07610-5.
10
Detectability of runs of homozygosity is influenced by analysis parameters and population-specific demographic history.杂合性缺失的可检测性受分析参数和特定人群的人口统计学历史的影响。
PLoS Comput Biol. 2024 Oct 31;20(10):e1012566. doi: 10.1371/journal.pcbi.1012566. eCollection 2024 Oct.

本文引用的文献

1
Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses.降维方法在某些常见的系统地理学分析中与全基因组测序产生相似的结果。
PLoS One. 2023 Nov 30;18(11):e0291941. doi: 10.1371/journal.pone.0291941. eCollection 2023.
2
Imputation of ancient human genomes.古代人类基因组的推断。
Nat Commun. 2023 Jun 20;14(1):3660. doi: 10.1038/s41467-023-39202-0.
3
SLiM 4: Multispecies Eco-Evolutionary Modeling.SLiM 4:多物种生态进化建模。
Am Nat. 2023 May;201(5):E127-E139. doi: 10.1086/723601. Epub 2023 Mar 21.
4
Demes: a standard format for demographic models.人群:人口模型的标准格式。
Genetics. 2022 Nov 1;222(3). doi: 10.1093/genetics/iyac131.
5
The genomic origins of the world's first farmers.世界上第一批农民的基因组起源。
Cell. 2022 May 26;185(11):1842-1859.e18. doi: 10.1016/j.cell.2022.04.008. Epub 2022 May 12.
6
Assessing model adequacy leads to more robust phylogeographic inference.评估模型的充分性可得出更可靠的系统发育地理学推断。
Trends Ecol Evol. 2022 May;37(5):402-410. doi: 10.1016/j.tree.2021.12.007. Epub 2022 Jan 10.
7
Efficient ancestry and mutation simulation with msprime 1.0.利用 msprime 1.0 进行高效的祖先和突变模拟。
Genetics. 2022 Mar 3;220(3). doi: 10.1093/genetics/iyab229.
8
A beginner's guide to low-coverage whole genome sequencing for population genomics.人群基因组学低覆盖度全基因组测序入门指南。
Mol Ecol. 2021 Dec;30(23):5966-5993. doi: 10.1111/mec.16077. Epub 2021 Aug 31.
9
fastsimcoal2: demographic inference under complex evolutionary scenarios.fastsimcoal2:复杂进化场景下的人口推断。
Bioinformatics. 2021 Dec 11;37(24):4882-4885. doi: 10.1093/bioinformatics/btab468.
10
Inferring Genome-Wide Correlations of Mutation Fitness Effects between Populations.推断种群间突变适应度效应的全基因组相关性。
Mol Biol Evol. 2021 Sep 27;38(10):4588-4602. doi: 10.1093/molbev/msab162.