• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

distAngsd:用于下一代测序数据的快速准确的遗传距离推断。

distAngsd: Fast and Accurate Inference of Genetic Distances for Next-Generation Sequencing Data.

机构信息

Section for Geogenetics, Globe Institute, University of Copenhagen, Øster Voldgade 5-7, 1350 København K, Denmark.

Department of Integrative Biology, University of California, 3040 Valley Life Sciences Building 3140, Berkeley, CA 94720-3140, USA.

出版信息

Mol Biol Evol. 2022 Jun 2;39(6). doi: 10.1093/molbev/msac119.

DOI:10.1093/molbev/msac119
PMID:35647675
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9234764/
Abstract

Commonly used methods for inferring phylogenies were designed before the emergence of high-throughput sequencing and can generally not accommodate the challenges associated with noisy, diploid sequencing data. In many applications, diploid genomes are still treated as haploid through the use of ambiguity characters; while the uncertainty in genotype calling-arising as a consequence of the sequencing technology-is ignored. In order to address this problem, we describe two new probabilistic approaches for estimating genetic distances: distAngsd-geno and distAngsd-nuc, both implemented in a software suite named distAngsd. These methods are specifically designed for next-generation sequencing data, utilize the full information from the data, and take uncertainty in genotype calling into account. Through extensive simulations, we show that these new methods are markedly more accurate and have more stable statistical behaviors than other currently available methods for estimating genetic distances-even for very low depth data with high error rates.

摘要

常用的推断系统发育的方法是在高通量测序出现之前设计的,通常无法适应与嘈杂的、二倍体测序数据相关的挑战。在许多应用中,二倍体基因组仍然通过使用歧义字符被视为单倍体;而由于测序技术而产生的基因型调用的不确定性则被忽略。为了解决这个问题,我们描述了两种用于估计遗传距离的新的概率方法:distAngsd-geno 和 distAngsd-nuc,这两种方法都在一个名为 distAngsd 的软件套件中实现。这些方法是专门为下一代测序数据设计的,利用数据的全部信息,并考虑基因型调用的不确定性。通过广泛的模拟,我们表明,这些新方法比其他现有的用于估计遗传距离的方法更准确,并且具有更稳定的统计行为,即使是在错误率很高的深度很低的数据中也是如此。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/e968cc31b92d/msac119f8a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/fb4318d6f4d9/msac119f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/3ff824b07442/msac119f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/12f0d64bb4d7/msac119f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/dfbadf2e3234/msac119f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/c37bf8aefe58/msac119f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/9caed1a56270/msac119f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/0ef62ed6b8da/msac119f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/e968cc31b92d/msac119f8a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/fb4318d6f4d9/msac119f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/3ff824b07442/msac119f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/12f0d64bb4d7/msac119f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/dfbadf2e3234/msac119f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/c37bf8aefe58/msac119f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/9caed1a56270/msac119f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/0ef62ed6b8da/msac119f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d61e/9234764/e968cc31b92d/msac119f8a.jpg

相似文献

1
distAngsd: Fast and Accurate Inference of Genetic Distances for Next-Generation Sequencing Data.distAngsd:用于下一代测序数据的快速准确的遗传距离推断。
Mol Biol Evol. 2022 Jun 2;39(6). doi: 10.1093/molbev/msac119.
2
polyRAD: Genotype Calling with Uncertainty from Sequencing Data in Polyploids and Diploids.polyRAD:多倍体和二倍体测序数据不确定性下的基因型分型
G3 (Bethesda). 2019 Mar 7;9(3):663-673. doi: 10.1534/g3.118.200913.
3
PhredEM: a phred-score-informed genotype-calling approach for next-generation sequencing studies.PhredEM:一种用于下一代测序研究的基于Phred分数的基因型分型方法。
Genet Epidemiol. 2017 Jul;41(5):375-387. doi: 10.1002/gepi.22048. Epub 2017 May 31.
4
One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies.一刀切并不适用——RefEditor:构建个性化二倍体参考基因组以改善下一代测序研究中的读段映射和基因型调用
PLoS Comput Biol. 2015 Aug 12;11(8):e1004448. doi: 10.1371/journal.pcbi.1004448. eCollection 2015 Aug.
5
SNPest: a probabilistic graphical model for estimating genotypes.SNPest:一种用于估计基因型的概率图模型。
BMC Res Notes. 2014 Oct 7;7:698. doi: 10.1186/1756-0500-7-698.
6
Fast and accurate estimation of multidimensional site frequency spectra from low-coverage high-throughput sequencing data.从低覆盖高通量测序数据中快速准确地估计多维位点频率谱。
Gigascience. 2022 May 17;11. doi: 10.1093/gigascience/giac032.
7
Longshot enables accurate variant calling in diploid genomes from single-molecule long read sequencing.Longshot 可通过单分子长读测序对二倍体基因组进行准确的变异调用。
Nat Commun. 2019 Oct 11;10(1):4660. doi: 10.1038/s41467-019-12493-y.
8
Review of alignment and SNP calling algorithms for next-generation sequencing data.下一代测序数据的比对和单核苷酸多态性(SNP)检测算法综述。
J Appl Genet. 2016 Feb;57(1):71-9. doi: 10.1007/s13353-015-0292-7. Epub 2015 Jun 9.
9
Robust inference of population structure from next-generation sequencing data with systematic differences in sequencing.有系统测序差异的下一代测序数据中群体结构的稳健推断
Bioinformatics. 2018 Apr 1;34(7):1157-1163. doi: 10.1093/bioinformatics/btx708.
10
ngsTools: methods for population genetics analyses from next-generation sequencing data.ngsTools:从下一代测序数据中进行群体遗传学分析的方法。
Bioinformatics. 2014 May 15;30(10):1486-7. doi: 10.1093/bioinformatics/btu041. Epub 2014 Jan 23.

引用本文的文献

1
Postglacial Echoes: Parasite Genomics Uncover Environmental Changes in Postglacial European Lakes.冰期后的回响:寄生虫基因组学揭示欧洲冰期后湖泊的环境变化
Mol Ecol. 2025 Sep;34(17):e70039. doi: 10.1111/mec.70039. Epub 2025 Jul 16.
2
sandwrm: an R package for estimating Wright's neighborhood size and species-level genetic diversity.Sandwrm:一个用于估计赖特邻域大小和物种水平遗传多样性的R软件包。
bioRxiv. 2025 May 24:2025.05.19.654925. doi: 10.1101/2025.05.19.654925.
3
Machine learning solutions for integrating partially overlapping genetic datasets and modelling host-endophyte effects in ryegrass () dry matter yield estimation.

本文引用的文献

1
Genome-wide RAD sequencing resolves the evolutionary history of serrate leaf Juniperus and reveals discordance with chloroplast phylogeny.全基因组 RAD 测序解析锯齿叶崖柏的进化历史,并揭示与叶绿体系统发育的不一致性。
Mol Phylogenet Evol. 2021 Mar;156:107022. doi: 10.1016/j.ympev.2020.107022. Epub 2020 Nov 24.
2
Ancient DNA indicates human population shifts and admixture in northern and southern China.古 DNA 表明中国北方和南方的人口迁移和混合。
Science. 2020 Jul 17;369(6501):282-288. doi: 10.1126/science.aba0909. Epub 2020 May 14.
3
Revisiting the Phylogenetic History of Helminths Through Genomics, the Case of the New Genome.
用于整合部分重叠遗传数据集并模拟黑麦草宿主-内生菌效应以估计干物质产量的机器学习解决方案。
Front Plant Sci. 2025 May 6;16:1543956. doi: 10.3389/fpls.2025.1543956. eCollection 2025.
4
vcfgl: a flexible genotype likelihood simulator for VCF/BCF files.vcfgl:用于VCF/BCF文件的灵活基因型似然模拟器。
Bioinformatics. 2025 Mar 29;41(4). doi: 10.1093/bioinformatics/btaf098.
5
Scalable neighbour search and alignment with uvaia.使用uvaia进行可扩展的邻域搜索和比对。
PeerJ. 2024 Mar 6;12:e16890. doi: 10.7717/peerj.16890. eCollection 2024.
通过基因组学重新审视蠕虫的系统发育史:新基因组的案例
Front Genet. 2019 Aug 7;10:708. doi: 10.3389/fgene.2019.00708. eCollection 2019.
4
The Genome Landscape of Tibetan Sheep Reveals Adaptive Introgression from Argali and the History of Early Human Settlements on the Qinghai-Tibetan Plateau.藏绵羊基因组全景揭示了盘羊的适应性渗入和青藏高原早期人类定居的历史。
Mol Biol Evol. 2019 Feb 1;36(2):283-303. doi: 10.1093/molbev/msy208.
5
Whole-genome sequencing of the blue whale and other rorquals finds signatures for introgressive gene flow.对蓝鲸和其他须鲸的全基因组测序发现了渐渗基因流的特征。
Sci Adv. 2018 Apr 4;4(4):eaap9873. doi: 10.1126/sciadv.aap9873. eCollection 2018 Apr.
6
Ancient genomes revisit the ancestry of domestic and Przewalski's horses.古代基因组重新探讨了家马和普氏野马的祖先。
Science. 2018 Apr 6;360(6384):111-114. doi: 10.1126/science.aao3297. Epub 2018 Feb 22.
7
Multiple Origin but Single Domestication Led to .多起源但单驯化导致了…… (原文似乎不完整)
G3 (Bethesda). 2018 Mar 2;8(3):797-803. doi: 10.1534/g3.117.300334.
8
Speciation Generates Mosaic Genomes in Kangaroos.有袋动物的物种形成产生镶嵌基因组。
Genome Biol Evol. 2018 Jan 1;10(1):33-44. doi: 10.1093/gbe/evx245.
9
Phylogenomic inferences from reference-mapped and de novo assembled short-read sequence data using RADseq sequencing of California white oaks (Quercus section Quercus).基于 RADseq 测序的加利福尼亚白橡树(栎属栎亚属)参考映射和从头组装短读序列数据的系统发育基因组推断。
Genome. 2017 Sep;60(9):743-755. doi: 10.1139/gen-2016-0202. Epub 2017 Mar 29.
10
Genomic insights into the peopling of the Southwest Pacific.西南太平洋地区人类定居的基因组学见解。
Nature. 2016 Oct 27;538(7626):510-513. doi: 10.1038/nature19844. Epub 2016 Oct 3.