• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

揭示混合模型中等可能性估计值的范围。

Revealing the range of equally likely estimates in the admixture model.

作者信息

Heinzel Carola Sophia, Baumdicker Franz, Pfaffelhuber Peter

机构信息

Department of Mathematical Stochastics, Albert-Ludwigs-University Freiburg, Freiburg im Breisgau 79104, Germany.

Cluster of Excellence "Controlling Microbes to Fight Infections", Mathematical and Computational Population Genetics, University of Tübingen, Sand 14, Tübingen 72076, Germany.

出版信息

G3 (Bethesda). 2025 Aug 6;15(8). doi: 10.1093/g3journal/jkaf142.

DOI:10.1093/g3journal/jkaf142
PMID:40534544
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12341866/
Abstract

Many ancestry inference tools, including Structure and Admixture, rely on the admixture model to infer both, allele frequencies p and individual admixture proportions q for a collection of individuals relative to a set of hypothetical ancestral populations. We show that under realistic conditions the likelihood in the admixture model is typically flat in some direction around a maximum-likelihood estimate (q^,p^). In particular, the maximum-likelihood estimator is nonunique and there is a complete spectrum of possible estimates. Common inference tools typically identify only a few points within this spectrum. We provide an algorithm which computes the set of equally likely (q,p), when starting from (q^,p^). It is analytic for K=2 ancestral populations and numeric for K>2. We apply our algorithm to data from the 1000 genomes project, and show that inter-European estimators of q can come with a large set of equally likely possibilities. In general, markers with large allele frequency differences between populations in combination with individuals with concentrated admixture proportions lead to small areas with a flat likelihood. Our findings imply that care must be taken when interpreting results from STRUCTURE and ADMIXTURE if populations are not separated well enough.

摘要

许多血统推断工具,包括Structure和Admixture,都依赖于混合模型来推断相对于一组假设的祖先群体的个体集合的等位基因频率p和个体混合比例q。我们表明,在现实条件下,混合模型中的似然性在最大似然估计(q^,p^)周围的某些方向上通常是平坦的。特别是,最大似然估计器是非唯一的,并且存在完整的可能估计范围。常见的推断工具通常只识别该范围内的几个点。我们提供了一种算法,当从(q^,p^)开始时,该算法计算等可能的(q,p)集合。对于K = 2个祖先群体,它是解析的,对于K>2则是数值的。我们将我们的算法应用于千人基因组计划的数据,并表明欧洲内部q的估计值可能伴随着大量等可能的可能性。一般来说,群体之间等位基因频率差异大的标记与混合比例集中的个体相结合,会导致似然性平坦的小区域。我们的发现意味着,如果群体分离得不够好,在解释Structure和Admixture的结果时必须谨慎。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/6f06eb78fb6c/jkaf142a2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/aeae92873414/jkaf142f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/24c507e02c1e/jkaf142f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/ac8ce0f073f0/jkaf142f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/cc92d18b6f98/jkaf142f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/2cc5f67683d0/jkaf142a1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/6f06eb78fb6c/jkaf142a2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/aeae92873414/jkaf142f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/24c507e02c1e/jkaf142f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/ac8ce0f073f0/jkaf142f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/cc92d18b6f98/jkaf142f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/2cc5f67683d0/jkaf142a1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/096a/12341866/6f06eb78fb6c/jkaf142a2.jpg

相似文献

1
Revealing the range of equally likely estimates in the admixture model.揭示混合模型中等可能性估计值的范围。
G3 (Bethesda). 2025 Aug 6;15(8). doi: 10.1093/g3journal/jkaf142.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
Short-Term Memory Impairment短期记忆障碍
4
Sexual Harassment and Prevention Training性骚扰与预防培训
5
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
7
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
8
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.
9
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
10
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.

本文引用的文献

1
Inferring ancestry with the hierarchical soft clustering approach tangleGen.使用分层软聚类方法tangleGen推断血统。
Genome Res. 2024 Dec 23;34(12):2244-2255. doi: 10.1101/gr.279399.124.
2
Clumppling: cluster matching and permutation program with integer linear programming.Clumppling:使用整数线性规划进行聚类匹配和排列程序。
Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btad751.
3
Neural ADMIXTURE for rapid genomic clustering.用于快速基因组聚类的神经混合模型
Nat Comput Sci. 2023 Jul;3(7):621-629. doi: 10.1038/s43588-023-00482-7. Epub 2023 Jul 6.
4
Unsupervised discovery of ancestry-informative markers and genetic admixture proportions in biobank-scale datasets.无监督发现生物库规模数据集的祖先信息标记和遗传混合比例。
Am J Hum Genet. 2023 Feb 2;110(2):314-325. doi: 10.1016/j.ajhg.2022.12.008. Epub 2023 Jan 6.
5
Review of the Forensic Applicability of Biostatistical Methods for Inferring Ancestry from Autosomal Genetic Markers.审查从常染色体遗传标记推断祖先的生物统计学方法的法医学适用性。
Genes (Basel). 2022 Jan 14;13(1):141. doi: 10.3390/genes13010141.
6
Philippine Ayta possess the highest level of Denisovan ancestry in the world.菲律宾阿塔人拥有世界上最高水平的丹尼索瓦人遗传血统。
Curr Biol. 2021 Oct 11;31(19):4219-4230.e10. doi: 10.1016/j.cub.2021.07.022. Epub 2021 Aug 12.
7
Population genomics of the Viking world.北欧海盗世界的人口基因组学。
Nature. 2020 Sep;585(7825):390-396. doi: 10.1038/s41586-020-2688-8. Epub 2020 Sep 16.
8
Large-Scale Comparative Analyses of Tick Genomes Elucidate Their Genetic Diversity and Vector Capacities.大规模比较分析蜱虫基因组,阐明其遗传多样性和媒介能力。
Cell. 2020 Sep 3;182(5):1328-1340.e13. doi: 10.1016/j.cell.2020.07.023. Epub 2020 Aug 18.
9
Predicting geographic location from genetic variation with deep neural networks.利用深度神经网络从遗传变异中预测地理位置。
Elife. 2020 Jun 8;9:e54507. doi: 10.7554/eLife.54507.
10
Evaluation of model fit of inferred admixture proportions.推断的混合比例模型拟合度评估。
Mol Ecol Resour. 2020 Jul;20(4):936-949. doi: 10.1111/1755-0998.13171. Epub 2020 May 25.