Network-based hierarchical population structure analysis for large genomic data sets.

Affiliations

Department of Biology, Stanford University, Stanford, California 94305, USA.

Department of Computer Science, Ben-Gurion University of the Negev, Be'er-Sheva, 8410501, Israel.

Publication information

Genome Res. 2019 Dec;29(12):2020-2033. doi: 10.1101/gr.250092.119. Epub 2019 Nov 6.

DOI:10.1101/gr.250092.119
PMID:31694865
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6886512/
Abstract

Analysis of population structure in natural populations using genetic data is a common practice in ecological and evolutionary studies. With large genomic data sets of populations now appearing more frequently across the taxonomic spectrum, it is becoming increasingly possible to reveal many hierarchical levels of structure, including fine-scale genetic clusters. To analyze these data sets, methods need to be appropriately suited to the challenges of extracting multilevel structure from whole-genome data. Here, we present a network-based approach for constructing population structure representations from genetic data. The use of community-detection algorithms from network theory generates a natural hierarchical perspective on the representation that the method produces. The method is computationally efficient, and it requires relatively few assumptions regarding the biological processes that underlie the data. We show the approach by analyzing population structure in the model plant species Arabidopsis thaliana and in human populations. These examples illustrate how network-based approaches for population structure analysis are well-suited to extracting valuable ecological and evolutionary information in the era of large genomic data sets.
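The general idea in the abstract (build a network of individuals from genetic data, then read clusters off it at several levels) can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not specify the similarity measure or the community-detection algorithm, so the sketch assumes a simple allele-sharing similarity and, as a deliberately simplified stand-in for community detection, takes connected components after pruning edges below increasing thresholds. The function names, the thresholds, and the toy genotypes are all hypothetical.

```python
from itertools import combinations


def allele_sharing(g1, g2):
    """Mean identity-by-state similarity of two genotype vectors (allele counts 0, 1, 2)."""
    return sum(1.0 - abs(a - b) / 2.0 for a, b in zip(g1, g2)) / len(g1)


def similarity_network(genotypes):
    """Weighted edges {(i, j): similarity} over all pairs of individuals."""
    names = sorted(genotypes)
    return {(a, b): allele_sharing(genotypes[a], genotypes[b])
            for a, b in combinations(names, 2)}


def connected_components(nodes, edges):
    """Group nodes into connected components with a small union-find."""
    parent = {n: n for n in nodes}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path halving
            n = parent[n]
        return n

    for a, b in edges:
        parent[find(a)] = find(b)

    groups = {}
    for n in nodes:
        groups.setdefault(find(n), []).append(n)
    return sorted(sorted(g) for g in groups.values())


def hierarchical_clusters(genotypes, thresholds):
    """For each threshold (ascending), keep edges with weight >= threshold
    and report the resulting clusters. Stricter thresholds split clusters
    found at looser ones, which gives a nested (hierarchical) view."""
    edges = similarity_network(genotypes)
    nodes = sorted(genotypes)
    levels = []
    for t in sorted(thresholds):
        kept = [pair for pair, weight in edges.items() if weight >= t]
        levels.append((t, connected_components(nodes, kept)))
    return levels


# Hypothetical toy data: five individuals genotyped at four biallelic sites.
toy = {
    "ind1": [0, 0, 0, 0],
    "ind2": [0, 0, 0, 1],
    "ind3": [2, 2, 0, 0],
    "ind4": [2, 2, 0, 1],
    "ind5": [2, 2, 2, 2],
}

for threshold, clusters in hierarchical_clusters(toy, [0.3, 0.6, 0.8]):
    print(threshold, clusters)
```

On the toy data the loosest threshold leaves all five individuals in one cluster, and tighter thresholds split it into progressively finer groups. A modularity-based community-detection routine could replace `connected_components` without changing the rest of the sketch.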

Similar articles

1
Network-based hierarchical population structure analysis for large genomic data sets.
Genome Res. 2019 Dec;29(12):2020-2033. doi: 10.1101/gr.250092.119. Epub 2019 Nov 6.
2
3
Resolving the structure of interactomes with hierarchical agglomerative clustering.
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S44. doi: 10.1186/1471-2105-12-S1-S44.
4
Modern technologies and algorithms for scaffolding assembled genomes.
PLoS Comput Biol. 2019 Jun 5;15(6):e1006994. doi: 10.1371/journal.pcbi.1006994. eCollection 2019 Jun.
5
Hidden Markov Models in Population Genomics.
Methods Mol Biol. 2017;1552:149-164. doi: 10.1007/978-1-4939-6753-7_11.
6
Methods and models for unravelling human evolutionary history.
Nat Rev Genet. 2015 Dec;16(12):727-40. doi: 10.1038/nrg4005. Epub 2015 Nov 10.
7
NetView: a high-definition network-visualization approach to detect fine-scale population structures from genome-wide patterns of variation.
PLoS One. 2012;7(10):e48375. doi: 10.1371/journal.pone.0048375. Epub 2012 Oct 31.
8
9
Large-scale compression of genomic sequence databases with the Burrows-Wheeler transform.
Bioinformatics. 2012 Jun 1;28(11):1415-9. doi: 10.1093/bioinformatics/bts173. Epub 2012 May 3.
10
PopNet: A Markov Clustering Approach to Study Population Genetic Structure.
Mol Biol Evol. 2017 Jul 1;34(7):1799-1811. doi: 10.1093/molbev/msx110.

Cited by

1
Evolutionary Influences on Local Patterns of Genetic Relatedness.
bioRxiv. 2025 May 28:2025.05.02.651970. doi: 10.1101/2025.05.02.651970.
2
The multi-scale complexity of human genetic variation beyond continental groups.
bioRxiv. 2024 Dec 16:2024.12.11.627824. doi: 10.1101/2024.12.11.627824.
3
Ancient genomics support deep divergence between Eastern and Western Mediterranean Indo-European languages.

References

1
Recent Evolutionary History of Tigers Highlights Contrasting Roles of Genetic Drift and Selection.
Mol Biol Evol. 2021 May 19;38(6):2366-2379. doi: 10.1093/molbev/msab032.
2
On the postglacial spread of human commensal Arabidopsis thaliana: journey to the East.
New Phytol. 2019 May;222(3):1447-1457. doi: 10.1111/nph.15682. Epub 2019 Feb 12.
3
Insights into Platypus Population Structure and History from Whole-Genome Sequencing.
bioRxiv. 2024 Dec 2:2024.12.02.626332. doi: 10.1101/2024.12.02.626332.
4
A Comprehensive Analysis of 3 Moroccan Genomes Revealed Contributions From Both African and European Ancestries.
Evol Bioinform Online. 2024 Feb 6;20:11769343241229278. doi: 10.1177/11769343241229278. eCollection 2024.
5
Population genomics of post-glacial western Eurasia.
Nature. 2024 Jan;625(7994):301-311. doi: 10.1038/s41586-023-06865-0. Epub 2024 Jan 10.
6
A rarefaction approach for measuring population differences in rare and common variation.
Genetics. 2023 May 26;224(2). doi: 10.1093/genetics/iyad070.
7
Prenatal Genetic Testing in the Era of Next Generation Sequencing: A One-Center Canadian Experience.
Genes (Basel). 2022 Nov 3;13(11):2019. doi: 10.3390/genes13112019.
8
Extracting hierarchical features of cultural variation using network-based clustering.
Evol Hum Sci. 2022;4. doi: 10.1017/ehs.2022.15. Epub 2022 May 2.
Mol Biol Evol. 2018 May 1;35(5):1238-1252. doi: 10.1093/molbev/msy041.
4
CONE: Community Oriented Network Estimation Is a Versatile Framework for Inferring Population Structure in Large-Scale Sequencing Data.
G3 (Bethesda). 2017 Oct 5;7(10):3359-3377. doi: 10.1534/g3.117.300131.
5
African genomes illuminate the early history and transition to selfing in Arabidopsis thaliana.
Proc Natl Acad Sci U S A. 2017 May 16;114(20):5213-5218. doi: 10.1073/pnas.1616736114. Epub 2017 May 4.
6
Application of network methods for understanding evolutionary dynamics in discrete habitats.
Mol Ecol. 2017 Jun;26(11):2850-2863. doi: 10.1111/mec.14059. Epub 2017 Mar 23.
7
On the post-glacial spread of human commensal Arabidopsis thaliana.
Nat Commun. 2017 Feb 9;8:14458. doi: 10.1038/ncomms14458.
8
Clustering of 770,000 genomes reveals post-colonial population structure of North America.
Nat Commun. 2017 Feb 7;8:14238. doi: 10.1038/ncomms14238.
9
Scaling probabilistic models of genetic variation to millions of humans.
Nat Genet. 2016 Dec;48(12):1587-1590. doi: 10.1038/ng.3710. Epub 2016 Nov 7.
10
Recent advances in the study of fine-scale population structure in humans.
Curr Opin Genet Dev. 2016 Dec;41:98-105. doi: 10.1016/j.gde.2016.08.007. Epub 2016 Sep 20.