• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用生成对抗网络自动推断人口统计学参数。

Automatic inference of demographic parameters using generative adversarial networks.

机构信息

Department of Computer Science, Haverford College, Haverford, PA, USA.

Department of Computer Science, Swarthmore College, Swarthmore, PA, USA.

出版信息

Mol Ecol Resour. 2021 Nov;21(8):2689-2705. doi: 10.1111/1755-0998.13386. Epub 2021 May 3.

DOI:10.1111/1755-0998.13386
PMID:33745225
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8596911/
Abstract

Population genetics relies heavily on simulated data for validation, inference and intuition. In particular, since the evolutionary 'ground truth' for real data is always limited, simulated data are crucial for training supervised machine learning methods. Simulation software can accurately model evolutionary processes but requires many hand-selected input parameters. As a result, simulated data often fail to mirror the properties of real genetic data, which limits the scope of methods that rely on it. Here, we develop a novel approach to estimating parameters in population genetic models that automatically adapts to data from any population. Our method, pg-gan, is based on a generative adversarial network that gradually learns to generate realistic synthetic data. We demonstrate that our method is able to recover input parameters in a simulated isolation-with-migration model. We then apply our method to human data from the 1000 Genomes Project and show that we can accurately recapitulate the features of real data.

摘要

群体遗传学在很大程度上依赖模拟数据进行验证、推理和直观理解。特别是,由于真实数据的进化“真实情况”总是有限的,因此模拟数据对于训练监督机器学习方法至关重要。模拟软件可以准确地模拟进化过程,但需要许多手动选择的输入参数。因此,模拟数据往往无法反映真实遗传数据的特性,这限制了依赖于模拟数据的方法的应用范围。在这里,我们开发了一种新的方法来估计群体遗传模型中的参数,该方法可以自动适应来自任何群体的数据。我们的方法 pg-gan 基于生成式对抗网络,该网络逐渐学会生成逼真的合成数据。我们证明了我们的方法能够在模拟的隔离-迁移模型中恢复输入参数。然后,我们将我们的方法应用于 1000 基因组计划中的人类数据,并表明我们可以准确地再现真实数据的特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/ce18e34aa600/MEN-21-2689-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/50511b296104/MEN-21-2689-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/b9cbf67916da/MEN-21-2689-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/a03de4acee9a/MEN-21-2689-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/920ae1a00108/MEN-21-2689-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/6a3b4c52aa9e/MEN-21-2689-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/f7abd3c50778/MEN-21-2689-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/a270a4089511/MEN-21-2689-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/9601f3101049/MEN-21-2689-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/720e43321718/MEN-21-2689-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/ce18e34aa600/MEN-21-2689-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/50511b296104/MEN-21-2689-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/b9cbf67916da/MEN-21-2689-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/a03de4acee9a/MEN-21-2689-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/920ae1a00108/MEN-21-2689-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/6a3b4c52aa9e/MEN-21-2689-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/f7abd3c50778/MEN-21-2689-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/a270a4089511/MEN-21-2689-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/9601f3101049/MEN-21-2689-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/720e43321718/MEN-21-2689-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4020/8596911/ce18e34aa600/MEN-21-2689-g009.jpg

相似文献

1
Automatic inference of demographic parameters using generative adversarial networks.使用生成对抗网络自动推断人口统计学参数。
Mol Ecol Resour. 2021 Nov;21(8):2689-2705. doi: 10.1111/1755-0998.13386. Epub 2021 May 3.
2
INTERPRETING GENERATIVE ADVERSARIAL NETWORKS TO INFER NATURAL SELECTION FROM GENETIC DATA.解读生成对抗网络以从遗传数据中推断自然选择
bioRxiv. 2023 Jul 9:2023.03.07.531546. doi: 10.1101/2023.03.07.531546.
3
Interpreting generative adversarial networks to infer natural selection from genetic data.从遗传数据推断自然选择的生成对抗网络解释。
Genetics. 2024 Apr 3;226(4). doi: 10.1093/genetics/iyae024.
4
Towards effective machine learning in medical imaging analysis: A novel approach and expert evaluation of high-grade glioma 'ground truth' simulation on MRI.迈向医学影像分析中有效的机器学习:一种新颖的方法和对 MRI 上高级别胶质瘤“真实数据”模拟的专家评估。
Int J Med Inform. 2021 Feb;146:104348. doi: 10.1016/j.ijmedinf.2020.104348. Epub 2020 Nov 27.
5
Semi-supervised segmentation of lesion from breast ultrasound images with attentional generative adversarial network.基于注意力生成对抗网络的乳腺超声图像病灶半监督分割。
Comput Methods Programs Biomed. 2020 Jun;189:105275. doi: 10.1016/j.cmpb.2019.105275. Epub 2019 Dec 12.
6
MB-GAN: Microbiome Simulation via Generative Adversarial Network.MB-GAN:基于生成对抗网络的微生物组模拟。
Gigascience. 2021 Feb 5;10(2). doi: 10.1093/gigascience/giab005.
7
Using deep learning to generate synthetic B-mode musculoskeletal ultrasound images.利用深度学习生成合成B模式肌肉骨骼超声图像。
Comput Methods Programs Biomed. 2020 Nov;196:105583. doi: 10.1016/j.cmpb.2020.105583. Epub 2020 Jun 4.
8
Improved automatic detection of herpesvirus secondary envelopment stages in electron microscopy by augmenting training data with synthetic labelled images generated by a generative adversarial network.通过使用生成对抗网络生成的合成标记图像来扩充训练数据,提高电子显微镜中疱疹病毒二次包膜阶段的自动检测。
Cell Microbiol. 2021 Feb;23(2):e13280. doi: 10.1111/cmi.13280. Epub 2020 Nov 16.
9
Conditional generative adversarial network for 3D rigid-body motion correction in MRI.条件生成对抗网络在 MRI 中用于 3D 刚体运动校正。
Magn Reson Med. 2019 Sep;82(3):901-910. doi: 10.1002/mrm.27772. Epub 2019 Apr 22.
10
Generative adversarial networks with decoder-encoder output noises.生成对抗网络与解码器编码器输出噪声。
Neural Netw. 2020 Jul;127:19-28. doi: 10.1016/j.neunet.2020.04.005. Epub 2020 Apr 9.

引用本文的文献

1
Semi-supervised detection of natural selection with positive-unlabeled learning.基于正例未标注学习的自然选择半监督检测
bioRxiv. 2025 Aug 18:2025.08.15.670602. doi: 10.1101/2025.08.15.670602.
2
On the use of generative models for evolutionary inference of malaria vectors from genomic data.关于使用生成模型从基因组数据进行疟疾病媒进化推断的研究
bioRxiv. 2025 Jun 27:2025.06.26.661760. doi: 10.1101/2025.06.26.661760.
3
Bridging Genomic Research Disparities in Osteoporosis GWAS: Insights for Diverse Populations.弥合骨质疏松症全基因组关联研究中的基因组研究差距:对不同人群的见解

本文引用的文献

1
Detecting adaptive introgression in human evolution using convolutional neural networks.使用卷积神经网络检测人类进化中的适应性基因渗入。
Elife. 2021 May 25;10:e64669. doi: 10.7554/eLife.64669.
2
The Impact of Purifying and Background Selection on the Inference of Population History: Problems and Prospects.净化和背景选择对群体历史推断的影响:问题与展望。
Mol Biol Evol. 2021 Jun 25;38(7):2986-3003. doi: 10.1093/molbev/msab050.
3
Visualizing population structure with variational autoencoders.使用变分自动编码器进行人口结构可视化。
Curr Osteoporos Rep. 2025 May 24;23(1):24. doi: 10.1007/s11914-025-00917-2.
4
ConfuseNN: Interpreting convolutional neural network inferences in population genomics with data shuffling.ConfuseNN:通过数据重排解释群体基因组学中的卷积神经网络推理
bioRxiv. 2025 Mar 27:2025.03.24.644668. doi: 10.1101/2025.03.24.644668.
5
Accessible, realistic genome simulation with selection using stdpopsim.使用stdpopsim进行具有选择的可访问、现实的基因组模拟。
bioRxiv. 2025 Mar 23:2025.03.23.644823. doi: 10.1101/2025.03.23.644823.
6
A likelihood-based framework for demographic inference from genealogical trees.一种基于似然性的从系谱树进行人口统计学推断的框架。
Nat Genet. 2025 Apr;57(4):865-874. doi: 10.1038/s41588-025-02129-x. Epub 2025 Mar 20.
7
Generative AI extracts ecological meaning from the complex three dimensional shapes of bird bills.生成式人工智能从鸟类喙部复杂的三维形状中提取生态意义。
PLoS Comput Biol. 2025 Mar 17;21(3):e1012887. doi: 10.1371/journal.pcbi.1012887. eCollection 2025 Mar.
8
Computational Genomics and Its Applications to Anthropological Questions.计算基因组学及其在人类学问题中的应用。
Am J Biol Anthropol. 2024 Dec;186 Suppl 78(Suppl 78):e70010. doi: 10.1002/ajpa.70010.
9
Digital Image Processing to Detect Adaptive Evolution.用于检测适应性进化的数字图像处理
Mol Biol Evol. 2024 Dec 6;41(12). doi: 10.1093/molbev/msae242.
10
Deep learning insights into distinct patterns of polygenic adaptation across human populations.深度学习对人类群体中多基因适应性的不同模式的见解。
Nucleic Acids Res. 2024 Dec 11;52(22):e102. doi: 10.1093/nar/gkae1027.
G3 (Bethesda). 2021 Jan 18;11(1). doi: 10.1093/g3journal/jkaa036.
4
Creating artificial human genomes using generative neural networks.使用生成式神经网络创建人工人类基因组。
PLoS Genet. 2021 Feb 4;17(2):e1009303. doi: 10.1371/journal.pgen.1009303. eCollection 2021 Feb.
5
A Likelihood-Free Inference Framework for Population Genetic Data using Exchangeable Neural Networks.一种使用可交换神经网络的群体遗传数据无似然推断框架。
Adv Neural Inf Process Syst. 2018 Dec;31:8594-8605.
6
Deep learning for population size history inference: Design, comparison and combination with approximate Bayesian computation.用于种群大小历史推断的深度学习:设计、比较以及与近似贝叶斯计算的结合
Mol Ecol Resour. 2021 Nov;21(8):2645-2660. doi: 10.1111/1755-0998.13224. Epub 2020 Jul 25.
7
A community-maintained standard library of population genetic models.一个社区维护的种群遗传模型标准库。
Elife. 2020 Jun 23;9:e54967. doi: 10.7554/eLife.54967.
8
Predicting the Landscape of Recombination Using Deep Learning.利用深度学习预测重组景观。
Mol Biol Evol. 2020 Jun 1;37(6):1790-1808. doi: 10.1093/molbev/msaa038.
9
De novo mutations across 1,465 diverse genomes reveal mutational insights and reductions in the Amish founder population.对 1465 个不同基因组的从头突变进行分析,揭示了阿什肯纳兹人群的突变特征和突变减少。
Proc Natl Acad Sci U S A. 2020 Feb 4;117(5):2560-2569. doi: 10.1073/pnas.1902766117. Epub 2020 Jan 21.
10
ImaGene: a convolutional neural network to quantify natural selection from genomic data.ImaGene:一种从基因组数据中定量自然选择的卷积神经网络。
BMC Bioinformatics. 2019 Nov 22;20(Suppl 9):337. doi: 10.1186/s12859-019-2927-x.