• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生物样本库规模数据集中有害罕见变异的研究设计与抽样

Study design and the sampling of deleterious rare variants in biobank-scale datasets.

作者信息

Steiner Margaret C, Rice Daniel P, Biddanda Arjun, Ianni-Ravn Mariadaria K, Porras Christian, Novembre John

机构信息

Department of Human Genetics, University of Chicago, Chicago, IL 60637.

Media Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139.

出版信息

bioRxiv. 2025 Jan 29:2024.12.02.626424. doi: 10.1101/2024.12.02.626424.

DOI:10.1101/2024.12.02.626424
PMID:39677632
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11642817/
Abstract

One key component of study design in population genetics is the "geographic breadth" of a sample (i.e., how broad a region across which individuals are sampled). How the geographic breadth of a sample impacts observations of rare, deleterious variants is unclear, even though such variants are of particular interest for biomedical and evolutionary applications. Here, in order to gain insight into the effects of sample design on ascertained genetic variants, we formulate a stochastic model of dispersal, genetic drift, selection, mutation, and geographically concentrated sampling. We use this model to understand the effects of the geographic breadth of sampling effort on the discovery of negatively selected variants. We find that samples which are more geographically broad will discover a greater number variants as compared geographically narrow samples (an effect we label "discovery"); though the variants will be detected at lower average frequency than in narrow samples (e.g. as singletons, an effect we label "dilution"). Importantly, these effects are amplified for larger sample sizes and moderated by the magnitude of fitness effects. We validate these results using both population genetic simulations and empirical analyses in the UK Biobank. Our results are particularly important in two contexts: the association of large-effect rare variants with particular phenotypes and the inference of negative selection from allele frequency data. Overall, our findings emphasize the importance of considering geographic breadth when designing and carrying out genetic studies, especially at biobank scale.

摘要

群体遗传学研究设计的一个关键要素是样本的“地理广度”(即个体采样所跨越的区域有多广)。尽管稀有有害变异在生物医学和进化应用中特别受关注,但样本的地理广度如何影响对这些变异的观察尚不清楚。在此,为了深入了解样本设计对已确定的遗传变异的影响,我们构建了一个关于扩散、遗传漂变、选择、突变和地理集中采样的随机模型。我们使用这个模型来理解采样工作的地理广度对负选择变异发现的影响。我们发现,与地理范围狭窄的样本相比,地理范围更广的样本将发现更多的变异(我们将这种效应称为“发现”);不过,这些变异的平均检测频率将低于狭窄样本中的变异(例如作为单例,我们将这种效应称为“稀释”)。重要的是,对于更大的样本量,这些效应会被放大,并且会受到适应度效应大小的调节。我们使用群体遗传模拟和英国生物银行的实证分析来验证这些结果。我们的结果在两种情况下尤为重要:大效应稀有变异与特定表型的关联以及从等位基因频率数据推断负选择。总体而言,我们的研究结果强调了在设计和开展遗传研究时考虑地理广度的重要性,尤其是在生物银行规模的研究中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/baab4706b7f2/nihpp-2024.12.02.626424v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/31bd7cff043a/nihpp-2024.12.02.626424v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/a753aa2f07c4/nihpp-2024.12.02.626424v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/31ddb19f8f59/nihpp-2024.12.02.626424v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/25f81bd10b0f/nihpp-2024.12.02.626424v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/23d35cbe8f86/nihpp-2024.12.02.626424v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/baab4706b7f2/nihpp-2024.12.02.626424v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/31bd7cff043a/nihpp-2024.12.02.626424v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/a753aa2f07c4/nihpp-2024.12.02.626424v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/31ddb19f8f59/nihpp-2024.12.02.626424v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/25f81bd10b0f/nihpp-2024.12.02.626424v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/23d35cbe8f86/nihpp-2024.12.02.626424v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d908/11781414/baab4706b7f2/nihpp-2024.12.02.626424v2-f0006.jpg

相似文献

1
Study design and the sampling of deleterious rare variants in biobank-scale datasets.生物样本库规模数据集中有害罕见变异的研究设计与抽样
bioRxiv. 2025 Jan 29:2024.12.02.626424. doi: 10.1101/2024.12.02.626424.
2
Study design and the sampling of deleterious rare variants in biobank-scale datasets.生物样本库规模数据集中有害罕见变异的研究设计与抽样
Proc Natl Acad Sci U S A. 2025 Jun 10;122(23):e2425196122. doi: 10.1073/pnas.2425196122. Epub 2025 Jun 3.
3
Erratum: High-Throughput Identification of Resistance to Pseudomonas syringae pv. Tomato in Tomato using Seedling Flood Assay.勘误:利用幼苗浸没法高通量鉴定番茄对丁香假单胞菌 pv.番茄的抗性。
J Vis Exp. 2023 Oct 18(200). doi: 10.3791/6576.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Novel insights into the genetics of smoking behaviour, lung function, and chronic obstructive pulmonary disease (UK BiLEVE): a genetic association study in UK Biobank.对吸烟行为、肺功能和慢性阻塞性肺疾病(英国生物银行)遗传学的新认识:英国生物银行中的一项遗传关联研究。
Lancet Respir Med. 2015 Oct;3(10):769-81. doi: 10.1016/S2213-2600(15)00283-0. Epub 2015 Sep 27.
6
Scaling the discrete-time Wright-Fisher model to biobank-scale datasets.将离散时间 Wright-Fisher 模型扩展到生物库规模数据集。
Genetics. 2023 Nov 1;225(3). doi: 10.1093/genetics/iyad168.
7
Probing the aggregated effects of purifying selection per individual on 1,380 medical phenotypes in the UK Biobank.探究个体中纯化选择对英国生物库中 1380 种医学表型的综合影响。
PLoS Genet. 2021 Jan 25;17(1):e1009337. doi: 10.1371/journal.pgen.1009337. eCollection 2021 Jan.
8
Quantification of frequency-dependent genetic architectures in 25 UK Biobank traits reveals action of negative selection.量化 25 项英国生物库特征中频率相关的遗传结构,揭示负选择的作用。
Nat Commun. 2019 Feb 15;10(1):790. doi: 10.1038/s41467-019-08424-6.
9
Qualitative Study定性研究
10
The Empirical Distribution of Singletons for Geographic Samples of DNA Sequences.DNA序列地理样本中单例的经验分布。
Front Genet. 2017 Sep 29;8:139. doi: 10.3389/fgene.2017.00139. eCollection 2017.

本文引用的文献

1
Biobanking with genetics shapes precision medicine and global health.带有遗传学的生物样本库塑造了精准医学和全球健康。
Nat Rev Genet. 2025 Mar;26(3):191-202. doi: 10.1038/s41576-024-00794-y. Epub 2024 Nov 20.
2
Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program.多样性与规模:退伍军人事务部百万退伍军人计划中2068个性状的遗传结构
Science. 2024 Jul 19;385(6706):eadj1182. doi: 10.1126/science.adj1182.
3
Analysis of 14,392 whole genomes reveals 3.5% of Qataris carry medically actionable variants.对 14392 份全基因组进行分析,结果显示 3.5%的卡塔尔人携带具有医学可操作性的变异。
Eur J Hum Genet. 2024 Nov;32(11):1465-1473. doi: 10.1038/s41431-024-01656-1. Epub 2024 Jul 17.
4
Genomic data in the All of Us Research Program.全美国研究计划中的基因组数据。
Nature. 2024 Mar;627(8003):340-346. doi: 10.1038/s41586-023-06957-x. Epub 2024 Feb 19.
5
Mexican Biobank advances population and medical genomics of diverse ancestries.墨西哥生物银行推进了具有不同祖先的人群和医学基因组学研究。
Nature. 2023 Oct;622(7984):775-783. doi: 10.1038/s41586-023-06560-0. Epub 2023 Oct 11.
6
Innovating for a Just and Equitable Future in Genomic and Precision Medicine Research.为基因组与精准医学研究的公正平等未来而创新。
Am J Bioeth. 2023 Jul;23(7):1-4. doi: 10.1080/15265161.2023.2215201.
7
Polygenic scoring accuracy varies across the genetic ancestry continuum.多基因评分准确性在遗传祖先连续体上有所差异。
Nature. 2023 Jun;618(7966):774-781. doi: 10.1038/s41586-023-06079-4. Epub 2023 May 17.
8
Future prospects for human genetics and genomics in drug discovery.人类遗传学和基因组学在药物发现中的未来前景。
Curr Opin Struct Biol. 2023 Jun;80:102568. doi: 10.1016/j.sbi.2023.102568. Epub 2023 Mar 22.
9
Polygenic architecture of rare coding variation across 394,783 exomes.394,783 个外显子中罕见编码变异的多基因结构。
Nature. 2023 Feb;614(7948):492-499. doi: 10.1038/s41586-022-05684-z. Epub 2023 Feb 8.
10
Deleterious Variation in Natural Populations and Implications for Conservation Genetics.自然种群中的有害变异及其对保护遗传学的影响。
Annu Rev Anim Biosci. 2023 Feb 15;11:93-114. doi: 10.1146/annurev-animal-080522-093311. Epub 2022 Nov 4.