• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

为全基因组数据集设计的等位基因年龄估计器在应用于全外显子组数据集时,性能仅略有下降。

Allele age estimators designed for whole-genome datasets show only a moderate reduction in performance when applied to whole-exome datasets.

作者信息

Pivirotto Alyssa, Peles Noah, Hey Jody

机构信息

Department of Biology, Temple University, Philadelphia, PA 19122, USA.

Library & Information Technology Services, Bryn Mawr College, Bryn Mawr, PA 19010, USA.

出版信息

G3 (Bethesda). 2025 Jun 4;15(6). doi: 10.1093/g3journal/jkaf088.

DOI:10.1093/g3journal/jkaf088
PMID:40238934
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12135005/
Abstract

As personalized genomics becomes more affordable, larger numbers of rare variants are being discovered, leading to important initiatives in identifying the functional impacts in relation to disease phenotypes. One way to characterize these variants is to estimate the time the mutation entered the population. However, allele age estimators such as those implemented in the programs Relate, Genealogical Estimator of Variant Age, and Runtc were developed based on the assumption that datasets include the entire genome. We examined the performance of each of these estimators on simulated exome data under a neutral constant population size model, as well as under population expansion and background selection models. We found that each provides usable estimates of allele age from whole-exome datasets. Relate performs the best amongst all 3 estimators with Pearson coefficients of 0.83 and 0.73 (with respect to true simulated values for neutral constant and expansion population models, respectively) with a 12% and 20% decrease in correlation between whole-genome and whole-exome estimations. Of the 3 estimators, Relate is best able to parallelize to yield quick results with little resources; however, Relate is currently only able to scale to thousands of samples making it unable to match the hundreds of thousands of samples being currently released. While more work is needed to expand the capabilities of current methods of estimating allele age, these methods show a modest decrease in performance in the estimation of the age of mutations.

摘要

随着个性化基因组学的成本越来越低,人们发现了大量罕见变异,这引发了一些重要举措,旨在确定这些变异与疾病表型相关的功能影响。表征这些变异的一种方法是估计突变进入群体的时间。然而,诸如Relate、变异年龄谱系估计器和Runtc等程序中实现的等位基因年龄估计器是基于数据集包含整个基因组这一假设开发的。我们在中性恒定群体大小模型以及群体扩张和背景选择模型下,研究了这些估计器在模拟外显子组数据上的性能。我们发现,每个估计器都能从全外显子组数据集中提供可用的等位基因年龄估计。在所有3个估计器中,Relate的表现最佳,其皮尔逊系数分别为0.83和0.73(分别相对于中性恒定和扩张群体模型的真实模拟值),全基因组估计和全外显子组估计之间的相关性分别下降了12%和20%。在这3个估计器中,Relate最能够并行化以用很少的资源快速得出结果;然而,Relate目前只能扩展到数千个样本,这使其无法匹配当前发布的数十万个样本。虽然需要更多工作来扩展当前估计等位基因年龄方法的能力,但这些方法在估计突变年龄时性能仅略有下降。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02c/12135005/8e2293ceb2ec/jkaf088f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02c/12135005/5c376f363cfd/jkaf088f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02c/12135005/58bbb7c758f9/jkaf088f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02c/12135005/04d74e7a6f4a/jkaf088f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02c/12135005/1a4bd7c76294/jkaf088f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02c/12135005/8e2293ceb2ec/jkaf088f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02c/12135005/5c376f363cfd/jkaf088f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02c/12135005/58bbb7c758f9/jkaf088f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02c/12135005/04d74e7a6f4a/jkaf088f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02c/12135005/1a4bd7c76294/jkaf088f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e02c/12135005/8e2293ceb2ec/jkaf088f5.jpg

相似文献

1
Allele age estimators designed for whole-genome datasets show only a moderate reduction in performance when applied to whole-exome datasets.为全基因组数据集设计的等位基因年龄估计器在应用于全外显子组数据集时,性能仅略有下降。
G3 (Bethesda). 2025 Jun 4;15(6). doi: 10.1093/g3journal/jkaf088.
2
Allele age estimators designed for whole genome datasets show only a moderate reduction in performance when applied to whole exome datasets.为全基因组数据集设计的等位基因年龄估计器在应用于全外显子组数据集时,性能仅出现适度下降。
bioRxiv. 2025 Mar 3:2024.02.01.578465. doi: 10.1101/2024.02.01.578465.
3
JWES: a new pipeline for whole genome/exome sequence data processing, management, and gene-variant discovery, annotation, prediction, and genotyping.JWES:一个用于全基因组/外显子组序列数据处理、管理以及基因变异发现、注释、预测和基因分型的新管道。
FEBS Open Bio. 2021 Sep;11(9):2441-2452. doi: 10.1002/2211-5463.13261. Epub 2021 Aug 11.
4
Benchmarking of variant calling software for whole-exome sequencing using gold standard datasets.使用金标准数据集对全外显子组测序的变异检测软件进行基准测试。
Sci Rep. 2025 Apr 21;15(1):13697. doi: 10.1038/s41598-025-97047-7.
5
Performance assessment of variant calling pipelines using human whole exome sequencing and simulated data.使用人类全外显子组测序和模拟数据评估变异调用管道的性能。
BMC Bioinformatics. 2019 Jun 17;20(1):342. doi: 10.1186/s12859-019-2928-9.
6
Evaluation of whole exome sequencing as an alternative to BeadChip and whole genome sequencing in human population genetic analysis.评估全外显子组测序作为人类群体遗传分析中替代 BeadChip 和全基因组测序的方法。
BMC Genomics. 2018 Oct 29;19(1):778. doi: 10.1186/s12864-018-5168-x.
7
Interplay between probe design and test performance: overlap between genomic regions of interest, capture regions and high quality reference calls influence performance of WES-based assays.探针设计与测试性能的相互作用:目标基因组区域、捕获区域和高质量参考调用之间的重叠,会影响基于 WES 的检测方法的性能。
Hum Genet. 2021 Feb;140(2):289-297. doi: 10.1007/s00439-020-02201-y. Epub 2020 Jul 5.
8
Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples.对 746 例癌症样本中的全外显子组和全基因组突变进行回顾性评估。
Nat Commun. 2020 Sep 21;11(1):4748. doi: 10.1038/s41467-020-18151-y.
9
Whole-Exome Sequencing in Czech Patients with Neurogenetic Diseases.捷克神经遗传疾病患者的全外显子组测序。
Genet Test Mol Biomarkers. 2020 May;24(5):264-273. doi: 10.1089/gtmb.2019.0232. Epub 2020 Apr 7.
10
ALPHLARD: a Bayesian method for analyzing HLA genes from whole genome sequence data.ALPHLARD:一种用于分析全基因组序列数据中 HLA 基因的贝叶斯方法。
BMC Genomics. 2018 Nov 1;19(1):790. doi: 10.1186/s12864-018-5169-9.

本文引用的文献

1
A genomic mutational constraint map using variation in 76,156 human genomes.基于 76156 个人类基因组的变异,绘制出基因组突变约束图谱。
Nature. 2024 Jan;625(7993):92-100. doi: 10.1038/s41586-023-06045-0. Epub 2023 Dec 6.
2
Multiple Sources of Uncertainty Confound Inference of Historical Human Generation Times.多种不确定性来源使历史人类世代推断变得复杂。
Mol Biol Evol. 2023 Aug 3;40(8). doi: 10.1093/molbev/msad160.
3
Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits.生物银行规模下的祖先重组图推断实现了复杂性状的系谱分析。
Nat Genet. 2023 May;55(5):768-776. doi: 10.1038/s41588-023-01379-x. Epub 2023 May 1.
4
Evaluation of methods for estimating coalescence times using ancestral recombination graphs.使用祖先重组图估计合并时间的方法评估
Genetics. 2022 May 5;221(1). doi: 10.1093/genetics/iyac044.
5
A unified genealogy of modern and ancient genomes.现代和古代基因组的统一族谱。
Science. 2022 Feb 25;375(6583):eabi8264. doi: 10.1126/science.abi8264.
6
The history and geographic distribution of a KCNQ1 atrial fibrillation risk allele.KCNQ1 心房颤动风险等位基因的历史和地理分布。
Nat Commun. 2021 Nov 8;12(1):6442. doi: 10.1038/s41467-021-26741-7.
7
Exome sequencing and analysis of 454,787 UK Biobank participants.外显子组测序和分析 454787 名英国生物银行参与者。
Nature. 2021 Nov;599(7886):628-634. doi: 10.1038/s41586-021-04103-z. Epub 2021 Oct 18.
8
Rare variant contribution to human disease in 281,104 UK Biobank exomes.281104 名英国生物银行外显子组中罕见变异对人类疾病的贡献。
Nature. 2021 Sep;597(7877):527-532. doi: 10.1038/s41586-021-03855-y. Epub 2021 Aug 10.
9
Recent ultra-rare inherited variants implicate new autism candidate risk genes.近期发现的超罕见遗传变异揭示了新的自闭症候选风险基因。
Nat Genet. 2021 Aug;53(8):1125-1134. doi: 10.1038/s41588-021-00899-8. Epub 2021 Jul 26.
10
KwARG: parsimonious reconstruction of ancestral recombination graphs with recurrent mutation.KwARG:具有反复突变的祖先重组图的简约重建
Bioinformatics. 2021 Oct 11;37(19):3277-3284. doi: 10.1093/bioinformatics/btab351.