• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用区域等位基因频率辅助插入和缺失分析。

Assisting the analysis of insertions and deletions using regional allele frequencies.

机构信息

Center for Precision Genetics and Genomics, Department of Medicine, Columbia University, New York, NY, USA.

Division of Nephrology, Department of Medicine, Columbia University, New York, NY, USA.

出版信息

Funct Integr Genomics. 2024 May 20;24(3):104. doi: 10.1007/s10142-024-01358-3.

DOI:10.1007/s10142-024-01358-3
PMID:38764005
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11414712/
Abstract

Accurate estimation of population allele frequency (AF) is crucial for gene discovery and genetic diagnostics. However, determining AF for frameshift-inducing small insertions and deletions (indels) faces challenges due to discrepancies in mapping and variant calling methods. Here, we propose an innovative approach to assess indel AF. We developed CRAFTS-indels (Calculating Regional Allele Frequency Targeting Small indels), an algorithm that combines AF of distinct indels within a given region and provides "regional AF" (rAF). We tested and validated CRAFTS-indels using three independent datasets: gnomAD v2 (n=125,748 samples), an internal dataset (IGM; n=39,367), and the UK BioBank (UKBB; n=469,835). By comparing rAF against standard AF, we identified rare indels with rAF exceeding standard AF (sAF≤10 and rAF>10) as "rAF-hi" indels. Notably, a high percentage of rare indels were "rAF-hi", with a higher proportion in gnomAD v2 (11-20%) and IGM (11-22%) compared to the UKBB (5-9% depending on the CRAFTS-indels' parameters). Analysis of the overlap of regions based on their rAF with low complexity regions and with ClinVar classification supported the pertinence of rAF. Using the internal dataset, we illustrated the utility of CRAFTS-indel in the analysis of de novo variants and the potential negative impact of rAF-hi indels in gene discovery. In summary, annotation of indels with cohort specific rAF can be used to handle some of the limitations of current annotation pipelines and facilitate detection of novel gene disease associations. CRAFTS-indels offers a user-friendly approach to providing rAF annotation. It can be integrated into public databases such as gnomAD, UKBB and used by ClinVar to revise indel classifications.

摘要

准确估计人群等位基因频率(AF)对于基因发现和遗传诊断至关重要。然而,由于在映射和变异调用方法上的差异,对于导致移码的小插入和缺失(indels)的 AF 确定存在挑战。在这里,我们提出了一种评估 indel AF 的创新方法。我们开发了 CRAFTS-indels(计算靶向小 indels 的区域等位基因频率),这是一种算法,它结合了给定区域内不同 indels 的 AF,并提供了“区域 AF”(rAF)。我们使用三个独立的数据集:gnomAD v2(n=125748 个样本)、内部数据集(IGM;n=39367)和英国生物银行(UKBB;n=469835)对 CRAFTS-indels 进行了测试和验证。通过比较 rAF 与标准 AF,我们确定了 rAF 超过标准 AF(sAF≤10 和 rAF>10)的罕见 indels 为“rAF-hi”indels。值得注意的是,高比例的罕见 indels 是“rAF-hi”,在 gnomAD v2(11-20%)和 IGM(11-22%)中比 UKBB(5-9%,具体取决于 CRAFTS-indels 的参数)更高。基于其 rAF 与低复杂度区域和 ClinVar 分类的重叠分析支持了 rAF 的相关性。使用内部数据集,我们说明了 CRAFTS-indel 在分析新生变体和 rAF-hi indels 在基因发现中的潜在负面影响方面的效用。总之,使用特定于队列的 rAF 注释 indels 可以用于处理当前注释管道的一些限制,并促进新的基因疾病关联的检测。CRAFTS-indels 提供了一种用户友好的方法来提供 rAF 注释。它可以集成到 gnomAD、UKBB 等公共数据库中,并由 ClinVar 用于修改 indel 分类。

相似文献

1
Assisting the analysis of insertions and deletions using regional allele frequencies.利用区域等位基因频率辅助插入和缺失分析。
Funct Integr Genomics. 2024 May 20;24(3):104. doi: 10.1007/s10142-024-01358-3.
2
Evaluation of in silico pathogenicity prediction tools for the classification of small in-frame indels.小框移码突变的计算机预测工具的分类评价。
BMC Med Genomics. 2023 Feb 28;16(1):36. doi: 10.1186/s12920-023-01454-6.
3
Vindel: a simple pipeline for checking indel redundancy.Vindel:一个用于检查插入缺失冗余的简单流程。
BMC Bioinformatics. 2014 Nov 19;15(1):359. doi: 10.1186/s12859-014-0359-1.
4
A universal algorithm for de novo decrypting of heterozygous indel sequences: a tool for personalized medicine.一种用于从头解密杂合插入缺失序列的通用算法:个性化医疗的工具。
Clin Chim Acta. 2008 Mar;389(1-2):7-13. doi: 10.1016/j.cca.2007.11.011. Epub 2007 Nov 23.
5
Optimized detection of insertions/deletions (INDELs) in whole-exome sequencing data.全外显子组测序数据中插入/缺失(INDELs)的优化检测
PLoS One. 2017 Aug 9;12(8):e0182272. doi: 10.1371/journal.pone.0182272. eCollection 2017.
6
SIFT Indel: predictions for the functional effects of amino acid insertions/deletions in proteins.SIFT Indel:蛋白质中氨基酸插入/缺失功能效应的预测。
PLoS One. 2013 Oct 23;8(10):e77940. doi: 10.1371/journal.pone.0077940. eCollection 2013.
7
Detection and characterization of small insertion and deletion genetic variants in modern layer chicken genomes.现代蛋鸡基因组中小插入和缺失遗传变异的检测与特征分析。
BMC Genomics. 2015 Jul 31;16:562. doi: 10.1186/s12864-015-1711-1.
8
Equivalent indels--ambiguous functional classes and redundancy in databases.等效缺失-插入突变--数据库中功能类别不明确和冗余。
PLoS One. 2013 May 2;8(5):e62803. doi: 10.1371/journal.pone.0062803. Print 2013.
9
Small Insertions and Deletions Drive Genomic Plasticity during Adaptive Evolution of Yersinia pestis.小型插入和缺失驱动鼠疫耶尔森氏菌适应进化中的基因组可塑性。
Microbiol Spectr. 2022 Jun 29;10(3):e0224221. doi: 10.1128/spectrum.02242-21. Epub 2022 Apr 19.
10
DDIG-in: discriminating between disease-associated and neutral non-frameshifting micro-indels.DDIG-in:区分疾病相关和中性非移码微插入缺失
Genome Biol. 2013 Mar 13;14(3):R23. doi: 10.1186/gb-2013-14-3-r23.

引用本文的文献

1
Exome analysis links kidney malformations to developmental disorders and reveals causal genes.外显子组分析将肾脏畸形与发育障碍联系起来,并揭示了致病基因。
Nat Commun. 2025 Aug 7;16(1):7290. doi: 10.1038/s41467-025-62319-3.

本文引用的文献

1
A complete reference genome improves analysis of human genetic variation.完整的参考基因组提高了人类遗传变异分析的能力。
Science. 2022 Apr;376(6588):eabl3533. doi: 10.1126/science.abl3533. Epub 2022 Apr 1.
2
Exome sequencing and analysis of 454,787 UK Biobank participants.外显子组测序和分析 454787 名英国生物银行参与者。
Nature. 2021 Nov;599(7886):628-634. doi: 10.1038/s41586-021-04103-z. Epub 2021 Oct 18.
3
Loss-of-function and missense variants in NSD2 cause decreased methylation activity and are associated with a distinct developmental phenotype.
NSD2 的功能丧失和错义变异导致甲基化活性降低,并与独特的发育表型相关。
Genet Med. 2021 Aug;23(8):1474-1483. doi: 10.1038/s41436-021-01158-1. Epub 2021 May 3.
4
ATAV: a comprehensive platform for population-scale genomic analyses.ATAV:一个用于全人群基因组分析的综合平台。
BMC Bioinformatics. 2021 Mar 23;22(1):149. doi: 10.1186/s12859-021-04071-1.
5
Twelve years of SAMtools and BCFtools.SAMtools 和 BCFtools 十二年。
Gigascience. 2021 Feb 16;10(2). doi: 10.1093/gigascience/giab008.
6
Accurate, scalable cohort variant calls using DeepVariant and GLnexus.使用DeepVariant和GLnexus进行准确、可扩展的队列变异检测。
Bioinformatics. 2021 Apr 5;36(24):5582-5589. doi: 10.1093/bioinformatics/btaa1081.
7
Hotspots of Human Mutation.人类突变热点。
Trends Genet. 2021 Aug;37(8):717-729. doi: 10.1016/j.tig.2020.10.003. Epub 2020 Nov 13.
8
Exome sequencing and characterization of 49,960 individuals in the UK Biobank.英国生物银行中 49960 人的外显子组测序和特征分析。
Nature. 2020 Oct;586(7831):749-756. doi: 10.1038/s41586-020-2853-0. Epub 2020 Oct 21.
9
The Human Gene Mutation Database (HGMD): optimizing its use in a clinical diagnostic or research setting.人类基因突变数据库(HGMD):优化其在临床诊断或研究环境中的使用。
Hum Genet. 2020 Oct;139(10):1197-1207. doi: 10.1007/s00439-020-02199-3. Epub 2020 Jun 28.
10
The mutational constraint spectrum quantified from variation in 141,456 humans.从 141456 名人类个体的变异中量化的突变约束谱。
Nature. 2020 May;581(7809):434-443. doi: 10.1038/s41586-020-2308-7. Epub 2020 May 27.