• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

KATK:直接从未映射测序reads 中快速进行稀有变体的基因分型。

KATK: Fast genotyping of rare variants directly from unmapped sequencing reads.

机构信息

Department of Bioinformatics, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia.

出版信息

Hum Mutat. 2021 Jun;42(6):777-786. doi: 10.1002/humu.24197. Epub 2021 Apr 1.

DOI:10.1002/humu.24197
PMID:33715282
Abstract

KATK is a fast and accurate software tool for calling variants directly from raw next-generation sequencing reads. It uses predefined k-mers to retrieve only the reads of interest from the FASTQ file and calls genotypes by aligning retrieved reads locally. KATK does not use data about known polymorphisms and has NC (no call) as the default genotype. The reference or variant allele is called only if there is sufficient evidence for their presence in data. Thus it is not biased against rare variants or de-novo mutations. With simulated datasets, we achieved a false-negative rate of 0.23% (sensitivity 99.77%) and a false discovery rate of 0.19%. Calling all human exonic regions with KATK requires 1-2 h, depending on sequencing coverage.

摘要

KATK 是一款快速、准确的软件工具,可直接从原始的下一代测序读取中调用变体。它使用预定义的 k- mers 从 FASTQ 文件中仅检索感兴趣的读取,并通过局部对齐检索的读取来调用基因型。KATK 不使用关于已知多态性的数据,并且 NC(无调用)是默认的基因型。只有在数据中存在足够的证据表明存在参考或变异等位基因时,才会调用它们。因此,它不会对罕见变体或从头突变产生偏见。在模拟数据集上,我们的假阴性率为 0.23%(灵敏度 99.77%),假阳性率为 0.19%。使用 KATK 调用所有人类外显子区域需要 1-2 小时,具体取决于测序覆盖率。

相似文献

1
KATK: Fast genotyping of rare variants directly from unmapped sequencing reads.KATK:直接从未映射测序reads 中快速进行稀有变体的基因分型。
Hum Mutat. 2021 Jun;42(6):777-786. doi: 10.1002/humu.24197. Epub 2021 Apr 1.
2
FastGT: an alignment-free method for calling common SNVs directly from raw sequencing reads.FastGT:一种从原始测序读段中直接调用常见单核苷酸变异(SNVs)的无需比对方法。
Sci Rep. 2017 May 31;7(1):2537. doi: 10.1038/s41598-017-02487-5.
3
UNDR ROVER - a fast and accurate variant caller for targeted DNA sequencing.UNDR ROVER——一种用于靶向DNA测序的快速且准确的变异检测工具。
BMC Bioinformatics. 2016 Apr 16;17:165. doi: 10.1186/s12859-016-1014-9.
4
Calling known variants and identifying new variants while rapidly aligning sequence data.在快速对齐序列数据的同时,调用已知变异体并识别新变异体。
J Dairy Sci. 2019 Apr;102(4):3216-3229. doi: 10.3168/jds.2018-15172. Epub 2019 Feb 14.
5
The challenge of detecting indels in bacterial genomes from short-read sequencing data.从短读长测序数据中检测细菌基因组插入缺失的挑战。
J Biotechnol. 2017 May 20;250:11-15. doi: 10.1016/j.jbiotec.2017.02.026. Epub 2017 Mar 4.
6
One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies.一刀切并不适用——RefEditor:构建个性化二倍体参考基因组以改善下一代测序研究中的读段映射和基因型调用
PLoS Comput Biol. 2015 Aug 12;11(8):e1004448. doi: 10.1371/journal.pcbi.1004448. eCollection 2015 Aug.
7
Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications.整合基于图谱、组装和单倍型的方法以在临床测序应用中进行变异检测。
Nat Genet. 2014 Aug;46(8):912-918. doi: 10.1038/ng.3036. Epub 2014 Jul 13.
8
Assessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samples.使用270个HapMap样本评估基因分型算法BRLMM对Affymetrix GeneChip Human Mapping 500 K芯片组的批次效应。
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S17. doi: 10.1186/1471-2105-9-S9-S17.
9
Precise detection of de novo single nucleotide variants in human genomes.精准检测人类基因组中的新单核苷酸变异。
Proc Natl Acad Sci U S A. 2018 May 22;115(21):5516-5521. doi: 10.1073/pnas.1802244115. Epub 2018 May 7.
10
PhredEM: a phred-score-informed genotype-calling approach for next-generation sequencing studies.PhredEM:一种用于下一代测序研究的基于Phred分数的基因型分型方法。
Genet Epidemiol. 2017 Jul;41(5):375-387. doi: 10.1002/gepi.22048. Epub 2017 May 31.

引用本文的文献

1
Y-mer: a k-mer based method for determining human Y chromosome haplogroups from ultra-low sequencing depth data.Y-mer:一种基于k-mer的方法,用于从超低测序深度数据中确定人类Y染色体单倍群。
Genome Biol. 2025 Aug 12;26(1):243. doi: 10.1186/s13059-025-03714-3.
2
K-mer-based Approaches to Bridging Pangenomics and Population Genetics.基于K-mer的泛基因组学与群体遗传学关联方法。
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf047.
3
GeneToCN: an alignment-free method for gene copy number estimation directly from next-generation sequencing reads.
GeneToCN:一种从下一代测序读取中直接进行基因拷贝数估计的无比对方法。
Sci Rep. 2023 Oct 18;13(1):17765. doi: 10.1038/s41598-023-44636-z.