• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Psi-Caller:一种基于短读长的轻量级变异检测工具,具有高速度和准确性。

Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy.

作者信息

Liu Yadong, Jiang Tao, Gao Yan, Liu Bo, Zang Tianyi, Wang Yadong

机构信息

Center for Bioinformatics, Faculty of Computing, Harbin Institute of Technology, Harbin, China.

出版信息

Front Cell Dev Biol. 2021 Aug 13;9:731424. doi: 10.3389/fcell.2021.731424. eCollection 2021.

DOI:10.3389/fcell.2021.731424
PMID:34485311
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8414796/
Abstract

With the rapid development of short-read sequencing technologies, many population-scale resequencing studies have been carried out to study the associations between human genome variants and various phenotypes in recent years. Variant calling is one of the core bioinformatics tasks in such studies to comprehensively discover genomic variants in sequenced samples. Many efforts have been made to develop short read-based variant calling approaches; however, state-of-the-art tools are still computationally expensive. Meanwhile, cutting-edge genomics studies also have higher requirements on the yields of variant calling. Herein, we propose Partial-Order Alignment-based single nucleotide polymorphism (SNV) and Indel caller (Psi-caller), a lightweight variant calling algorithm that simultaneously achieves high performance and yield. Mainly, Psi-caller recognizes and divides the candidate variant site into three categories according to the complexity and location of the signatures and employs various methods including binomial model, partial-order alignment, and de Bruijn graph-based local assembly to handle various categories of candidate variant sites to call and genotype SNVs/Indels, respectively. Benchmarks on simulated and real short-read sequencing data sets demonstrate that Psi-caller is times faster than state-of-the-art tools with higher or equal sensitivity and accuracy. It has the potential to well handle large-scale data sets in cutting-edge genomics studies.

摘要

随着短读长测序技术的快速发展,近年来开展了许多群体规模的重测序研究,以探究人类基因组变异与各种表型之间的关联。变异检测是此类研究中的核心生物信息学任务之一,用于全面发现测序样本中的基因组变异。人们已做出诸多努力来开发基于短读长的变异检测方法;然而,目前最先进的工具在计算上仍然成本高昂。与此同时,前沿的基因组学研究对变异检测的产量也有更高要求。在此,我们提出了基于偏序比对的单核苷酸多态性(SNV)和插入缺失检测工具(Psi-caller),这是一种轻量级的变异检测算法,能同时实现高性能和高产量。主要而言,Psi-caller根据特征的复杂性和位置将候选变异位点识别并划分为三类,并采用包括二项式模型、偏序比对和基于德布鲁因图的局部组装等多种方法来处理各类候选变异位点,分别对SNV/插入缺失进行检测和基因分型。在模拟和真实的短读长测序数据集上的基准测试表明,Psi-caller比最先进的工具快数倍,且具有更高或相当的灵敏度和准确性。它有潜力很好地处理前沿基因组学研究中的大规模数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02d9/8414796/cdffce9e72cd/fcell-09-731424-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02d9/8414796/7b1b9fb48650/fcell-09-731424-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02d9/8414796/66d59acb3967/fcell-09-731424-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02d9/8414796/80eee4ccc0d4/fcell-09-731424-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02d9/8414796/cdffce9e72cd/fcell-09-731424-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02d9/8414796/7b1b9fb48650/fcell-09-731424-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02d9/8414796/66d59acb3967/fcell-09-731424-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02d9/8414796/80eee4ccc0d4/fcell-09-731424-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02d9/8414796/cdffce9e72cd/fcell-09-731424-g004.jpg

相似文献

1
Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy.Psi-Caller:一种基于短读长的轻量级变异检测工具,具有高速度和准确性。
Front Cell Dev Biol. 2021 Aug 13;9:731424. doi: 10.3389/fcell.2021.731424. eCollection 2021.
2
SNVSniffer: an integrated caller for germline and somatic single-nucleotide and indel mutations.SNVSniffer:一种用于种系和体细胞单核苷酸及插入缺失突变的综合检测工具。
BMC Syst Biol. 2016 Aug 1;10 Suppl 2(Suppl 2):47. doi: 10.1186/s12918-016-0300-5.
3
miniSNV: accurate and fast single nucleotide variant calling from nanopore sequencing data.miniSNV:从纳米孔测序数据中进行准确快速的单核苷酸变异calling。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae473.
4
The impact of FASTQ and alignment read order on structural variant calling from long-read sequencing data.FASTQ 和比对读序对长读测序数据结构变异调用的影响。
PeerJ. 2024 Mar 15;12:e17101. doi: 10.7717/peerj.17101. eCollection 2024.
5
Genome-wide somatic variant calling using localized colored de Bruijn graphs.使用局部彩色德布鲁因图进行全基因组体细胞变异检测
Commun Biol. 2018 Mar 22;1:20. doi: 10.1038/s42003-018-0023-9. eCollection 2018.
6
Microindel detection in short-read sequence data.短读序列数据中的微缺失/插入检测。
Bioinformatics. 2010 Mar 15;26(6):722-9. doi: 10.1093/bioinformatics/btq027. Epub 2010 Feb 9.
7
Impact of post-alignment processing in variant discovery from whole exome data.全外显子数据变异发现中比对后处理的影响
BMC Bioinformatics. 2016 Oct 3;17(1):403. doi: 10.1186/s12859-016-1279-z.
8
A universal SNP and small-indel variant caller using deep neural networks.使用深度神经网络的通用 SNP 和小插入缺失变体调用器。
Nat Biotechnol. 2018 Nov;36(10):983-987. doi: 10.1038/nbt.4235. Epub 2018 Sep 24.
9
Evaluating the performance of tools used to call minority variants from whole genome short-read data.评估用于从全基因组短读数据中检测罕见变异的工具的性能。
Wellcome Open Res. 2018 Sep 13;3:21. doi: 10.12688/wellcomeopenres.13538.2. eCollection 2018.
10
ACMGA: a reference-free multiple-genome alignment pipeline for plant species.ACMGA:一种用于植物物种的无参考多基因组比对管道。
BMC Genomics. 2024 May 25;25(1):515. doi: 10.1186/s12864-024-10430-y.

引用本文的文献

1
CDFA: Calibrated deep feature aggregation for screening synergistic drug combinations.CDFA:用于筛选协同药物组合的校准深度特征聚合
Front Pharmacol. 2025 Jul 23;16:1608832. doi: 10.3389/fphar.2025.1608832. eCollection 2025.
2
miniSNV: accurate and fast single nucleotide variant calling from nanopore sequencing data.miniSNV:从纳米孔测序数据中进行准确快速的单核苷酸变异calling。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae473.
3
HapKled: a haplotype-aware structural variant calling approach for Oxford nanopore sequencing data.

本文引用的文献

1
abPOA: an SIMD-based C library for fast partial order alignment using adaptive band.abPOA:一个基于 SIMD 的 C 库,用于使用自适应带实现快速偏序比对。
Bioinformatics. 2021 Aug 9;37(15):2209-2211. doi: 10.1093/bioinformatics/btaa963.
2
Large-Scale Whole-Genome Sequencing of Three Diverse Asian Populations in Singapore.新加坡三个不同亚洲人群的大规模全基因组测序。
Cell. 2019 Oct 17;179(3):736-749.e15. doi: 10.1016/j.cell.2019.09.019.
3
Comprehensive evaluation of structural variation detection algorithms for whole genome sequencing.
HapKled:一种用于牛津纳米孔测序数据的单倍型感知结构变异检测方法。
Front Genet. 2024 Jul 9;15:1435087. doi: 10.3389/fgene.2024.1435087. eCollection 2024.
4
Kled: an ultra-fast and sensitive structural variant detection tool for long-read sequencing data.Kled:一种用于长读测序数据的超快速和敏感的结构变异检测工具。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae049.
5
Cloud-native distributed genomic pileup operations.云原生分布式基因组堆积操作。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac804.
6
Genome sequence assembly algorithms and misassembly identification methods.基因组序列组装算法和错误组装识别方法。
Mol Biol Rep. 2022 Nov;49(11):11133-11148. doi: 10.1007/s11033-022-07919-8. Epub 2022 Sep 23.
全基因组测序结构变异检测算法的综合评估。
Genome Biol. 2019 Jun 3;20(1):117. doi: 10.1186/s13059-019-1720-5.
4
simuG: a general-purpose genome simulator.simuG:一种通用的基因组模拟程序。
Bioinformatics. 2019 Nov 1;35(21):4442-4444. doi: 10.1093/bioinformatics/btz424.
5
Publisher Correction: DNA sequencing at 40: past, present and future.出版商更正:40 年的 DNA 测序:过去、现在与未来。
Nature. 2019 Apr;568(7752):E11. doi: 10.1038/s41586-019-1120-8.
6
Minimap2: pairwise alignment for nucleotide sequences.Minimap2:核苷酸序列的两两比对。
Bioinformatics. 2018 Sep 15;34(18):3094-3100. doi: 10.1093/bioinformatics/bty191.
7
Introducing difference recurrence relations for faster semi-global alignment of long sequences.引入差异递归关系以加快长序列的半全局比对。
BMC Bioinformatics. 2018 Feb 19;19(Suppl 1):45. doi: 10.1186/s12859-018-2014-8.
8
A Comprehensive Workflow for Read Depth-Based Identification of Copy-Number Variation from Whole-Genome Sequence Data.基于读深度的全基因组序列数据拷贝数变异识别的综合工作流程。
Am J Hum Genet. 2018 Jan 4;102(1):142-155. doi: 10.1016/j.ajhg.2017.12.007.
9
The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community.牛津纳米孔MinION测序仪:将纳米孔测序技术带给基因组学界。
Genome Biol. 2016 Nov 25;17(1):239. doi: 10.1186/s13059-016-1103-0.
10
Coming of age: ten years of next-generation sequencing technologies.成年:下一代测序技术的十年
Nat Rev Genet. 2016 May 17;17(6):333-51. doi: 10.1038/nrg.2016.49.