• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于高通量测序数据的基因型模型选择进行 SNP calling。

SNP calling using genotype model selection on high-throughput sequencing data.

机构信息

Department of Statistical Science, School of Mathematics and Computational Science, Sun Yat-Sen University, Guangzhou 510275, China.

出版信息

Bioinformatics. 2012 Mar 1;28(5):643-50. doi: 10.1093/bioinformatics/bts001. Epub 2012 Jan 16.

DOI:10.1093/bioinformatics/bts001
PMID:22253293
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3338331/
Abstract

MOTIVATION

A review of the available single nucleotide polymorphism (SNP) calling procedures for Illumina high-throughput sequencing (HTS) platform data reveals that most rely mainly on base-calling and mapping qualities as sources of error when calling SNPs. Thus, errors not involved in base-calling or alignment, such as those in genomic sample preparation, are not accounted for.

RESULTS

A novel method of consensus and SNP calling, Genotype Model Selection (GeMS), is given which accounts for the errors that occur during the preparation of the genomic sample. Simulations and real data analyses indicate that GeMS has the best performance balance of sensitivity and positive predictive value among the tested SNP callers.

AVAILABILITY

The GeMS package can be downloaded from https://sites.google.com/a/bioinformatics.ucr.edu/xinping-cui/home/software or http://computationalbioenergy.org/software.html.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

对现有的用于 Illumina 高通量测序 (HTS) 平台数据的单核苷酸多态性 (SNP) 调用程序的回顾表明,大多数程序主要依赖碱基调用和映射质量作为 SNP 调用的误差源。因此,未涉及碱基调用或比对的误差,例如基因组样本制备中的误差,未被考虑在内。

结果

给出了一种新的共识和 SNP 调用方法,基因型模型选择 (GeMS),它考虑了基因组样本制备过程中发生的误差。模拟和真实数据分析表明,GeMS 在测试的 SNP 调用程序中具有最佳的灵敏度和阳性预测值的性能平衡。

可用性

GeMS 软件包可从 https://sites.google.com/a/bioinformatics.ucr.edu/xinping-cui/home/software 或 http://computationalbioenergy.org/software.html 下载。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
SNP calling using genotype model selection on high-throughput sequencing data.基于高通量测序数据的基因型模型选择进行 SNP calling。
Bioinformatics. 2012 Mar 1;28(5):643-50. doi: 10.1093/bioinformatics/bts001. Epub 2012 Jan 16.
2
MultiGeMS: detection of SNVs from multiple samples using model selection on high-throughput sequencing data.MultiGeMS:利用高通量测序数据的模型选择从多个样本中检测单核苷酸变异
Bioinformatics. 2016 May 15;32(10):1486-92. doi: 10.1093/bioinformatics/btv753. Epub 2016 Jan 18.
3
TotalReCaller: improved accuracy and performance via integrated alignment and base-calling.TotalReCaller:通过集成的对准和碱基调用提高准确性和性能。
Bioinformatics. 2011 Sep 1;27(17):2330-7. doi: 10.1093/bioinformatics/btr393. Epub 2011 Jun 30.
4
Generation of SNP datasets for orangutan population genomics using improved reduced-representation sequencing and direct comparisons of SNP calling algorithms.利用改良的简化代表性测序和 SNP 调用算法的直接比较,生成猩猩群体基因组学的 SNP 数据集。
BMC Genomics. 2014 Jan 10;15:16. doi: 10.1186/1471-2164-15-16.
5
PhredEM: a phred-score-informed genotype-calling approach for next-generation sequencing studies.PhredEM:一种用于下一代测序研究的基于Phred分数的基因型分型方法。
Genet Epidemiol. 2017 Jul;41(5):375-387. doi: 10.1002/gepi.22048. Epub 2017 May 31.
6
An empirical Bayes method for genotyping and SNP detection using multi-sample next-generation sequencing data.基于多样本下一代测序数据的基因分型和 SNP 检测的经验贝叶斯方法。
Bioinformatics. 2016 Nov 1;32(21):3240-3245. doi: 10.1093/bioinformatics/btw409. Epub 2016 Jul 4.
7
Review of alignment and SNP calling algorithms for next-generation sequencing data.下一代测序数据的比对和单核苷酸多态性(SNP)检测算法综述。
J Appl Genet. 2016 Feb;57(1):71-9. doi: 10.1007/s13353-015-0292-7. Epub 2015 Jun 9.
8
Consensus Genotyper for Exome Sequencing (CGES): improving the quality of exome variant genotypes.外显子组测序一致性基因分型器(CGES):提高外显子组变异基因型的质量
Bioinformatics. 2015 Jan 15;31(2):187-93. doi: 10.1093/bioinformatics/btu591. Epub 2014 Sep 29.
9
M(3): an improved SNP calling algorithm for Illumina BeadArray data.M(3):一种用于 Illumina BeadArray 数据的 SNP 调用算法的改进。
Bioinformatics. 2012 Feb 1;28(3):358-65. doi: 10.1093/bioinformatics/btr673. Epub 2011 Dec 8.
10
Toward better understanding of artifacts in variant calling from high-coverage samples.为了更好地理解高覆盖样本中变体调用中的伪影。
Bioinformatics. 2014 Oct 15;30(20):2843-51. doi: 10.1093/bioinformatics/btu356. Epub 2014 Jun 27.

引用本文的文献

1
A high-throughput SNP discovery strategy for RNA-seq data.一种用于 RNA-seq 数据的高通量 SNP 发现策略。
BMC Genomics. 2019 Feb 27;20(1):160. doi: 10.1186/s12864-019-5533-4.
2
SNP Discovery Using a Pangenome: Has the Single Reference Approach Become Obsolete?利用泛基因组进行单核苷酸多态性(SNP)发现:单一参考方法是否已过时?
Biology (Basel). 2017 Mar 11;6(1):21. doi: 10.3390/biology6010021.
3
Single-cell SNP analyses and interpretations based on RNA-Seq data for colon cancer research.基于RNA测序数据的单细胞SNP分析及解读在结肠癌研究中的应用
Sci Rep. 2016 Sep 28;6:34420. doi: 10.1038/srep34420.
4
SNVSniffer: an integrated caller for germline and somatic single-nucleotide and indel mutations.SNVSniffer:一种用于种系和体细胞单核苷酸及插入缺失突变的综合检测工具。
BMC Syst Biol. 2016 Aug 1;10 Suppl 2(Suppl 2):47. doi: 10.1186/s12918-016-0300-5.
5
Monovar: single-nucleotide variant detection in single cells.Monovar:单细胞中的单核苷酸变异检测
Nat Methods. 2016 Jun;13(6):505-7. doi: 10.1038/nmeth.3835. Epub 2016 Apr 18.
6
MultiGeMS: detection of SNVs from multiple samples using model selection on high-throughput sequencing data.MultiGeMS:利用高通量测序数据的模型选择从多个样本中检测单核苷酸变异
Bioinformatics. 2016 May 15;32(10):1486-92. doi: 10.1093/bioinformatics/btv753. Epub 2016 Jan 18.
7
An investigation of causes of false positive single nucleotide polymorphisms using simulated reads from a small eukaryote genome.利用来自小型真核生物基因组的模拟读数对单核苷酸多态性假阳性原因的调查。
BMC Bioinformatics. 2015 Nov 11;16:382. doi: 10.1186/s12859-015-0801-z.
8
A hidden Markov approach for ascertaining cSNP genotypes from RNA sequence data in the presence of allelic imbalance by exploiting linkage disequilibrium.一种通过利用连锁不平衡,在存在等位基因不平衡的情况下从RNA序列数据确定cSNP基因型的隐马尔可夫方法。
BMC Bioinformatics. 2015 Feb 22;16:61. doi: 10.1186/s12859-015-0479-2.
9
SNPest: a probabilistic graphical model for estimating genotypes.SNPest:一种用于估计基因型的概率图模型。
BMC Res Notes. 2014 Oct 7;7:698. doi: 10.1186/1756-0500-7-698.
10
From genes to health - challenges and opportunities.从基因到健康——挑战与机遇。
Front Pediatr. 2014 Mar 3;2:12. doi: 10.3389/fped.2014.00012. eCollection 2014.

本文引用的文献

1
A framework for variation discovery and genotyping using next-generation DNA sequencing data.利用下一代 DNA 测序数据进行变异发现和基因分型的框架。
Nat Genet. 2011 May;43(5):491-8. doi: 10.1038/ng.806. Epub 2011 Apr 10.
2
SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples.从多个二倍体样本的低覆盖测序数据中进行 SNP 检测和基因分型。
Genome Res. 2011 Jun;21(6):952-60. doi: 10.1101/gr.113084.110. Epub 2010 Oct 27.
3
Dindel: accurate indel calls from short-read data.Dindel:从短读数据中进行精确的插入缺失突变(Indel)调用。
Genome Res. 2011 Jun;21(6):961-73. doi: 10.1101/gr.112326.110. Epub 2010 Oct 27.
4
Improved variant discovery through local re-alignment of short-read next-generation sequencing data using SRMA.利用 SRMA 对短读长下一代测序数据进行局部重比对以提高变异发现。
Genome Biol. 2010;11(10):R99. doi: 10.1186/gb-2010-11-10-r99. Epub 2010 Oct 8.
5
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.基因组分析工具包:一种用于分析下一代 DNA 测序数据的 MapReduce 框架。
Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.
6
Personal genome sequencing: current approaches and challenges.个人基因组测序:当前方法与挑战。
Genes Dev. 2010 Mar 1;24(5):423-31. doi: 10.1101/gad.1864110.
7
High quality SNP calling using Illumina data at shallow coverage.使用 Illumina 数据进行低深度覆盖的高质量 SNP 调用。
Bioinformatics. 2010 Apr 15;26(8):1029-35. doi: 10.1093/bioinformatics/btq092. Epub 2010 Feb 26.
8
SNVMix: predicting single nucleotide variants from next-generation sequencing of tumors.SNVMix:从肿瘤的下一代测序中预测单核苷酸变异。
Bioinformatics. 2010 Mar 15;26(6):730-6. doi: 10.1093/bioinformatics/btq040. Epub 2010 Feb 3.
9
Fast and accurate long-read alignment with Burrows-Wheeler transform.基于 Burrows-Wheeler 变换的快速准确长读比对。
Bioinformatics. 2010 Mar 1;26(5):589-95. doi: 10.1093/bioinformatics/btp698. Epub 2010 Jan 15.
10
A SNP discovery method to assess variant allele probability from next-generation resequencing data.一种 SNP 发现方法,可从下一代重测序数据中评估变异等位基因的概率。
Genome Res. 2010 Feb;20(2):273-80. doi: 10.1101/gr.096388.109. Epub 2009 Dec 17.