Suppr超能文献

PineSAP——序列比对和 SNP 识别流程。

PineSAP--sequence alignment and SNP identification pipeline.

机构信息

Department of Plant Sciences, University of California, Davis, CA 95616, USA.

出版信息

Bioinformatics. 2009 Oct 1;25(19):2609-10. doi: 10.1093/bioinformatics/btp477. Epub 2009 Aug 10.

Abstract

UNLABELLED

The Pine Alignment and SNP Identification Pipeline (PineSAP) provides a high-throughput solution to single nucleotide polymorphism (SNP) prediction using multiple sequence alignments from re-sequencing data. This pipeline integrates a hybrid of customized scripting, existing utilities and machine learning in order to increase the speed and accuracy of SNP calls. The implementation of this pipeline results in significantly improved multiple sequence alignments and SNP identifications when compared with existing solutions. The use of machine learning in the SNP identifications extends the pipeline's application to any eukaryotic species where full genome sequence information is unavailable.

AVAILABILITY

All code used for this pipeline is freely available at the Dendrome project website (http://dendrome.ucdavis.edu/adept2/resequencing.html)

摘要

未标记

Pine 对齐和 SNP 识别管道 (PineSAP) 提供了一种使用重测序数据的多序列比对来预测单核苷酸多态性 (SNP) 的高通量解决方案。该管道集成了自定义脚本、现有实用程序和机器学习的混合,以提高 SNP 调用的速度和准确性。与现有解决方案相比,该管道的实现显著提高了多序列比对和 SNP 识别的速度。在 SNP 识别中使用机器学习扩展了该管道的应用范围,可用于任何没有完整基因组序列信息的真核生物物种。

可用性

此管道使用的所有代码都可在 Dendrome 项目网站上免费获得(http://dendrome.ucdavis.edu/adept2/resequencing.html)

相似文献

1
PineSAP--sequence alignment and SNP identification pipeline.
Bioinformatics. 2009 Oct 1;25(19):2609-10. doi: 10.1093/bioinformatics/btp477. Epub 2009 Aug 10.
3
SNP-PHAGE--High throughput SNP discovery pipeline.
BMC Bioinformatics. 2006 Oct 23;7:468. doi: 10.1186/1471-2105-7-468.
4
A beginners guide to SNP calling from high-throughput DNA-sequencing data.
Hum Genet. 2012 Oct;131(10):1541-54. doi: 10.1007/s00439-012-1213-z. Epub 2012 Aug 11.
5
The Sequence Alignment/Map format and SAMtools.
Bioinformatics. 2009 Aug 15;25(16):2078-9. doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8.
6
An integrated SNP mining and utilization (ISMU) pipeline for next generation sequencing data.
PLoS One. 2014 Jul 8;9(7):e101754. doi: 10.1371/journal.pone.0101754. eCollection 2014.
7
MSQT for choosing SNP assays from multiple DNA alignments.
Bioinformatics. 2007 Oct 15;23(20):2784-7. doi: 10.1093/bioinformatics/btm428. Epub 2007 Sep 4.
8
kSNP3.0: SNP detection and phylogenetic analysis of genomes without genome alignment or reference genome.
Bioinformatics. 2015 Sep 1;31(17):2877-8. doi: 10.1093/bioinformatics/btv271. Epub 2015 Apr 25.
10
SNP mining porcine ESTs with MAVIANT, a novel tool for SNP evaluation and annotation.
Bioinformatics. 2007 Jul 1;23(13):i387-91. doi: 10.1093/bioinformatics/btm192.

引用本文的文献

1
Microbial sequence typing in the genomic era.
Infect Genet Evol. 2018 Sep;63:346-359. doi: 10.1016/j.meegid.2017.09.022. Epub 2017 Sep 21.
3
Micro- and macro-geographic scale effect on the molecular imprint of selection and adaptation in Norway spruce.
PLoS One. 2014 Dec 31;9(12):e115499. doi: 10.1371/journal.pone.0115499. eCollection 2014.
5
The evolutionary genetics of the genes underlying phenotypic associations for loblolly pine (Pinus taeda, Pinaceae).
Genetics. 2013 Dec;195(4):1353-72. doi: 10.1534/genetics.113.157198. Epub 2013 Oct 11.
6
Contrasting patterns of nucleotide diversity for four conifers of Alpine European forests.
Evol Appl. 2012 Nov;5(7):762-75. doi: 10.1111/j.1752-4571.2012.00256.x.
7
Comparative mapping in the Fagaceae and beyond with EST-SSRs.
BMC Plant Biol. 2012 Aug 29;12:153. doi: 10.1186/1471-2229-12-153.
8
EuroPineDB: a high-coverage web database for maritime pine transcriptome.
BMC Genomics. 2011 Jul 15;12:366. doi: 10.1186/1471-2164-12-366.

本文引用的文献

1
Genomics to tree breeding and forest health.
Curr Opin Genet Dev. 2007 Dec;17(6):539-44. doi: 10.1016/j.gde.2007.10.002.
2
DNA reference alignment benchmarks based on tertiary structure of encoded proteins.
Bioinformatics. 2007 Oct 1;23(19):2648-9. doi: 10.1093/bioinformatics/btm389. Epub 2007 Aug 8.
3
An enhanced RNA alignment benchmark for sequence alignment programs.
Algorithms Mol Biol. 2006 Oct 24;1:19. doi: 10.1186/1748-7188-1-19.
4
Application of machine learning in SNP discovery.
BMC Bioinformatics. 2006 Jan 6;7:4. doi: 10.1186/1471-2105-7-4.
5
SNPdetector: a software tool for sensitive and accurate SNP detection.
PLoS Comput Biol. 2005 Oct;1(5):e53. doi: 10.1371/journal.pcbi.0010053. Epub 2005 Oct 28.
6
SNP discovery using advanced algorithms and neural networks.
Bioinformatics. 2005 May 15;21(10):2528-30. doi: 10.1093/bioinformatics/bti354. Epub 2005 Mar 3.
7
ProbCons: Probabilistic consistency-based multiple sequence alignment.
Genome Res. 2005 Feb;15(2):330-40. doi: 10.1101/gr.2821705.
8
Heterogeneity detector: finding heterogeneous positions in Phred/Phrap assemblies.
Bioinformatics. 2004 Nov 1;20(16):2863-4. doi: 10.1093/bioinformatics/bth301. Epub 2004 May 6.
9
A general approach to single-nucleotide polymorphism discovery.
Nat Genet. 1999 Dec;23(4):452-6. doi: 10.1038/70570.
10
Consed: a graphical tool for sequence finishing.
Genome Res. 1998 Mar;8(3):195-202. doi: 10.1101/gr.8.3.195.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验