Hansen Nancy F
National Human Genome Research Institute, Rockville, MD, USA.
Methods Mol Biol. 2016;1418:209-24. doi: 10.1007/978-1-4939-3578-9_11.
The use of next generation nucleotide sequencing to discover and genotype small sequence variants has led to numerous insights into the molecular causes of various diseases. This chapter describes the use of freely available software to align next generation sequencing reads to a reference and then to use the resulting alignments to call, annotate, view, and filter small sequence variants. The suggested variant calling workflow includes read alignment with novoalign, the removal of polymerase chain reaction duplicate sequences with samtools or bamUtils, and the detection of variants with Freebayes or bam2mpg software. ANNOVAR is then used to annotate the predicted variants using gene models, population frequencies, and predicted mutation severity, producing variant files which can be viewed and filtered with the variant display tool VarSifter.
利用新一代核苷酸测序技术来发现小序列变异并进行基因分型,这为深入了解各种疾病的分子病因带来了诸多见解。本章介绍了如何使用免费软件将新一代测序读数与参考序列进行比对,然后利用所得比对结果来识别、注释、查看和筛选小序列变异。建议的变异识别工作流程包括使用 novoalign 进行读数比对、使用 samtools 或 bamUtils 去除聚合酶链反应重复序列,以及使用 Freebayes 或 bam2mpg 软件检测变异。然后使用 ANNOVAR 根据基因模型、群体频率和预测的突变严重程度对预测的变异进行注释,生成可通过变异显示工具 VarSifter 进行查看和筛选的变异文件。