PEPR: pipelines for evaluating prokaryotic references.

Author information

Olson Nathan D, Zook Justin M, Samarov Daniel V, Jackson Scott A, Salit Marc L

Affiliations

Biosystems and Biomaterials Division, Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA.

Statistical Engineering Division, Information Technology Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA.

Publication information

Anal Bioanal Chem. 2016 Apr;408(11):2975-83. doi: 10.1007/s00216-015-9299-5. Epub 2016 Mar 2.

Abstract

The rapid adoption of microbial whole genome sequencing in public health, clinical testing, and forensic laboratories requires the use of validated measurement processes. Well-characterized, homogeneous, and stable microbial genomic reference materials can be used to evaluate measurement processes, improving confidence in microbial whole genome sequencing results. We have developed a reproducible and transparent bioinformatics tool, PEPR, Pipelines for Evaluating Prokaryotic References, for characterizing the reference genome of prokaryotic genomic materials. PEPR evaluates the quality, purity, and homogeneity of the reference material genome, and purity of the genomic material. The quality of the genome is evaluated using high coverage paired-end sequence data; coverage, paired-end read size and direction, as well as soft-clipping rates, are used to identify mis-assemblies. The homogeneity and purity of the material relative to the reference genome are characterized by comparing base calls from replicate datasets generated using multiple sequencing technologies. Genomic purity of the material is assessed by checking for DNA contaminants. We demonstrate the tool and its output using sequencing data while developing a Staphylococcus aureus candidate genomic reference material. PEPR is open source and available at https://github.com/usnistgov/pepr.
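The abstract names coverage as one of the signals used to identify mis-assemblies. As a rough illustration of that idea only, the sketch below flags fixed-size windows whose mean coverage departs strongly from the genome-wide median. It is not PEPR's implementation: the function name `flag_coverage_outliers`, the window size, and the `low_frac` and `high_frac` thresholds are all assumptions chosen for this toy example, and the input is a plain list of per-base depths rather than real alignment data.

```python
from statistics import median


def flag_coverage_outliers(coverage, window=5, low_frac=0.5, high_frac=2.0):
    """Return half-open (start, end) windows with anomalous mean coverage.

    Hypothetical helper, not part of PEPR. A window is flagged when its mean
    depth is below low_frac or above high_frac times the overall median depth.
    """
    genome_median = median(coverage)
    flagged = []
    for start in range(0, len(coverage), window):
        chunk = coverage[start:start + window]
        mean_cov = sum(chunk) / len(chunk)
        too_low = mean_cov < low_frac * genome_median
        too_high = mean_cov > high_frac * genome_median
        if too_low or too_high:
            flagged.append((start, start + len(chunk)))
    return flagged


# Toy per-base depths: mostly 100x, with one drop and one spike.
toy = [100] * 10 + [20] * 5 + [100] * 10 + [300] * 5
print(flag_coverage_outliers(toy))  # [(10, 15), (25, 30)]
```

In practice a flagged window would only be a candidate for closer inspection; per the abstract, paired-end read size and direction and soft-clipping rates are considered alongside coverage.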
