Suppr超能文献

RatesTools:一种用于检测家系序列数据中新生种系突变的Nextflow管道。

RatesTools: a Nextflow pipeline for detecting de novo germline mutations in pedigree sequence data.

作者信息

Armstrong Ellie E, Campana Michael G

机构信息

School of Biological Sciences, Washington State University, Pullman, WA 99164, USA.

Department of Biology, Stanford University, Stanford, CA 94305, USA.

出版信息

Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac784.

Abstract

SUMMARY

Here, we introduce RatesTools, an automated pipeline to infer de novo mutation rates from parent-offspring trio data of diploid organisms. By providing a reference genome and high-coverage, whole-genome resequencing data of a minimum of three individuals (sire, dam and offspring), RatesTools provides a list of candidate de novo mutations and calculates a putative mutation rate. RatesTools uses several quality filtering steps, such as discarding sites with low mappability and highly repetitive regions, as well as sites with low genotype and mapping qualities to find potential de novo mutations. In addition, RatesTools implements several optional filters based on post hoc assumptions of the heterozygosity and mutation rate of the organism. Filters are highly customizable to user specifications in order to maximize utility across a wide range of applications.

AVAILABILITY AND IMPLEMENTATION

RatesTools is freely available at https://github.com/campanam/RatesTools under a Creative Commons Zero (CC0) license. The pipeline is implemented in Nextflow (Di Tommaso et al., 2017), Ruby (http://www.ruby-lang.org), Bash (https://www.gnu.org/software/bash/) and R (R Core Team, 2020) with reliance upon several other freely available tools. RatesTools is compatible with macOS and Linux operating systems.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

摘要

在此,我们介绍RatesTools,这是一种用于从二倍体生物的亲子三代数据中推断新生突变率的自动化流程。通过提供一个参考基因组以及至少三个个体(父本、母本和子代)的高覆盖度全基因组重测序数据,RatesTools会提供一份新生突变候选列表,并计算出一个推定的突变率。RatesTools使用了几个质量过滤步骤,比如丢弃映射性低和高度重复区域的位点,以及基因型和映射质量低的位点,以找到潜在的新生突变。此外,RatesTools基于对生物体杂合性和突变率的事后假设实施了几个可选过滤器。过滤器可根据用户规格进行高度定制,以便在广泛的应用中最大化效用。

可用性与实现方式

RatesTools可在https://github.com/campanam/RatesTools上根据知识共享零(CC0)许可免费获取。该流程在Nextflow(迪·托马索等人,2017年)、Ruby(http://www.ruby-lang.org)、Bash(https://www.gnu.org/software/bash/)和R(R核心团队,2020年)中实现,并依赖其他几个免费工具。RatesTools与macOS和Linux操作系统兼容。

补充信息

补充数据可在《生物信息学》在线版获取。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验