Berlin Institute of Health (BIH), Berlin, Germany.
Charité -Universitätsmedizin Berlin, Berlin, Germany.
PLoS Comput Biol. 2020 Jun 4;16(6):e1007956. doi: 10.1371/journal.pcbi.1007956. eCollection 2020 Jun.
Targeted sequencing of genomic regions is a cost- and time-efficient approach for screening patient cohorts. We present a fast and efficient workflow to analyze highly imbalanced, targeted next-generation sequencing data generated using molecular inversion probe (MIP) capture. Our Snakemake pipeline performs sample demultiplexing, overlap paired-end merging, alignment, MIP-arm trimming, variant calling, coverage analysis and report generation. Further, we support the analysis of probes specifically designed to capture certain structural variants and can assign sex using Y-chromosome-unique probes. In a user-friendly HTML report, we summarize all these results including covered, incomplete or missing regions, called variants and their predicted effects. We developed and tested our pipeline using the hemophilia A & B MIP design from the "My Life, Our Future" initiative. HemoMIPs is available as an open-source tool on GitHub at: https://github.com/kircherlab/hemoMIPs.
靶向测序是一种经济高效的筛选患者群体的方法。我们提出了一种快速有效的工作流程,用于分析使用分子倒置探针(MIP)捕获生成的高度不平衡的靶向下一代测序数据。我们的 Snakemake 管道执行样本去复用、重叠配对末端合并、比对、MIP 臂修剪、变异调用、覆盖度分析和报告生成。此外,我们还支持专门设计用于捕获某些结构变体的探针的分析,并可以使用 Y 染色体特有的探针来分配性别。在用户友好的 HTML 报告中,我们总结了所有这些结果,包括覆盖、不完整或缺失的区域、调用的变体及其预测的影响。我们使用“我的生活,我们的未来”倡议中的血友病 A&B MIP 设计开发并测试了我们的管道。HemoMIPs 可在 GitHub 上作为开源工具获得:https://github.com/kircherlab/hemoMIPs。