文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

SAMtools 和 BCFtools 十二年。

Twelve years of SAMtools and BCFtools.

机构信息

Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK.

Wolfson Wohl Cancer Research Centre, Institute of Cancer Sciences, University of Glasgow, Switchback Road, Glasgow, G61 1QH, UK.

出版信息

Gigascience. 2021 Feb 16;10(2). doi: 10.1093/gigascience/giab008.


DOI:10.1093/gigascience/giab008
PMID:33590861
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7931819/
Abstract

BACKGROUND: SAMtools and BCFtools are widely used programs for processing and analysing high-throughput sequencing data. They include tools for file format conversion and manipulation, sorting, querying, statistics, variant calling, and effect analysis amongst other methods. FINDINGS: The first version appeared online 12 years ago and has been maintained and further developed ever since, with many new features and improvements added over the years. The SAMtools and BCFtools packages represent a unique collection of tools that have been used in numerous other software projects and countless genomic pipelines. CONCLUSION: Both SAMtools and BCFtools are freely available on GitHub under the permissive MIT licence, free for both non-commercial and commercial use. Both packages have been installed >1 million times via Bioconda. The source code and documentation are available from https://www.htslib.org.

摘要

背景:SAMtools 和 BCFtools 是广泛用于处理和分析高通量测序数据的程序。它们包括用于文件格式转换和操作、排序、查询、统计、变异调用和效应分析等方法的工具。

发现:第一个版本于 12 年前在线出现,此后一直得到维护和进一步开发,多年来添加了许多新功能和改进。SAMtools 和 BCFtools 包代表了一组独特的工具,这些工具已被用于许多其他软件项目和无数的基因组管道中。

结论:SAMtools 和 BCFtools 都可以在 GitHub 上根据许可协议免费获取,无论是非商业还是商业用途都可以免费使用。这两个包都通过 Bioconda 被安装了超过 100 万次。源代码和文档可从 https://www.htslib.org 获得。

相似文献

[1]
Twelve years of SAMtools and BCFtools.

Gigascience. 2021-2-16

[2]
HTSlib: C library for reading/writing high-throughput sequencing data.

Gigascience. 2021-2-16

[3]
BCFtools/RoH: a hidden Markov model approach for detecting autozygosity from next-generation sequencing data.

Bioinformatics. 2016-6-1

[4]
Comparison of seven SNP calling pipelines for the next-generation sequencing data of chickens.

PLoS One. 2022

[5]
ILIAD: a suite of automated Snakemake workflows for processing genomic data for downstream applications.

BMC Bioinformatics. 2023-11-8

[6]
BCFtools/csq: haplotype-aware variant consequences.

Bioinformatics. 2017-7-1

[7]
CRAM 3.1: advances in the CRAM file format.

Bioinformatics. 2022-3-4

[8]
RGAAT: A Reference-based Genome Assembly and Annotation Tool for New Genomes and Upgrade of Known Genomes.

Genomics Proteomics Bioinformatics. 2018-12-21

[9]
NGS-QCbox and Raspberry for Parallel, Automated and Rapid Quality Control Analysis of Large-Scale Next Generation Sequencing (Illumina) Data.

PLoS One. 2015-10-13

[10]
The evaluation of Bcftools mpileup and GATK HaplotypeCaller for variant calling in non-human species.

Sci Rep. 2022-7-5

引用本文的文献

[1]
Lineage-specific targets of positive selection in three leaf beetles correspond with defence capacity against their shared parasitoid wasp.

Heredity (Edinb). 2025-9-8

[2]
Genetic data and meteorological conditions suggesting windborne transmission of H5N1 high-pathogenicity avian influenza between commercial poultry outbreaks.

PLoS One. 2025-9-5

[3]
Evaluating the diagnostic capabilities of nanopore sequencing for detection in blacklegged ticks.

bioRxiv. 2025-8-27

[4]
The genome sequence of the zebra danio, (Hamilton, 1822) (Cypriniformes: Danionidae).

Wellcome Open Res. 2025-7-4

[5]
The genome sequence of the China Limpet, Gmelin, 1791 (Patellidae).

Wellcome Open Res. 2025-8-11

[6]
The genome sequence of a cranefly, ( ) Loew, 1873.

Wellcome Open Res. 2024-10-16

[7]
Targeted DNA ADP-ribosylation triggers templated repair in bacteria and base mutagenesis in eukaryotes.

Nat Biotechnol. 2025-9-4

[8]
Deciphering the Gene Expression and Alternative Splicing Basis of Muscle Development Through Interpretable Machine Learning Models.

Biology (Basel). 2025-8-15

[9]
Multi-omics uncovers transcriptional programs of gut-resident memory CD4+ T cells in Crohn's disease.

J Exp Med. 2025-11-3

[10]
Chromatin Accessibility Dynamics Reveal Conserved Transcriptional Regulatory Networks During Insect Metamorphosis in and .

Biology (Basel). 2025-7-22

本文引用的文献

[1]
HTSlib: C library for reading/writing high-throughput sequencing data.

Gigascience. 2021-2-16

[2]
Comparison of Read Mapping and Variant Calling Tools for the Analysis of Plant NGS Data.

Plants (Basel). 2020-4-2

[3]
Systematic comparative analysis of single-nucleotide variant detection methods from single-cell RNA sequencing data.

Genome Biol. 2019-11-19

[4]
Crumble: reference free lossy compression of sequence quality values.

Bioinformatics. 2019-1-15

[5]
Bioconda: sustainable and comprehensive software distribution for the life sciences.

Nat Methods. 2018-7

[6]
BCFtools/csq: haplotype-aware variant consequences.

Bioinformatics. 2017-7-1

[7]
The Ensembl Variant Effect Predictor.

Genome Biol. 2016-6-6

[8]
A Method for Checking Genomic Integrity in Cultured Cell Lines from SNP Genotyping Data.

PLoS One. 2016-5-13

[9]
BCFtools/RoH: a hidden Markov model approach for detecting autozygosity from next-generation sequencing data.

Bioinformatics. 2016-6-1

[10]
Choice of reference-guided sequence assembler and SNP caller for analysis of Listeria monocytogenes short-read sequence data greatly influences rates of error.

BMC Res Notes. 2015-12-8

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索