• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从 bulk 和单细胞 ATAC-seq 中发现单核苷酸变体和插入缺失。

Discovering single nucleotide variants and indels from bulk and single-cell ATAC-seq.

机构信息

Bioinformatics and Systems Biology Graduate Program, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA.

Integrative Biology Laboratory, Salk Institute for Biological Studies, 10010 N. Torrey Pines Road, La Jolla, CA 92037, USA.

出版信息

Nucleic Acids Res. 2021 Aug 20;49(14):7986-7994. doi: 10.1093/nar/gkab621.

DOI:10.1093/nar/gkab621
PMID:34313779
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8373110/
Abstract

Genetic variants and de novo mutations in regulatory regions of the genome are typically discovered by whole-genome sequencing (WGS), however WGS is expensive and most WGS reads come from non-regulatory regions. The Assay for Transposase-Accessible Chromatin (ATAC-seq) generates reads from regulatory sequences and could potentially be used as a low-cost 'capture' method for regulatory variant discovery, but its use for this purpose has not been systematically evaluated. Here we apply seven variant callers to bulk and single-cell ATAC-seq data and evaluate their ability to identify single nucleotide variants (SNVs) and insertions/deletions (indels). In addition, we develop an ensemble classifier, VarCA, which combines features from individual variant callers to predict variants. The Genome Analysis Toolkit (GATK) is the best-performing individual caller with precision/recall on a bulk ATAC test dataset of 0.92/0.97 for SNVs and 0.87/0.82 for indels within ATAC-seq peak regions with at least 10 reads. On bulk ATAC-seq reads, VarCA achieves superior performance with precision/recall of 0.99/0.95 for SNVs and 0.93/0.80 for indels. On single-cell ATAC-seq reads, VarCA attains precision/recall of 0.98/0.94 for SNVs and 0.82/0.82 for indels. In summary, ATAC-seq reads can be used to accurately discover non-coding regulatory variants in the absence of whole-genome sequencing data and our ensemble method, VarCA, has the best overall performance.

摘要

基因组调控区的遗传变异和新生突变通常通过全基因组测序(WGS)来发现,然而 WGS 成本高昂,且大多数 WGS 读段来自非调控区。转座酶可及染色质分析(ATAC-seq)可从调控序列中生成读段,并且可能被用作低成本的调控变异发现“捕获”方法,但尚未对其用于该目的的效果进行系统评估。在这里,我们将七种变异调用器应用于批量和单细胞 ATAC-seq 数据,并评估它们识别单核苷酸变异(SNV)和插入/缺失(indel)的能力。此外,我们开发了一种集成分类器 VarCA,它结合了各个变异调用器的特征,以预测变异。基因组分析工具包(GATK)是表现最好的个体调用器,在批量 ATAC 测试数据集上的 SNV 精度/召回率为 0.92/0.97,在 ATAC-seq 峰区域内至少有 10 个读段的 indel 精度/召回率为 0.87/0.82。在批量 ATAC-seq 读段上,VarCA 的 SNV 精度/召回率为 0.99/0.95,indel 精度/召回率为 0.93/0.80,表现优于其他方法。在单细胞 ATAC-seq 读段上,VarCA 的 SNV 精度/召回率为 0.98/0.94,indel 精度/召回率为 0.82/0.82。总之,在没有全基因组测序数据的情况下,ATAC-seq 读段可用于准确发现非编码调控变异,并且我们的集成方法 VarCA 具有最佳的整体性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab3/8373110/740f0eb8f5eb/gkab621fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab3/8373110/a083c2fa34cb/gkab621fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab3/8373110/87e341766f1b/gkab621fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab3/8373110/03d6b95115e5/gkab621fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab3/8373110/4c0bd8df8697/gkab621fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab3/8373110/740f0eb8f5eb/gkab621fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab3/8373110/a083c2fa34cb/gkab621fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab3/8373110/87e341766f1b/gkab621fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab3/8373110/03d6b95115e5/gkab621fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab3/8373110/4c0bd8df8697/gkab621fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ab3/8373110/740f0eb8f5eb/gkab621fig5.jpg

相似文献

1
Discovering single nucleotide variants and indels from bulk and single-cell ATAC-seq.从 bulk 和单细胞 ATAC-seq 中发现单核苷酸变体和插入缺失。
Nucleic Acids Res. 2021 Aug 20;49(14):7986-7994. doi: 10.1093/nar/gkab621.
2
CoRE-ATAC: A deep learning model for the functional classification of regulatory elements from single cell and bulk ATAC-seq data.CoRE-ATAC:一种从单细胞和批量 ATAC-seq 数据中对调控元件进行功能分类的深度学习模型。
PLoS Comput Biol. 2021 Dec 13;17(12):e1009670. doi: 10.1371/journal.pcbi.1009670. eCollection 2021 Dec.
3
From reads to insight: a hitchhiker's guide to ATAC-seq data analysis.从读取到洞察:ATAC-seq 数据分析入门指南。
Genome Biol. 2020 Feb 3;21(1):22. doi: 10.1186/s13059-020-1929-3.
4
ATAC-STARR-seq reveals transcription factor-bound activators and silencers within chromatin-accessible regions of the human genome.ATAC-STARR-seq 揭示了人类基因组中染色质可及区域内转录因子结合的激活子和沉默子。
Genome Res. 2022 Aug 25;32(8):1529-1541. doi: 10.1101/gr.276766.122.
5
Hydrop enables droplet-based single-cell ATAC-seq and single-cell RNA-seq using dissolvable hydrogel beads.Hydrop 可利用可溶解水凝胶珠进行基于液滴的单细胞 ATAC-seq 和单细胞 RNA-seq。
Elife. 2022 Feb 23;11:e73971. doi: 10.7554/eLife.73971.
6
Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants.在检测外显子变异方面,全基因组测序比全外显子测序更强大。
Proc Natl Acad Sci U S A. 2015 Apr 28;112(17):5473-8. doi: 10.1073/pnas.1418631112. Epub 2015 Mar 31.
7
CloudATAC: a cloud-based framework for ATAC-Seq data analysis.CloudATAC:一个基于云的 ATAC-Seq 数据分析框架。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae090.
8
An organism-wide ATAC-seq peak catalog for the bovine and its use to identify regulatory variants.牛的全基因组 ATAC-seq 峰目录及其用于鉴定调控变体的应用
Genome Res. 2023 Oct;33(10):1848-1864. doi: 10.1101/gr.277947.123. Epub 2023 Sep 26.
9
DeNOPA: decoding nucleosome positions sensitively with sparse ATAC-seq data.DeNOPA:使用稀疏 ATAC-seq 数据灵敏地解码核小体位置。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab469.
10
Interrogating the Accessible Chromatin Landscape of Eukaryote Genomes Using ATAC-seq.利用 ATAC-seq 技术研究真核生物基因组的可及染色质景观。
Methods Mol Biol. 2021;2243:183-226. doi: 10.1007/978-1-0716-1103-6_10.

引用本文的文献

1
Discovery of candidate functional non-coding mutations in acute myeloid leukemia using single-cell chromatin accessibility sequencing.利用单细胞染色质可及性测序发现急性髓系白血病中的候选功能性非编码突变
Commun Biol. 2025 May 26;8(1):808. doi: 10.1038/s42003-025-08257-8.
2
Single-cell sequencing advances in research on mesenchymal stem/stromal cells.单细胞测序技术在间充质干细胞研究中的进展。
Hum Cell. 2024 Jul;37(4):904-916. doi: 10.1007/s13577-024-01076-9. Epub 2024 May 14.
3
The Evolutionary Interplay of Somatic and Germline Mutation Rates.
体细胞突变率与种系突变率的进化相互作用
Annu Rev Biomed Data Sci. 2024 Aug;7(1):83-105. doi: 10.1146/annurev-biodatasci-102523-104225. Epub 2024 Jul 24.
4
Phertilizer: Growing a clonal tree from ultra-low coverage single-cell DNA sequencing of tumors.费提利泽:从肿瘤超低覆盖度单细胞 DNA 测序中培育克隆树。
PLoS Comput Biol. 2023 Oct 11;19(10):e1011544. doi: 10.1371/journal.pcbi.1011544. eCollection 2023 Oct.
5
Parallel recovery of chromatin accessibility and gene expression dynamics from frozen human regulatory T cells.从冷冻的人类调节性 T 细胞中平行恢复染色质可及性和基因表达动力学。
Sci Rep. 2023 Apr 4;13(1):5506. doi: 10.1038/s41598-023-32256-6.
6
Deletion mapping of regulatory elements for GATA3 in T cells reveals a distal enhancer involved in allergic diseases.T 细胞中 GATA3 调控元件的缺失作图揭示了一个参与过敏疾病的远端增强子。
Am J Hum Genet. 2023 Apr 6;110(4):703-714. doi: 10.1016/j.ajhg.2023.03.008. Epub 2023 Mar 28.
7
A Novel NCSTN Mutation in a Three-Generation Chinese Family with Hidradenitis Suppurative.一个三代中国家系中化脓性汗腺炎的新型 NCSTN 突变
J Healthc Eng. 2022 Mar 24;2022:1540774. doi: 10.1155/2022/1540774. eCollection 2022.
8
Natural Barcodes for Longitudinal Single Cell Tracking of Leukemic and Immune Cell Dynamics.自然条形码用于白血病和免疫细胞动力学的纵向单细胞跟踪。
Front Immunol. 2022 Jan 3;12:788891. doi: 10.3389/fimmu.2021.788891. eCollection 2021.