• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ATAV:一个用于全人群基因组分析的综合平台。

ATAV: a comprehensive platform for population-scale genomic analyses.

机构信息

Institute for Genomic Medicine, Columbia University Irving Medical Center, New York, NY, 10032, USA.

出版信息

BMC Bioinformatics. 2021 Mar 23;22(1):149. doi: 10.1186/s12859-021-04071-1.

DOI:10.1186/s12859-021-04071-1
PMID:33757430
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7988908/
Abstract

BACKGROUND

A common approach for sequencing studies is to do joint-calling and store variants of all samples in a single file. If new samples are continually added or controls are re-used for several studies, the cost and time required to perform joint-calling for each analysis can become prohibitive.

RESULTS

We present ATAV, an analysis platform for large-scale whole-exome and whole-genome sequencing projects. ATAV stores variant and per site coverage data for all samples in a centralized database, which is efficiently queried by ATAV to support diagnostic analyses for trios and singletons, as well as rare-variant collapsing analyses for finding disease associations in complex diseases. Runtime logs ensure full reproducibility and the modularized ATAV framework makes it extensible to continuous development. Besides helping with the identification of disease-causing variants for a range of diseases, ATAV has also enabled the discovery of disease-genes by rare-variant collapsing on datasets containing more than 20,000 samples. Analyses to date have been performed on data of more than 110,000 individuals demonstrating the scalability of the framework. To allow users to easily access variant-level data directly from the database, we provide a web-based interface, the ATAV data browser ( http://atavdb.org/ ). Through this browser, summary-level data for more than 40,000 samples can be queried by the general public representing a mix of cases and controls of diverse ancestries. Users have access to phenotype categories of variant carriers, as well as predicted ancestry, gender, and quality metrics. In contrast to many other platforms, the data browser is able to show data of newly-added samples in real-time and therefore evolves rapidly as more and more samples are sequenced.

CONCLUSIONS

Through ATAV, users have public access to one of the largest variant databases for patients sequenced at a tertiary care center and can look up any genes or variants of interest. Additionally, since the entire code is freely available on GitHub, ATAV can easily be deployed by other groups that wish to build their own platform, database, and user interface.

摘要

背景

测序研究的常用方法是联合调用,并将所有样本的变体存储在单个文件中。如果不断添加新样本或控制样本用于多项研究,那么为每个分析执行联合调用的成本和时间可能会变得过高。

结果

我们介绍了 ATAV,这是一个用于大规模全外显子组和全基因组测序项目的分析平台。ATAV 将所有样本的变体和每个位点的覆盖数据存储在一个集中式数据库中,ATAV 可以通过该数据库高效地查询这些数据,以支持对三核苷酸重复和单体的诊断分析,以及对复杂疾病进行罕见变体合并分析以寻找疾病关联。运行时日志确保了完全的可重复性,而模块化的 ATAV 框架使其可以扩展到持续的开发中。除了帮助识别各种疾病的致病变体外,ATAV 还通过对包含超过 20,000 个样本的数据集进行罕见变体合并分析,发现了疾病基因。迄今为止,已经对超过 110,000 个人的数据进行了分析,证明了该框架的可扩展性。为了允许用户轻松地从数据库直接访问变体级别的数据,我们提供了一个基于网络的界面,即 ATAV 数据浏览器(http://atavdb.org/)。通过该浏览器,代表不同种族背景的病例和对照的 40,000 多个样本的汇总数据可以由公众查询。用户可以访问变异携带者的表型类别,以及预测的种族、性别和质量指标。与许多其他平台不同,该数据浏览器能够实时显示新添加样本的数据,因此随着越来越多的样本被测序,它会迅速发展。

结论

通过 ATAV,用户可以公开访问一个在三级护理中心测序的患者的最大变体数据库之一,并可以查询任何感兴趣的基因或变体。此外,由于整个代码都可以在 GitHub 上免费获得,因此其他希望构建自己的平台、数据库和用户界面的团体可以轻松地部署 ATAV。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/8a168e110be9/12859_2021_4071_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/5667f4277806/12859_2021_4071_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/b60f8b882c20/12859_2021_4071_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/6db9ab142e25/12859_2021_4071_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/fcc87a7b8096/12859_2021_4071_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/30d63548a407/12859_2021_4071_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/8a168e110be9/12859_2021_4071_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/5667f4277806/12859_2021_4071_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/b60f8b882c20/12859_2021_4071_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/6db9ab142e25/12859_2021_4071_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/fcc87a7b8096/12859_2021_4071_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/30d63548a407/12859_2021_4071_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d376/7988908/8a168e110be9/12859_2021_4071_Fig6_HTML.jpg

相似文献

1
ATAV: a comprehensive platform for population-scale genomic analyses.ATAV:一个用于全人群基因组分析的综合平台。
BMC Bioinformatics. 2021 Mar 23;22(1):149. doi: 10.1186/s12859-021-04071-1.
2
Genome analysis and knowledge-driven variant interpretation with TGex.基因组分析和基于 TGex 的知识驱动的变异解释。
BMC Med Genomics. 2019 Dec 30;12(1):200. doi: 10.1186/s12920-019-0647-8.
3
JWES: a new pipeline for whole genome/exome sequence data processing, management, and gene-variant discovery, annotation, prediction, and genotyping.JWES:一个用于全基因组/外显子组序列数据处理、管理以及基因变异发现、注释、预测和基因分型的新管道。
FEBS Open Bio. 2021 Sep;11(9):2441-2452. doi: 10.1002/2211-5463.13261. Epub 2021 Aug 11.
4
BioBin: a bioinformatics tool for automating the binning of rare variants using publicly available biological knowledge.BioBin:一个生物信息学工具,用于利用公开可用的生物知识自动对稀有变体进行分类。
BMC Med Genomics. 2013;6 Suppl 2(Suppl 2):S6. doi: 10.1186/1755-8794-6-S2-S6. Epub 2013 May 7.
5
Mitochondrial Disease Sequence Data Resource (MSeqDR): a global grass-roots consortium to facilitate deposition, curation, annotation, and integrated analysis of genomic data for the mitochondrial disease clinical and research communities.线粒体疾病序列数据资源(MSeqDR):一个全球基层联盟,旨在促进为线粒体疾病临床和研究群体进行基因组数据的提交、管理、注释及综合分析。
Mol Genet Metab. 2015 Mar;114(3):388-96. doi: 10.1016/j.ymgme.2014.11.016. Epub 2014 Dec 4.
6
A community-based resource for automatic exome variant-calling and annotation in Mendelian disorders.一个基于社区的用于孟德尔疾病中自动外显子组变异检测和注释的资源。
BMC Genomics. 2014;15 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2164-15-S3-S5. Epub 2014 May 6.
7
The ExAC browser: displaying reference data information from over 60 000 exomes.ExAC浏览器:展示来自6万多个外显子组的参考数据信息。
Nucleic Acids Res. 2017 Jan 4;45(D1):D840-D845. doi: 10.1093/nar/gkw971. Epub 2016 Nov 28.
8
Huvariome: a web server resource of whole genome next-generation sequencing allelic frequencies to aid in pathological candidate gene selection.Huvariome:一个用于辅助病理候选基因选择的全基因组下一代测序等位基因频率的网络服务器资源。
J Clin Bioinforma. 2012 Nov 19;2(1):19. doi: 10.1186/2043-9113-2-19.
9
The Clinical Genome and Ancestry Report: An interactive web application for prioritizing clinically implicated variants from genome sequencing data with ancestry composition.临床基因组和祖源报告:一个用于从基因组测序数据中根据祖源成分优先考虑具有临床意义的变异的交互式网络应用程序。
Hum Mutat. 2020 Feb;41(2):387-396. doi: 10.1002/humu.23942. Epub 2019 Nov 15.
10
SnpHub: an easy-to-set-up web server framework for exploring large-scale genomic variation data in the post-genomic era with applications in wheat.SnpHub:一个易于搭建的 Web 服务器框架,用于在后基因组时代探索大规模基因组变异数据,在小麦中有应用。
Gigascience. 2020 Jun 1;9(6). doi: 10.1093/gigascience/giaa060.

引用本文的文献

1
Persistent Tic Disorders Are Associated With 17q12 Duplications.持续性抽动障碍与17q12重复相关。
Res Sq. 2025 Aug 19:rs.3.rs-7031850. doi: 10.21203/rs.3.rs-7031850/v1.
2
Exome analysis links kidney malformations to developmental disorders and reveals causal genes.外显子组分析将肾脏畸形与发育障碍联系起来,并揭示了致病基因。
Nat Commun. 2025 Aug 7;16(1):7290. doi: 10.1038/s41467-025-62319-3.
3
Interstitial Cystitis: a phenotype and rare variant exome sequencing study: Interstitial Cystitis: a phenotype and exome sequencing study.

本文引用的文献

1
Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.美国国立卫生研究院生物医学高级研究与发展局(NHLBI)TOPMed 项目中对 53831 个不同基因组进行测序。
Nature. 2021 Feb;590(7845):290-299. doi: 10.1038/s41586-021-03205-y. Epub 2021 Feb 10.
2
Assessing the Role of Rare Genetic Variation in Patients With Heart Failure.评估心力衰竭患者中罕见遗传变异的作用。
JAMA Cardiol. 2021 Apr 1;6(4):379-386. doi: 10.1001/jamacardio.2020.6500.
3
Causal Genetic Variants in Stillbirth.死产的因果遗传变异。
间质性膀胱炎:一项表型与罕见变异外显子组测序研究:间质性膀胱炎:一项表型与外显子组测序研究。
medRxiv. 2025 Feb 18:2025.02.16.25322147. doi: 10.1101/2025.02.16.25322147.
4
Natural History and Clinicopathological Associations of TRPC6-Associated Podocytopathy.与TRPC6相关的足细胞病的自然史及临床病理关联
J Am Soc Nephrol. 2025 Feb 1;36(2):274-289. doi: 10.1681/ASN.0000000501. Epub 2024 Oct 1.
5
Increased risk of kidney failure in patients with genetic kidney disorders.遗传性肾脏疾病患者发生肾衰竭的风险增加。
J Clin Invest. 2024 Sep 3;134(17):e178573. doi: 10.1172/JCI178573.
6
Rare variant analyses validate known ALS genes in a multi-ethnic population and identifies ANTXR2 as a candidate in PLS.罕见变异分析在多民族人群中验证了已知的 ALS 基因,并确定 ANTXR2 为 PLS 的候选基因。
BMC Genomics. 2024 Jun 29;25(1):651. doi: 10.1186/s12864-024-10538-1.
7
Assisting the analysis of insertions and deletions using regional allele frequencies.利用区域等位基因频率辅助插入和缺失分析。
Funct Integr Genomics. 2024 May 20;24(3):104. doi: 10.1007/s10142-024-01358-3.
8
FLT3L governs the development of partially overlapping hematopoietic lineages in humans and mice.FLT3L 调控人类和小鼠部分重叠的造血谱系发育。
Cell. 2024 May 23;187(11):2817-2837.e31. doi: 10.1016/j.cell.2024.04.009. Epub 2024 May 3.
9
Developing a genetic testing panel for evaluation of morbidities in kidney transplant recipients.为评估肾移植受者的并发症开发基因检测面板。
Kidney Int. 2024 Jul;106(1):115-125. doi: 10.1016/j.kint.2024.02.021. Epub 2024 Mar 21.
10
Investigating the Relationship Between Rare Genetic Variants and Fibrosis in Pediatric Nonalcoholic Fatty Liver Disease.探究儿童非酒精性脂肪性肝病中罕见基因变异与纤维化之间的关系。
medRxiv. 2024 Mar 4:2024.03.02.24303632. doi: 10.1101/2024.03.02.24303632.
N Engl J Med. 2020 Sep 17;383(12):1107-1116. doi: 10.1056/NEJMoa1908753. Epub 2020 Aug 12.
4
The mutational constraint spectrum quantified from variation in 141,456 humans.从 141456 名人类个体的变异中量化的突变约束谱。
Nature. 2020 May;581(7809):434-443. doi: 10.1038/s41586-020-2308-7. Epub 2020 May 27.
5
Rare-variant collapsing analyses for complex traits: guidelines and applications.复杂性状的罕见变异合并分析:指南与应用。
Nat Rev Genet. 2019 Dec;20(12):747-759. doi: 10.1038/s41576-019-0177-4. Epub 2019 Oct 11.
6
Exome-Based Rare-Variant Analyses in CKD.基于外显子组的慢性肾脏病罕见变异分析。
J Am Soc Nephrol. 2019 Jun;30(6):1109-1122. doi: 10.1681/ASN.2018090909. Epub 2019 May 13.
7
A new approach for rare variation collapsing on functional protein domains implicates specific genic regions in ALS.一种针对功能蛋白结构域罕见变异的新方法提示 ALS 中特定基因区域的作用。
Genome Res. 2019 May;29(5):809-818. doi: 10.1101/gr.243592.118. Epub 2019 Apr 2.
8
A case-control collapsing analysis identifies retinal dystrophy genes associated with ophthalmic disease in patients with no pathogenic ABCA4 variants.一项病例对照合并分析鉴定了与无致病性 ABCA4 变异患者的眼科疾病相关的视网膜营养不良基因。
Genet Med. 2019 Oct;21(10):2336-2344. doi: 10.1038/s41436-019-0495-0. Epub 2019 Mar 30.
9
Improved Pathogenic Variant Localization via a Hierarchical Model of Sub-regional Intolerance.通过亚区域耐受的层次模型提高致病变体定位。
Am J Hum Genet. 2019 Feb 7;104(2):299-309. doi: 10.1016/j.ajhg.2018.12.020. Epub 2019 Jan 24.
10
Diagnostic Utility of Exome Sequencing for Kidney Disease.外显子组测序在肾脏疾病诊断中的应用。
N Engl J Med. 2019 Jan 10;380(2):142-151. doi: 10.1056/NEJMoa1806891. Epub 2018 Dec 26.