• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PEMapper和PECaller为全基因组测序提供了一种简化方法。

PEMapper and PECaller provide a simplified approach to whole-genome sequencing.

作者信息

Johnston H Richard, Chopra Pankaj, Wingo Thomas S, Patel Viren, Epstein Michael P, Mulle Jennifer G, Warren Stephen T, Zwick Michael E, Cutler David J

机构信息

Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322.

Department of Biostatistics and Bioinformatics, Emory University Rollins School of Public Health, Atlanta, GA 30322.

出版信息

Proc Natl Acad Sci U S A. 2017 Mar 7;114(10):E1923-E1932. doi: 10.1073/pnas.1618065114. Epub 2017 Feb 21.

DOI:10.1073/pnas.1618065114
PMID:28223510
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5347547/
Abstract

The analysis of human whole-genome sequencing data presents significant computational challenges. The sheer size of datasets places an enormous burden on computational, disk array, and network resources. Here, we present an integrated computational package, PEMapper/PECaller, that was designed specifically to minimize the burden on networks and disk arrays, create output files that are minimal in size, and run in a highly computationally efficient way, with the single goal of enabling whole-genome sequencing at scale. In addition to improved computational efficiency, we implement a statistical framework that allows for a base by base error model, allowing this package to perform as well or better than the widely used Genome Analysis Toolkit (GATK) in all key measures of performance on human whole-genome sequences.

摘要

人类全基因组测序数据的分析带来了重大的计算挑战。数据集的庞大规模给计算、磁盘阵列和网络资源带来了巨大负担。在此,我们展示了一个集成计算包PEMapper/PECaller,其专门设计用于将网络和磁盘阵列的负担降至最低,创建尺寸最小的输出文件,并以高度计算高效的方式运行,唯一目标是实现大规模全基因组测序。除了提高计算效率外,我们还实施了一个统计框架,该框架允许基于碱基的错误模型,使该软件包在人类全基因组序列的所有关键性能指标上表现与广泛使用的基因组分析工具包(GATK)相当或更优。

相似文献

1
PEMapper and PECaller provide a simplified approach to whole-genome sequencing.PEMapper和PECaller为全基因组测序提供了一种简化方法。
Proc Natl Acad Sci U S A. 2017 Mar 7;114(10):E1923-E1932. doi: 10.1073/pnas.1618065114. Epub 2017 Feb 21.
2
Fast and accurate DNASeq variant calling workflow composed of LUSH toolkit.由 LUSH 工具包组成的快速准确的 DNA 测序变异调用工作流程。
Hum Genomics. 2024 Oct 10;18(1):114. doi: 10.1186/s40246-024-00666-w.
3
SNP-SVant: A Computational Workflow to Predict and Annotate Genomic Variants in Organisms Lacking Benchmarked Variants.SNP-SVant:一种在缺乏基准变异的生物中预测和注释基因组变异的计算工作流程。
Curr Protoc. 2024 May;4(5):e1046. doi: 10.1002/cpz1.1046.
4
Impact of post-alignment processing in variant discovery from whole exome data.全外显子数据变异发现中比对后处理的影响
BMC Bioinformatics. 2016 Oct 3;17(1):403. doi: 10.1186/s12859-016-1279-z.
5
Comparing a few SNP calling algorithms using low-coverage sequencing data.比较几种使用低覆盖度测序数据的 SNP calling 算法。
BMC Bioinformatics. 2013 Sep 17;14:274. doi: 10.1186/1471-2105-14-274.
6
Variant callers for next-generation sequencing data: a comparison study.下一代测序数据的变异调用者:一项比较研究。
PLoS One. 2013 Sep 27;8(9):e75619. doi: 10.1371/journal.pone.0075619. eCollection 2013.
7
Need for speed in accurate whole-genome data analysis: GENALICE MAP challenges BWA/GATK more than PEMapper/PECaller and Isaac.准确的全基因组数据分析对速度的需求:GENALICE MAP 对 BWA/GATK 的挑战超过了 PEMapper/PECaller 和 Isaac。
Proc Natl Acad Sci U S A. 2017 Oct 3;114(40):E8320-E8322. doi: 10.1073/pnas.1713830114. Epub 2017 Sep 15.
8
Accuracy and efficiency of germline variant calling pipelines for human genome data.人类基因组数据种系变异调用管道的准确性和效率。
Sci Rep. 2020 Nov 19;10(1):20222. doi: 10.1038/s41598-020-77218-4.
9
Generation of SNP datasets for orangutan population genomics using improved reduced-representation sequencing and direct comparisons of SNP calling algorithms.利用改良的简化代表性测序和 SNP 调用算法的直接比较,生成猩猩群体基因组学的 SNP 数据集。
BMC Genomics. 2014 Jan 10;15:16. doi: 10.1186/1471-2164-15-16.
10
OVarFlow: a resource optimized GATK 4 based Open source Variant calling workFlow.OVarFlow:一种基于资源优化的 GATK4 的开源变异调用工作流程。
BMC Bioinformatics. 2021 Aug 13;22(1):402. doi: 10.1186/s12859-021-04317-y.

引用本文的文献

1
Multiancestry brain pQTL fine-mapping and integration with genome-wide association studies of 21 neurologic and psychiatric conditions.多祖先脑蛋白定量性状基因座精细定位及与21种神经和精神疾病全基因组关联研究的整合
Nat Genet. 2025 Sep 8. doi: 10.1038/s41588-025-02291-2.
2
Chromatin regulators in the TBX1 network confer risk for conotruncal heart defects in 22q11.2DS.TBX1 网络中的染色质调节因子赋予 22q11.2 缺失综合征患者圆锥动脉干心脏缺陷的风险。
NPJ Genom Med. 2023 Jul 18;8(1):17. doi: 10.1038/s41525-023-00363-y.
3
Evaluation of AQP4 functional variants and its association with fragile X-associated tremor/ataxia syndrome.水通道蛋白4功能变异体的评估及其与脆性X相关震颤/共济失调综合征的关联。
Front Aging Neurosci. 2023 Jan 6;14:1073258. doi: 10.3389/fnagi.2022.1073258. eCollection 2022.
4
Loss of function of OTUD7A in the schizophrenia- associated 15q13.3 deletion impairs synapse development and function in human neurons.OTUD7A 功能丧失在精神分裂症相关的 15q13.3 缺失中损害人类神经元的突触发育和功能。
Am J Hum Genet. 2022 Aug 4;109(8):1500-1519. doi: 10.1016/j.ajhg.2022.07.001.
5
Novel Missense Variant Identified in Two Consanguineous Pakistani Families With Developmental Delay, Epilepsy, Intellectual Disability, and Aggressive Behavior.在两个患有发育迟缓、癫痫、智力残疾和攻击性行为的巴基斯坦近亲家庭中发现新型错义变异。
Front Neurol. 2022 Jul 14;13:918022. doi: 10.3389/fneur.2022.918022. eCollection 2022.
6
Identification of as a genetic modifier of fragile X-associated tremor/ataxia syndrome.鉴定为脆性 X 相关震颤/共济失调综合征的遗传修饰剂。
Proc Natl Acad Sci U S A. 2022 May 31;119(22):e2118124119. doi: 10.1073/pnas.2118124119. Epub 2022 May 26.
7
Identifying susceptibility genes for primary ovarian insufficiency on the high-risk genetic background of a fragile X premutation.在脆性 X 前突变的高风险遗传背景下,鉴定原发性卵巢功能不全的易感基因。
Fertil Steril. 2021 Sep;116(3):843-854. doi: 10.1016/j.fertnstert.2021.04.021. Epub 2021 May 18.
8
Identifying genetic factors that contribute to the increased risk of congenital heart defects in infants with Down syndrome.鉴定导致唐氏综合征婴儿先天性心脏缺陷风险增加的遗传因素。
Sci Rep. 2020 Oct 22;10(1):18051. doi: 10.1038/s41598-020-74650-4.
9
CNV Radar: an improved method for somatic copy number alteration characterization in oncology.CNV 雷达:一种改进的肿瘤体细胞拷贝数改变特征分析方法。
BMC Bioinformatics. 2020 Mar 6;21(1):98. doi: 10.1186/s12859-020-3397-x.
10
Genetic contributors to risk of schizophrenia in the presence of a 22q11.2 deletion.22q11.2 缺失时精神分裂症风险的遗传因素。
Mol Psychiatry. 2021 Aug;26(8):4496-4510. doi: 10.1038/s41380-020-0654-3. Epub 2020 Feb 3.

本文引用的文献

1
Whole Genome Sequencing Expands Diagnostic Utility and Improves Clinical Management in Pediatric Medicine.全基因组测序扩大了诊断效用并改善了儿科医学的临床管理。
NPJ Genom Med. 2016 Jan 13;1:15012-. doi: 10.1038/npjgenmed.2015.12.
2
novoBreak: local assembly for breakpoint detection in cancer genomes.novoBreak:用于癌症基因组中断点检测的局部组装
Nat Methods. 2017 Jan;14(1):65-67. doi: 10.1038/nmeth.4084. Epub 2016 Nov 28.
3
Analysis of protein-coding genetic variation in 60,706 humans.对60706名人类的蛋白质编码基因变异进行分析。
Nature. 2016 Aug 18;536(7616):285-91. doi: 10.1038/nature19057.
4
Towards precision medicine.迈向精准医学。
Nat Rev Genet. 2016 Aug 16;17(9):507-22. doi: 10.1038/nrg.2016.86.
5
Advancements in Next-Generation Sequencing.下一代测序技术的进展。
Annu Rev Genomics Hum Genet. 2016 Aug 31;17:95-115. doi: 10.1146/annurev-genom-083115-022413. Epub 2016 Jun 9.
6
The real cost of sequencing: scaling computation to keep pace with data generation.测序的实际成本:扩展计算能力以跟上数据生成的步伐。
Genome Biol. 2016 Mar 23;17:53. doi: 10.1186/s13059-016-0917-0.
7
Systematic comparison of variant calling pipelines using gold standard personal exome variants.使用金标准个人外显子变体对变异检测流程进行系统比较。
Sci Rep. 2015 Dec 7;5:17875. doi: 10.1038/srep17875.
8
Whole-Genome Sequencing Suggests Schizophrenia Risk Mechanisms in Humans with 22q11.2 Deletion Syndrome.全基因组测序揭示22q11.2缺失综合征患者的精神分裂症风险机制。
G3 (Bethesda). 2015 Sep 16;5(11):2453-61. doi: 10.1534/g3.115.021345.
9
Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants.在检测外显子变异方面,全基因组测序比全外显子测序更强大。
Proc Natl Acad Sci U S A. 2015 Apr 28;112(17):5473-8. doi: 10.1073/pnas.1418631112. Epub 2015 Mar 31.
10
Population genetics identifies challenges in analyzing rare variants.群体遗传学揭示了分析罕见变异时所面临的挑战。
Genet Epidemiol. 2015 Mar;39(3):145-8. doi: 10.1002/gepi.21881. Epub 2015 Jan 12.