• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过与基于芯片的基因分型和孟德尔遗传进行验证,评估使用单样本和多样本调用算法进行单核苷酸多态性(SNP)调用的情况。

Evaluation of SNP calling using single and multiple-sample calling algorithms by validation against array base genotyping and Mendelian inheritance.

作者信息

Kumar Pankaj, Al-Shafai Mashael, Al Muftah Wadha Ahmed, Chalhoub Nader, Elsaid Mahmoud F, Aleem Alice Abdel, Suhre Karsten

机构信息

Weill Cornell Medical College in Qatar, Education City, Doha, Qatar.

出版信息

BMC Res Notes. 2014 Oct 22;7:747. doi: 10.1186/1756-0500-7-747.

DOI:10.1186/1756-0500-7-747
PMID:25339461
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4216909/
Abstract

BACKGROUND

With diminishing costs of next generation sequencing (NGS), whole genome analysis becomes a standard tool for identifying genetic causes of inherited diseases. Commercial NGS service providers in general not only provide raw genomic reads, but further deliver SNP calls to their clients. However, the question for the user arises whether to use the SNP data as is, or process the raw sequencing data further through more sophisticated SNP calling pipelines with more advanced algorithms.

RESULTS

Here we report a detailed comparison of SNPs called using the popular GATK multiple-sample calling protocol to SNPs delivered as part of a 40x whole genome sequencing project by Illumina Inc of 171 human genomes of Arab descent (108 unrelated Qatari genomes, 19 trios, and 2 families with rare diseases) and compare them to variants provided by the Illumina CASAVA pipeline. GATK multi-sample calling identifies more variants than the CASAVA pipeline. The additional variants from GATK are robust for Mendelian consistencies but weak in terms of statistical parameters such as TsTv ratio. However, these additional variants do not make a difference in detecting the causative variants in the studied phenotype.

CONCLUSION

Both pipelines, GATK multi-sample calling and Illumina CASAVA single sample calling, have highly similar performance in SNP calling at the level of putatively causative variants.

摘要

背景

随着下一代测序(NGS)成本的降低,全基因组分析成为鉴定遗传性疾病遗传病因的标准工具。一般来说,商业NGS服务提供商不仅提供原始基因组读数,还会向客户提供单核苷酸多态性(SNP)位点的检测结果。然而,用户面临的问题是,是直接使用SNP数据,还是通过使用更先进算法的更复杂SNP检测流程对原始测序数据进行进一步处理。

结果

在此,我们报告了使用流行的基因组分析工具包(GATK)多样本检测协议所检测到的SNP与作为Illumina公司40倍全基因组测序项目一部分所提供的SNP的详细比较,该项目针对171个阿拉伯裔人类基因组(108个无亲缘关系的卡塔尔基因组、19个三联体以及2个患有罕见疾病的家族),并将其与Illumina CASAVA流程所提供的变异进行比较。GATK多样本检测比CASAVA流程识别出更多变异。来自GATK的额外变异在孟德尔一致性方面表现稳健,但在诸如转换/颠换(TsTv)比率等统计参数方面表现较弱。然而,这些额外变异在检测所研究表型的致病变异方面并无差异。

结论

在推测的致病变异水平上,GATK多样本检测流程和Illumina CASAVA单样本检测流程在SNP检测方面具有高度相似的性能。

相似文献

1
Evaluation of SNP calling using single and multiple-sample calling algorithms by validation against array base genotyping and Mendelian inheritance.通过与基于芯片的基因分型和孟德尔遗传进行验证,评估使用单样本和多样本调用算法进行单核苷酸多态性(SNP)调用的情况。
BMC Res Notes. 2014 Oct 22;7:747. doi: 10.1186/1756-0500-7-747.
2
Comparison of seven SNP calling pipelines for the next-generation sequencing data of chickens.比较用于鸡下一代测序数据的七种 SNP 调用管道。
PLoS One. 2022 Jan 31;17(1):e0262574. doi: 10.1371/journal.pone.0262574. eCollection 2022.
3
Variant callers for next-generation sequencing data: a comparison study.下一代测序数据的变异调用者:一项比较研究。
PLoS One. 2013 Sep 27;8(9):e75619. doi: 10.1371/journal.pone.0075619. eCollection 2013.
4
Assessing single nucleotide variant detection and genotype calling on whole-genome sequenced individuals.评估全基因组测序个体中单核苷酸变异检测和基因型调用。
Bioinformatics. 2014 Jun 15;30(12):1707-13. doi: 10.1093/bioinformatics/btu067. Epub 2014 Feb 19.
5
Comparing a few SNP calling algorithms using low-coverage sequencing data.比较几种使用低覆盖度测序数据的 SNP calling 算法。
BMC Bioinformatics. 2013 Sep 17;14:274. doi: 10.1186/1471-2105-14-274.
6
Copy number variations in the genome of the Qatari population.卡塔尔人群基因组中的拷贝数变异
BMC Genomics. 2015 Oct 22;16:834. doi: 10.1186/s12864-015-1991-5.
7
Comparison among three variant callers and assessment of the accuracy of imputation from SNP array data to whole-genome sequence level in chicken.鸡中三种变异检测工具的比较以及从SNP芯片数据到全基因组序列水平的填充准确性评估。
BMC Genomics. 2015 Oct 21;16:824. doi: 10.1186/s12864-015-2059-2.
8
Genome-Wide SNP Calling from Genotyping by Sequencing (GBS) Data: A Comparison of Seven Pipelines and Two Sequencing Technologies.基于测序基因分型(GBS)数据的全基因组单核苷酸多态性(SNP)检测:七种流程和两种测序技术的比较
PLoS One. 2016 Aug 22;11(8):e0161333. doi: 10.1371/journal.pone.0161333. eCollection 2016.
9
Evaluating the Calling Performance of a Rare Disease NGS Panel for Single Nucleotide and Copy Number Variants.评估用于单核苷酸和拷贝数变异的罕见病二代测序(NGS)面板的检测性能
Mol Diagn Ther. 2017 Jun;21(3):303-313. doi: 10.1007/s40291-017-0268-x.
10
Comparison of GATK and DeepVariant by trio sequencing.基于 trio 测序的 GATK 和 DeepVariant 比较。
Sci Rep. 2022 Feb 2;12(1):1809. doi: 10.1038/s41598-022-05833-4.

引用本文的文献

1
Fine-tuning GBS data with comparison of reference and mock genome approaches for advancing genomic selection in less studied farmed species.通过比较参考基因组和模拟基因组方法对无乳链球菌数据进行微调,以推进对研究较少的养殖物种的基因组选择。
BMC Genomics. 2025 Feb 5;26(1):111. doi: 10.1186/s12864-025-11296-4.
2
Integrative computational analyses implicate regulatory genomic elements contributing to spina bifida.综合计算分析表明,调控基因组元件与脊柱裂有关。
Genet Med Open. 2024 Sep 14;2:101894. doi: 10.1016/j.gimo.2024.101894. eCollection 2024.
3
Systems biology analysis of human genomes points to key pathways conferring spina bifida risk.

本文引用的文献

1
SNPAAMapper: An efficient genome-wide SNP variant analysis pipeline for next-generation sequencing data.SNPAAMapper:一种用于下一代测序数据的高效全基因组SNP变异分析流程。
Bioinformation. 2013 Oct 16;9(17):870-2. doi: 10.6026/97320630009870. eCollection 2013.
2
WEP: a high-performance analysis pipeline for whole-exome data.WEP:一种用于全外显子组数据的高性能分析管道。
BMC Bioinformatics. 2013;14 Suppl 7(Suppl 7):S11. doi: 10.1186/1471-2105-14-S7-S11. Epub 2013 Apr 22.
3
Isaac: ultra-fast whole-genome secondary analysis on Illumina sequencing platforms.
系统生物学分析人类基因组,指出导致脊柱裂风险的关键途径。
Proc Natl Acad Sci U S A. 2021 Dec 21;118(51). doi: 10.1073/pnas.2106844118.
4
Deep sequencing of DNA from urine of kidney allograft recipients to estimate donor/recipient-specific DNA fractions.对肾移植受者尿液中的DNA进行深度测序,以估计供体/受体特异性DNA片段。
PLoS One. 2021 Apr 15;16(4):e0249930. doi: 10.1371/journal.pone.0249930. eCollection 2021.
5
Accurate sequence variant genotyping in cattle using variation-aware genome graphs.利用变异感知基因组图谱对牛进行精确的序列变异基因分型。
Genet Sel Evol. 2019 May 15;51(1):21. doi: 10.1186/s12711-019-0462-x.
6
Integrated Analysis of Whole-Genome ChIP-Seq and RNA-Seq Data of Primary Head and Neck Tumor Samples Associates HPV Integration Sites with Open Chromatin Marks.原发性头颈肿瘤样本的全基因组ChIP-Seq和RNA-Seq数据的综合分析将人乳头瘤病毒(HPV)整合位点与开放染色质标记联系起来。
Cancer Res. 2017 Dec 1;77(23):6538-6550. doi: 10.1158/0008-5472.CAN-17-0833. Epub 2017 Sep 25.
7
Reanalysis of BRCA1/2 negative high risk ovarian cancer patients reveals novel germline risk loci and insights into missing heritability.对BRCA1/2基因阴性的高危卵巢癌患者进行再分析,发现了新的种系风险位点,并对遗传性缺失有了新认识。
PLoS One. 2017 Jun 7;12(6):e0178450. doi: 10.1371/journal.pone.0178450. eCollection 2017.
8
Advantages of Array-Based Technologies for Pre-Emptive Pharmacogenomics Testing.基于芯片技术在预防性药物基因组学检测中的优势。
Microarrays (Basel). 2016 May 28;5(2):12. doi: 10.3390/microarrays5020012.
9
Epigenetic associations of type 2 diabetes and BMI in an Arab population.阿拉伯人群中2型糖尿病与体重指数的表观遗传学关联。
Clin Epigenetics. 2016 Jan 28;8:13. doi: 10.1186/s13148-016-0177-6. eCollection 2016.
10
Genome-wide Polygenic Burden of Rare Deleterious Variants in Sudden Unexpected Death in Epilepsy.基因组范围内罕见有害变异在癫痫性猝死中的多基因负担
EBioMedicine. 2015 Jul 10;2(9):1063-70. doi: 10.1016/j.ebiom.2015.07.005. eCollection 2015 Sep.
艾萨克:在 Illumina 测序平台上进行超快速全基因组二级分析。
Bioinformatics. 2013 Aug 15;29(16):2041-3. doi: 10.1093/bioinformatics/btt314. Epub 2013 Jun 4.
4
Application of massively parallel sequencing to genetic diagnosis in multiplex families with idiopathic sensorineural hearing impairment.应用高通量测序技术对不明原因感音神经性聋家系进行基因诊断。
PLoS One. 2013;8(2):e57369. doi: 10.1371/journal.pone.0057369. Epub 2013 Feb 22.
5
A survey of tools for variant analysis of next-generation genome sequencing data.下一代基因组测序数据变异分析工具综述。
Brief Bioinform. 2014 Mar;15(2):256-78. doi: 10.1093/bib/bbs086. Epub 2013 Jan 21.
6
Atlas2 Cloud: a framework for personal genome analysis in the cloud.Atlas2 云:云端个人基因组分析框架。
BMC Genomics. 2012;13 Suppl 6(Suppl 6):S19. doi: 10.1186/1471-2164-13-S6-S19. Epub 2012 Oct 26.
7
Genotype calling and haplotyping in parent-offspring trios.对亲代-子代三体系进行基因型分型和单体型分型。
Genome Res. 2013 Jan;23(1):142-51. doi: 10.1101/gr.142455.112. Epub 2012 Oct 11.
8
SOAPindel: efficient identification of indels from short paired reads.SOAPindel:从短配对读取中有效识别插入缺失。
Genome Res. 2013 Jan;23(1):195-200. doi: 10.1101/gr.132480.111. Epub 2012 Sep 12.
9
A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.一个用于注释和预测单核苷酸多态性影响的程序,即SnpEff:黑腹果蝇品系w1118、iso-2、iso-3基因组中的单核苷酸多态性。
Fly (Austin). 2012 Apr-Jun;6(2):80-92. doi: 10.4161/fly.19695.
10
The 1000 Genomes Project: data management and community access.1000 基因组计划:数据管理与社区访问。
Nat Methods. 2012 Apr 27;9(5):459-62. doi: 10.1038/nmeth.1974.