• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用混合测序数据评估等位基因频率估计准确性。

An evaluation of allele frequency estimation accuracy using pooled sequencing data.

作者信息

Guo Yan, Cai Qiuyin, Li Chun, Li Jiang, Courtney Regina, Zheng Wei, Long Jirong

机构信息

Department of Cancer Biology, Vanderbilt University, Nashville TN 37232, USA.

出版信息

Int J Comput Biol Drug Des. 2013;6(4):279-93. doi: 10.1504/IJCBDD.2013.056709. Epub 2013 Sep 30.

DOI:10.1504/IJCBDD.2013.056709
PMID:24088264
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4108916/
Abstract

Next generation sequencing technology has matured, and with its current affordability, will replace the SNP chip as the genotyping tool of choice. Even with the current affordability of NGS, large scale studies will require careful study design to reduce cost. In this study, we designed an experiment to assess the accuracy of allele frequency estimated from pooled sequencing data. We compared the allele frequency estimated from sequencing data with the allele frequency estimated from individual SNP chip data and observed high correlations between them. However, by calculating error rate, we found that many SNPs had their allele frequency estimated from sequencing data significantly different from allele frequency estimated from SNP chip data. In conclusion, we found correlation is not an ideal measurement for comparing allele frequencies. And for the purpose of estimating allele frequency, we do not recommend using pooling with NGS as a cheaper alternative to genotype each sample individually.

摘要

新一代测序技术已经成熟,鉴于其目前的可承受性,它将取代单核苷酸多态性(SNP)芯片成为首选的基因分型工具。即使在目前新一代测序技术具有可承受性的情况下,大规模研究仍需要精心设计研究方案以降低成本。在本研究中,我们设计了一项实验来评估从混合测序数据估计的等位基因频率的准确性。我们将从测序数据估计的等位基因频率与从单个SNP芯片数据估计的等位基因频率进行了比较,并观察到它们之间具有高度相关性。然而,通过计算错误率,我们发现许多单核苷酸多态性位点从测序数据估计的等位基因频率与从SNP芯片数据估计的等位基因频率存在显著差异。总之,我们发现相关性并非比较等位基因频率的理想指标。并且出于估计等位基因频率的目的,我们不建议将新一代测序技术的混合样本法作为逐个对每个样本进行基因分型的更廉价替代方法。

相似文献

1
An evaluation of allele frequency estimation accuracy using pooled sequencing data.使用混合测序数据评估等位基因频率估计准确性。
Int J Comput Biol Drug Des. 2013;6(4):279-93. doi: 10.1504/IJCBDD.2013.056709. Epub 2013 Sep 30.
2
A unified approach for allele frequency estimation, SNP detection and association studies based on pooled sequencing data using EM algorithms.基于 EM 算法的基于测序数据的等位基因频率估计、SNP 检测和关联研究的统一方法。
BMC Genomics. 2013;14 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2164-14-S1-S1. Epub 2013 Jan 21.
3
Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.).从大西洋鲑鱼(Salmo salar L.)混合 DNA 中进行经济有效的全基因组等位基因频率估计。
BMC Genomics. 2013 Jan 16;14:12. doi: 10.1186/1471-2164-14-12.
4
Evaluation of allele frequency estimation using pooled sequencing data simulation.使用混合测序数据模拟评估等位基因频率估计
ScientificWorldJournal. 2013;2013:895496. doi: 10.1155/2013/895496. Epub 2013 Feb 7.
5
Estimation of population allele frequencies from next-generation sequencing data: pool-versus individual-based genotyping.基于下一代测序数据的群体等位基因频率估计:基于池与个体的基因分型。
Mol Ecol. 2013 Jul;22(14):3766-79. doi: 10.1111/mec.12360. Epub 2013 Jun 4.
6
Validation of pooled genotyping on the Affymetrix 500 k and SNP6.0 genotyping platforms using the polynomial-based probe-specific correction.基于多项式探针特异性校正的 Affymetrix 500k 和 SNP6.0 基因分型平台的 pooled genotyping 验证。
BMC Genet. 2009 Dec 14;10:82. doi: 10.1186/1471-2156-10-82.
7
Comparison of genotyping using pooled DNA samples (allelotyping) and individual genotyping using the affymetrix genome-wide human SNP array 6.0.使用汇集 DNA 样本(等位基因分型)和使用 Affymetrix 全基因组人类 SNP 阵列 6.0 进行个体基因分型的比较。
BMC Genomics. 2013 Jul 26;14:506. doi: 10.1186/1471-2164-14-506.
8
Investigation of rare and low-frequency variants using high-throughput sequencing with pooled DNA samples.使用高通量测序对混合 DNA 样本进行罕见和低频变异的研究。
Sci Rep. 2016 Sep 16;6:33256. doi: 10.1038/srep33256.
9
Fish scales and SNP chips: SNP genotyping and allele frequency estimation in individual and pooled DNA from historical samples of Atlantic salmon (Salmo salar).鱼鳞和 SNP 芯片:大西洋鲑(Salmo salar)历史样本个体和混合 DNA 中的 SNP 基因分型和等位基因频率估计。
BMC Genomics. 2013 Jul 3;14:439. doi: 10.1186/1471-2164-14-439.
10
Estimation of allele frequency and association mapping using next-generation sequencing data.利用下一代测序数据进行等位基因频率估计和关联作图。
BMC Bioinformatics. 2011 Jun 11;12:231. doi: 10.1186/1471-2105-12-231.

引用本文的文献

1
High-throughput estimation of allele frequencies using combined pooled-population sequencing and haplotype-based data processing.利用合并群体测序和基于单倍型的数据处理进行等位基因频率的高通量估计。
Plant Methods. 2022 Mar 21;18(1):34. doi: 10.1186/s13007-022-00852-8.
2
Comprehensive evaluation of extracellular small RNA isolation methods from serum in high throughput sequencing.高通量测序中血清细胞外小RNA分离方法的综合评价
BMC Genomics. 2017 Jan 7;18(1):50. doi: 10.1186/s12864-016-3470-z.
3
Multi-perspective quality control of Illumina RNA sequencing data analysis.Illumina RNA 测序数据分析的多角度质量控制。
Brief Funct Genomics. 2017 Jul 1;16(4):194-204. doi: 10.1093/bfgp/elw035.
4
Expression profile and gene age jointly shaped the genome-wide distribution of premature termination codons in a Drosophila melanogaster population.表达谱和基因年龄共同塑造了黑腹果蝇群体中全基因组范围内提前终止密码子的分布。
Mol Biol Evol. 2015 Jan;32(1):216-28. doi: 10.1093/molbev/msu299. Epub 2014 Nov 3.
5
Statistical strategies for microRNAseq batch effect reduction.用于减少微小RNA测序批次效应的统计策略。
Transl Cancer Res. 2014 Jun 1;3(3):260-265. doi: 10.3978/j.issn.2218-676X.2014.06.05.

本文引用的文献

1
Exome sequencing generates high quality data in non-target regions.外显子组测序在非靶区域产生高质量数据。
BMC Genomics. 2012 May 20;13:194. doi: 10.1186/1471-2164-13-194.
2
An evaluation of different target enrichment methods in pooled sequencing designs for complex disease association studies.在复杂疾病关联研究的合并测序设计中,不同目标富集方法的评估。
PLoS One. 2011;6(11):e26279. doi: 10.1371/journal.pone.0026279. Epub 2011 Nov 1.
3
Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease.全基因组关联研究位点的深度重测序鉴定出与炎症性肠病相关的独立稀有变异。
Nat Genet. 2011 Oct 9;43(11):1066-73. doi: 10.1038/ng.952.
4
Amplicon-based high-throughput pooled sequencing identifies mutations in CYP7B1 and SPG7 in sporadic spastic paraplegia patients.基于扩增子的高通量 pooled 测序鉴定散发性痉挛性截瘫患者 CYP7B1 和 SPG7 中的突变。
Clin Genet. 2011 Aug;80(2):148-60. doi: 10.1111/j.1399-0004.2011.01715.x. Epub 2011 Jun 13.
5
Genomic DNA pooling strategy for next-generation sequencing-based rare variant discovery in abdominal aortic aneurysm regions of interest-challenges and limitations.基于下一代测序的腹主动脉瘤感兴趣区域稀有变异发现的基因组 DNA 池化策略——挑战和局限性。
J Cardiovasc Transl Res. 2011 Jun;4(3):271-80. doi: 10.1007/s12265-011-9263-5. Epub 2011 Mar 1.
6
High-throughput, pooled sequencing identifies mutations in NUBPL and FOXRED1 in human complex I deficiency.高通量、池测序鉴定出人类复合体 I 缺陷中 NUBPL 和 FOXRED1 的突变。
Nat Genet. 2010 Oct;42(10):851-8. doi: 10.1038/ng.659. Epub 2010 Sep 5.
7
Genome-wide association study with DNA pooling identifies variants at CNTNAP2 associated with pseudoexfoliation syndrome.全基因组关联研究与 DNA 池化鉴定与假性剥脱综合征相关的 CNTNAP2 变体。
Eur J Hum Genet. 2011 Feb;19(2):186-93. doi: 10.1038/ejhg.2010.144. Epub 2010 Sep 1.
8
Pooled versus individual genotyping in a breast cancer genome-wide association study.在乳腺癌全基因组关联研究中进行合并分析与个体基因型分析。
Genet Epidemiol. 2010 Sep;34(6):603-12. doi: 10.1002/gepi.20517.
9
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.基因组分析工具包:一种用于分析下一代 DNA 测序数据的 MapReduce 框架。
Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.
10
The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants.Sanger 测序的 FASTQ 文件格式,用于包含质量分数的序列,以及 Solexa/Illumina FASTQ 变体。
Nucleic Acids Res. 2010 Apr;38(6):1767-71. doi: 10.1093/nar/gkp1137. Epub 2009 Dec 16.