• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
On the design and analysis of next-generation sequencing genotyping for a cohort with haplotype-informative reads.关于具有单倍型信息性读段的队列的下一代测序基因分型的设计与分析。
Methods. 2015 Jun;79-80:41-6. doi: 10.1016/j.ymeth.2015.01.016. Epub 2015 Jan 30.
2
Joint haplotype phasing and genotype calling of multiple individuals using haplotype informative reads.利用单倍型信息读长对多个个体进行联合单倍型相位确定和基因型调用。
Bioinformatics. 2013 Oct 1;29(19):2427-34. doi: 10.1093/bioinformatics/btt418. Epub 2013 Aug 13.
3
Leveraging reads that span multiple single nucleotide polymorphisms for haplotype inference from sequencing data.利用跨越多个单核苷酸多态性的读取信息,从测序数据中推断单倍型。
Bioinformatics. 2013 Sep 15;29(18):2245-52. doi: 10.1093/bioinformatics/btt386. Epub 2013 Jul 3.
4
Genotype calling from next-generation sequencing data using haplotype information of reads.基于读段单倍型信息进行下一代测序数据的基因型推断。
Bioinformatics. 2012 Apr 1;28(7):938-46. doi: 10.1093/bioinformatics/bts047. Epub 2012 Jan 27.
5
Genotype calling and phasing using next-generation sequencing reads and a haplotype scaffold.使用下一代测序reads 和单倍型支架进行基因型调用和相位分析。
Bioinformatics. 2013 Jan 1;29(1):84-91. doi: 10.1093/bioinformatics/bts632. Epub 2012 Oct 23.
6
A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings.一种在临床环境中直接进行单体型定相的长读测序方法。
Int J Mol Sci. 2020 Dec 1;21(23):9177. doi: 10.3390/ijms21239177.
7
A dynamic Bayesian Markov model for phasing and characterizing haplotypes in next-generation sequencing.一种用于下一代测序中相位和特征分析单倍型的动态贝叶斯马尔可夫模型。
Bioinformatics. 2013 Apr 1;29(7):878-85. doi: 10.1093/bioinformatics/btt065. Epub 2013 Feb 13.
8
Haplotype estimation using sequencing reads.使用测序reads 进行单体型估计。
Am J Hum Genet. 2013 Oct 3;93(4):687-96. doi: 10.1016/j.ajhg.2013.09.002.
9
A statistical variant calling approach from pedigree information and local haplotyping with phase informative reads.基于家系信息和局部单体型分析以及具有相位信息读段的统计变异calling 方法。
Bioinformatics. 2013 Nov 15;29(22):2835-43. doi: 10.1093/bioinformatics/btt503. Epub 2013 Sep 3.
10
PERHAPS: Paired-End short Reads-based HAPlotyping from next-generation Sequencing data.或许:基于下一代测序数据的 Paired-End 短读段 HAP 分型。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa320.

本文引用的文献

1
Sequencing depth and coverage: key considerations in genomic analyses.测序深度和覆盖度:基因组分析中的关键考虑因素。
Nat Rev Genet. 2014 Feb;15(2):121-32. doi: 10.1038/nrg3642.
2
Assessing the effect of sequencing depth and sample size in population genetics inferences.评估测序深度和样本量对群体遗传学推断的影响。
PLoS One. 2013 Nov 18;8(11):e79667. doi: 10.1371/journal.pone.0079667. eCollection 2013.
3
Haplotype estimation using sequencing reads.使用测序reads 进行单体型估计。
Am J Hum Genet. 2013 Oct 3;93(4):687-96. doi: 10.1016/j.ajhg.2013.09.002.
4
Joint haplotype phasing and genotype calling of multiple individuals using haplotype informative reads.利用单倍型信息读长对多个个体进行联合单倍型相位确定和基因型调用。
Bioinformatics. 2013 Oct 1;29(19):2427-34. doi: 10.1093/bioinformatics/btt418. Epub 2013 Aug 13.
5
Rare variant detection using family-based sequencing analysis.基于家系测序分析的罕见变异检测。
Proc Natl Acad Sci U S A. 2013 Mar 5;110(10):3985-90. doi: 10.1073/pnas.1222158110. Epub 2013 Feb 20.
6
Population genomics based on low coverage sequencing: how low should we go?基于低覆盖度测序的群体基因组学:我们应该低到什么程度?
Mol Ecol. 2013 Jun;22(11):3028-35. doi: 10.1111/mec.12105. Epub 2012 Nov 22.
7
An integrated map of genetic variation from 1,092 human genomes.1092 个人类基因组遗传变异的综合图谱。
Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.
8
Genotype calling and phasing using next-generation sequencing reads and a haplotype scaffold.使用下一代测序reads 和单倍型支架进行基因型调用和相位分析。
Bioinformatics. 2013 Jan 1;29(1):84-91. doi: 10.1093/bioinformatics/bts632. Epub 2012 Oct 23.
9
Analysis and optimal design for association studies using next-generation sequencing with case-control pools.使用病例对照样本池的新一代测序进行关联研究的分析与优化设计
Genet Epidemiol. 2012 Dec;36(8):870-81. doi: 10.1002/gepi.21681. Epub 2012 Sep 12.
10
A high-coverage genome sequence from an archaic Denisovan individual.古丹尼索瓦人个体的高覆盖度基因组序列。
Science. 2012 Oct 12;338(6104):222-6. doi: 10.1126/science.1224344. Epub 2012 Aug 30.

关于具有单倍型信息性读段的队列的下一代测序基因分型的设计与分析。

On the design and analysis of next-generation sequencing genotyping for a cohort with haplotype-informative reads.

作者信息

Zhi Degui, Liu Nianjun, Zhang Kui

机构信息

Section on Statistical Genetics, Department of Biostatistics, University of Alabama at Birmingham, Birmingham, AL 35294, United States.

Section on Statistical Genetics, Department of Biostatistics, University of Alabama at Birmingham, Birmingham, AL 35294, United States.

出版信息

Methods. 2015 Jun;79-80:41-6. doi: 10.1016/j.ymeth.2015.01.016. Epub 2015 Jan 30.

DOI:10.1016/j.ymeth.2015.01.016
PMID:25644447
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4437872/
Abstract

Next-generation sequencing (NGS) technologies, which can provide base-pair resolution genetic information for all types of genetic variations, are increasingly used in genetics research. However, due to the complex nature of NGS technologies and analytics and their relatively high cost, investigators face practical challenges for both design and analysis. These challenges are further complicated by recent methodological developments that make it possible to use haplotype information in sequencing reads. In light of these developments, we conducted comprehensive simulations to evaluate the effects of sequencing coverage, insert size of paired-end reads, and sample size on genotype calling and haplotype phasing in NGS studies. In contrast to previous studies that typically use idealized scenarios to tease out the effects of individual design and analytic decisions, we used a complete analytical pipeline from read mapping and variant detection to genotype calling and haplotype phasing so that we can assess the joint effects of multiple decisions and thus make more realistic recommendations to investigators. Consistent with previous studies, we found that the use of haplotype information in reads can improve the accuracy of genotype calling and haplotype phasing, and we also found that a mixture of short and long insert sizes of paired-end reads may offer even greater accuracy. However, this benefit is only clear in high coverage sequencing where variant detection is close to perfect. Finally, we observed that LD-based refinement methods do not always outperform single site based methods for genotype calling. Therefore, we should choose analytical methods that are appropriate to the sequencing coverage and sample size in order to use haplotype information in sequencing reads.

摘要

新一代测序(NGS)技术能够为所有类型的基因变异提供碱基对分辨率的遗传信息,在遗传学研究中的应用越来越广泛。然而,由于NGS技术及其分析方法的复杂性以及相对较高的成本,研究人员在设计和分析方面面临实际挑战。最近的方法学发展使得在测序读数中使用单倍型信息成为可能,这进一步加剧了这些挑战。鉴于这些发展,我们进行了全面的模拟,以评估测序覆盖度、双端读数的插入片段大小和样本量对NGS研究中基因型分型和单倍型定相的影响。与以往通常使用理想化场景来梳理单个设计和分析决策影响的研究不同,我们使用了从读段比对、变异检测到基因型分型和单倍型定相的完整分析流程,以便能够评估多个决策的联合影响,从而为研究人员提出更现实的建议。与以往研究一致,我们发现利用读数中的单倍型信息可以提高基因型分型和单倍型定相准确性,并且我们还发现双端读数中短插入片段大小和长插入片段大小混合使用可能会提供更高的准确性。然而,这种优势仅在变异检测接近完美的高覆盖度测序中才明显。最后,我们观察到基于连锁不平衡的优化方法在基因型分型方面并不总是优于基于单位点的方法。因此,为了在测序读数中使用单倍型信息,我们应该选择适合测序覆盖度和样本量的分析方法。