• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

或许:基于下一代测序数据的 Paired-End 短读段 HAP 分型。

PERHAPS: Paired-End short Reads-based HAPlotyping from next-generation Sequencing data.

机构信息

Department of Global Health, School of Public Health, Peking University, Beijing, China.

Genetics and Animal Breeding Group, School of Pharmacy, University of Camerino, Italy.

出版信息

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa320.

DOI:10.1093/bib/bbaa320
PMID:33285565
Abstract

The identification of rare haplotypes may greatly expand our knowledge in the genetic architecture of both complex and monogenic traits. To this aim, we developed PERHAPS (Paired-End short Reads-based HAPlotyping from next-generation Sequencing data), a new and simple approach to directly call haplotypes from short-read, paired-end Next Generation Sequencing (NGS) data. To benchmark this method, we considered the APOE classic polymorphism (*1/*2/*3/4), since it represents one of the best examples of functional polymorphism arising from the haplotype combination of two Single Nucleotide Polymorphisms (SNPs). We leveraged the big Whole Exome Sequencing (WES) and SNP-array data obtained from the multi-ethnic UK BioBank (UKBB, N=48,855). By applying PERHAPS, based on piecing together the paired-end reads according to their FASTQ-labels, we extracted the haplotype data, along with their frequencies and the individual diplotype. Concordance rates between WES directly called diplotypes and the ones generated through statistical pre-phasing and imputation of SNP-array data are extremely high (>99%), either when stratifying the sample by SNP-array genotyping batch or self-reported ethnic group. Hardy-Weinberg Equilibrium tests and the comparison of obtained haplotype frequencies with the ones available from the 1000 Genome Project further supported the reliability of PERHAPS. Notably, we were able to determine the existence of the rare APOE1 haplotype in two unrelated African subjects from UKBB, supporting its presence at appreciable frequency (approximatively 0.5%) in the African Yoruba population. Despite acknowledging some technical shortcomings, PERHAPS represents a novel and simple approach that will partly overcome the limitations in direct haplotype calling from short read-based sequencing.

摘要

从短读长、配对末端下一代测序 (NGS) 数据中直接调用单倍型的新方法,即 PERHAPS(基于配对末端短读的单倍型推断下一代测序数据)。为了验证该方法,我们考虑了 APOE 经典多态性(*1/*2/*3/4),因为它代表了由两个单核苷酸多态性 (SNP) 的单倍型组合引起的功能多态性的最佳范例之一。我们利用从多民族英国生物银行 (UKBB,N=48,855) 获得的全外显子组测序 (WES) 和 SNP 芯片数据。通过应用 PERHAPS,根据其 FASTQ 标签拼接配对末端读取,我们提取了单倍型数据及其频率和个体二倍型。通过 WES 直接调用的二倍型与 SNP 芯片数据的统计预相和导入生成的二倍型之间的一致性率极高 (>99%),无论是按 SNP 芯片基因型批次分层还是按自我报告的族群分层。Hardy-Weinberg 平衡检验和获得的单倍型频率与 1000 基因组计划提供的频率的比较进一步支持了 PERHAPS 的可靠性。值得注意的是,我们能够确定 UKBB 中的两个无关非洲个体中存在罕见的 APOE1 单倍型,支持其在非洲约鲁巴人群中存在可观的频率(约 0.5%)。尽管承认存在一些技术缺陷,但 PERHAPS 代表了一种新颖而简单的方法,将部分克服从基于短读的测序中直接调用单倍型的局限性。

相似文献

1
PERHAPS: Paired-End short Reads-based HAPlotyping from next-generation Sequencing data.或许:基于下一代测序数据的 Paired-End 短读段 HAP 分型。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa320.
2
Leveraging reads that span multiple single nucleotide polymorphisms for haplotype inference from sequencing data.利用跨越多个单核苷酸多态性的读取信息,从测序数据中推断单倍型。
Bioinformatics. 2013 Sep 15;29(18):2245-52. doi: 10.1093/bioinformatics/btt386. Epub 2013 Jul 3.
3
A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings.一种在临床环境中直接进行单体型定相的长读测序方法。
Int J Mol Sci. 2020 Dec 1;21(23):9177. doi: 10.3390/ijms21239177.
4
Integrating read-based and population-based phasing for dense and accurate haplotyping of individual genomes.基于读取和基于群体的相位整合,实现个体基因组的密集和精确单倍型分型。
Bioinformatics. 2019 Jul 15;35(14):i242-i248. doi: 10.1093/bioinformatics/btz329.
5
Multi-centre evaluation of a comprehensive preimplantation genetic test through haplotyping-by-sequencing.通过测序单倍型分析对综合植入前遗传学检测的多中心评估。
Hum Reprod. 2019 Aug 1;34(8):1608-1619. doi: 10.1093/humrep/dez106.
6
Haplotype phasing in single-cell DNA-sequencing data.单细胞 DNA 测序数据中的单倍型相位。
Bioinformatics. 2018 Jul 1;34(13):i211-i217. doi: 10.1093/bioinformatics/bty286.
7
WhatsHap: Weighted Haplotype Assembly for Future-Generation Sequencing Reads.WhatsHap:用于下一代测序读数的加权单倍型组装
J Comput Biol. 2015 Jun;22(6):498-509. doi: 10.1089/cmb.2014.0157. Epub 2015 Feb 6.
8
HaploMaker: An improved algorithm for rapid haplotype assembly of genomic sequences.HaploMaker:一种用于快速组装基因组序列单倍型的改进算法。
Gigascience. 2022 May 17;11. doi: 10.1093/gigascience/giac038.
9
Archived neonatal dried blood spot samples can be used for accurate whole genome and exome-targeted next-generation sequencing.存档的新生儿干血斑样本可用于准确的全基因组和外显子靶向下一代测序。
Mol Genet Metab. 2013 Sep-Oct;110(1-2):65-72. doi: 10.1016/j.ymgme.2013.06.004. Epub 2013 Jun 13.
10
Imputation-based assessment of next generation rare exome variant arrays.基于插补法的新一代罕见外显子变异阵列评估
Pac Symp Biocomput. 2014:241-52.

引用本文的文献

1
High-throughput method for the hybridisation-based targeted enrichment of long genomic fragments for PacBio third-generation sequencing.用于基于杂交的长基因组片段靶向富集以进行PacBio第三代测序的高通量方法。
NAR Genom Bioinform. 2022 Jul 13;4(3):lqac051. doi: 10.1093/nargab/lqac051. eCollection 2022 Sep.
2
Challenges at the APOE locus: a robust quality control approach for accurate APOE genotyping.APOE 基因座的挑战:一种用于准确 APOE 基因分型的稳健质量控制方法。
Alzheimers Res Ther. 2022 Feb 4;14(1):22. doi: 10.1186/s13195-022-00962-4.