• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

trioPhaser:利用孟德尔遗传逻辑提高三体型的基因组相位。

trioPhaser: using Mendelian inheritance logic to improve genomic phasing of trios.

机构信息

Department of Biology, Brigham Young University, Provo, UT, 84602, USA.

出版信息

BMC Bioinformatics. 2021 Nov 22;22(1):559. doi: 10.1186/s12859-021-04470-4.

DOI:10.1186/s12859-021-04470-4
PMID:34809557
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8607709/
Abstract

BACKGROUND

When analyzing DNA sequence data of an individual, knowing which nucleotide was inherited from each parent can be beneficial when trying to identify certain types of DNA variants. Mendelian inheritance logic can be used to accurately phase (haplotype) the majority (67-83%) of an individual's heterozygous nucleotide positions when genotypes are available for both parents (trio). However, when all members of a trio are heterozygous at a position, Mendelian inheritance logic cannot be used to phase. For such positions, a computational phasing algorithm can be used. Existing phasing algorithms use a haplotype reference panel, sequencing reads, and/or parental genotypes to phase an individual; however, they are limited in that they can only phase certain types of variants, require a specific genotype build, require large amounts of storage capacity, and/or require long run times. We created trioPhaser to address these challenges.

RESULTS

trioPhaser uses gVCF files from an individual and their parents as initial input, and then outputs a phased VCF file. Input trio data are first phased using Mendelian inheritance logic. Then, the positions that cannot be phased using inheritance information alone are phased by the SHAPEIT4 phasing algorithm. Using whole-genome sequencing data of 52 trios, we show that trioPhaser, on average, increases the total number of phased positions by 21.0% and 10.5%, respectively, when compared to the number of positions that SHAPEIT4 or Mendelian inheritance logic can phase when either is used alone. In addition, we show that the accuracy of the phased calls output by trioPhaser are similar to linked-read and read-backed phasing.

CONCLUSION

trioPhaser is a containerized software tool that uses both Mendelian inheritance logic and SHAPEIT4 to phase trios when gVCF files are available. By implementing both phasing methods, more variant positions are phased compared to what either method is able to phase alone.

摘要

背景

当分析个体的 DNA 序列数据时,了解每个亲本遗传的核苷酸可以帮助识别某些类型的 DNA 变体。当父母双方(三亲)的基因型可用时,可以使用孟德尔遗传逻辑准确地对个体的大多数(67-83%)杂合核苷酸位置进行相位(单倍型)。然而,当三亲的所有成员在一个位置都是杂合子时,就不能使用孟德尔遗传逻辑进行相位。对于这种情况,可以使用计算相位算法。现有的相位算法使用单倍型参考面板、测序reads 和/或亲本基因型来对个体进行相位;然而,它们受到限制,因为它们只能相位特定类型的变体,需要特定的基因型构建,需要大量的存储容量,并且/或者需要长时间运行。我们创建了 trioPhaser 来解决这些挑战。

结果

trioPhaser 使用个体及其父母的 gVCF 文件作为初始输入,然后输出一个相位 VCF 文件。输入的三亲数据首先使用孟德尔遗传逻辑进行相位。然后,无法仅使用遗传信息进行相位的位置由 SHAPEIT4 相位算法进行相位。使用 52 个三亲的全基因组测序数据,我们表明,与仅使用 SHAPEIT4 或孟德尔遗传逻辑时可以相位的位置数量相比,trioPhaser 平均分别将相位位置的总数增加了 21.0%和 10.5%。此外,我们表明 trioPhaser 输出的相位调用的准确性与链接读取和读取回推相位相似。

结论

trioPhaser 是一个容器化软件工具,当有 gVCF 文件时,它使用孟德尔遗传逻辑和 SHAPEIT4 对三亲进行相位。通过同时实现这两种相位方法,与每种方法单独相位相比,更多的变体位置被相位。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2cd/8607709/605180d85058/12859_2021_4470_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2cd/8607709/605180d85058/12859_2021_4470_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2cd/8607709/605180d85058/12859_2021_4470_Fig1_HTML.jpg

相似文献

1
trioPhaser: using Mendelian inheritance logic to improve genomic phasing of trios.trioPhaser:利用孟德尔遗传逻辑提高三体型的基因组相位。
BMC Bioinformatics. 2021 Nov 22;22(1):559. doi: 10.1186/s12859-021-04470-4.
2
Benchmarking phasing software with a whole-genome sequenced cattle pedigree.利用全基因组测序的牛系谱对相位软件进行基准测试。
BMC Genomics. 2022 Feb 15;23(1):130. doi: 10.1186/s12864-022-08354-6.
3
Read-based phasing of related individuals.相关个体的基于读取的定相分析。
Bioinformatics. 2016 Jun 15;32(12):i234-i242. doi: 10.1093/bioinformatics/btw276.
4
Genotype calling and phasing using next-generation sequencing reads and a haplotype scaffold.使用下一代测序reads 和单倍型支架进行基因型调用和相位分析。
Bioinformatics. 2013 Jan 1;29(1):84-91. doi: 10.1093/bioinformatics/bts632. Epub 2012 Oct 23.
5
A strategy to improve phasing of whole-genome sequenced individuals through integration of familial information from dense genotype panels.一种通过整合来自密集基因型面板的家族信息来改善全基因组测序个体相位的策略。
Genet Sel Evol. 2017 May 16;49(1):46. doi: 10.1186/s12711-017-0321-6.
6
Integrating read-based and population-based phasing for dense and accurate haplotyping of individual genomes.基于读取和基于群体的相位整合,实现个体基因组的密集和精确单倍型分型。
Bioinformatics. 2019 Jul 15;35(14):i242-i248. doi: 10.1093/bioinformatics/btz329.
7
HaplotypeTools: a toolkit for accurately identifying recombination and recombinant genotypes.单体型分析工具:一个用于准确识别重组和重组基因型的工具包。
BMC Bioinformatics. 2021 Nov 22;22(1):560. doi: 10.1186/s12859-021-04473-1.
8
Complete genome phasing of family quartet by combination of genetic, physical and population-based phasing analysis.通过遗传、物理和基于人群的相位分析相结合,对家系四重奏进行全基因组相位分析。
PLoS One. 2013 May 31;8(5):e64571. doi: 10.1371/journal.pone.0064571. Print 2013.
9
Off the street phasing (OTSP): no hassle haplotype phasing for molecular PGD applications.街头去相(OTSP):分子 PGD 应用中无烦恼单体型相。
J Assist Reprod Genet. 2019 Apr;36(4):727-739. doi: 10.1007/s10815-018-1392-1. Epub 2019 Jan 8.
10
Comparison of phasing strategies for whole human genomes.全人类基因组相位策略比较。
PLoS Genet. 2018 Apr 5;14(4):e1007308. doi: 10.1371/journal.pgen.1007308. eCollection 2018 Apr.

引用本文的文献

1
Estimating Gene Conversion Tract Length and Rate From PacBio HiFi Data.从PacBio HiFi数据估计基因转换片段长度和速率
Mol Biol Evol. 2025 Feb 3;42(2). doi: 10.1093/molbev/msaf019.
2
haploMAGIC: accurate phasing and detection of recombination in multiparental populations despite genotyping errors.haploMAGIC:即使存在基因分型错误,也能在多亲本群体中准确进行相位和重组检测。
G3 (Bethesda). 2024 Aug 7;14(8). doi: 10.1093/g3journal/jkae109.
3
Using existing pediatric cancer data from the Gabriella Miller Kids First Data Resource Program.

本文引用的文献

1
Computational methods for chromosome-scale haplotype reconstruction.染色体级别的单倍型重构的计算方法。
Genome Biol. 2021 Apr 12;22(1):101. doi: 10.1186/s13059-021-02328-9.
2
A Survey of Compound Heterozygous Variants in Pediatric Cancers and Structural Birth Defects.小儿癌症和结构性出生缺陷中的复合杂合变异体调查
Front Genet. 2021 Mar 22;12:640242. doi: 10.3389/fgene.2021.640242. eCollection 2021.
3
CompoundHetVIP: Compound Heterozygous Variant Identification Pipeline.复合杂合变异鉴定管道(CompoundHetVIP)。
利用 Gabriella Miller 儿童第一数据资源计划现有的儿科癌症数据。
JNCI Cancer Spectr. 2023 Oct 31;7(6). doi: 10.1093/jncics/pkad079.
4
A novel GATA2 distal enhancer mutation results in MonoMAC syndrome in 2 second cousins.一种新的GATA2远端增强子突变导致2名二级表亲患单核细胞性噬血细胞综合征。
Blood Adv. 2023 Oct 24;7(20):6351-6363. doi: 10.1182/bloodadvances.2023010458.
F1000Res. 2020 Oct 8;9:1211. doi: 10.12688/f1000research.26848.2. eCollection 2020.
4
Compound Heterozygous Variants in Pediatric Cancers: A Systematic Review.儿童癌症中的复合杂合变异:一项系统综述。
Front Genet. 2020 May 19;11:493. doi: 10.3389/fgene.2020.00493. eCollection 2020.
5
Accurate, scalable and integrative haplotype estimation.精确、可扩展且综合的单倍型估计。
Nat Commun. 2019 Nov 28;10(1):5436. doi: 10.1038/s41467-019-13225-y.
6
The Importance of Data Compression in the Field of Genomics.数据压缩在基因组学领域的重要性。
IEEE Pulse. 2019 Mar-Apr;10(2):20-23. doi: 10.1109/MPULS.2019.2899747.
7
An open resource for accurately benchmarking small variant and reference calls.用于准确基准测试小型变体和参考调用的开放资源。
Nat Biotechnol. 2019 May;37(5):561-566. doi: 10.1038/s41587-019-0074-6. Epub 2019 Apr 1.
8
Comparison of phasing strategies for whole human genomes.全人类基因组相位策略比较。
PLoS Genet. 2018 Apr 5;14(4):e1007308. doi: 10.1371/journal.pgen.1007308. eCollection 2018 Apr.
9
Haplotyping germline and cancer genomes with high-throughput linked-read sequencing.利用高通量连锁读长测序对种系和癌症基因组进行单倍型分型
Nat Biotechnol. 2016 Mar;34(3):303-11. doi: 10.1038/nbt.3432. Epub 2016 Feb 1.
10
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.