• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CAPG:综合异源多倍体基因型分析器。

CAPG: comprehensive allopolyploid genotyper.

机构信息

Department of Agronomy, Iowa State University, Ames, IA 50011, USA.

Department of Statistics, Iowa State University, Ames, IA 50011, USA.

出版信息

Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac729.

DOI:10.1093/bioinformatics/btac729
PMID:36367243
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9825759/
Abstract

MOTIVATION

Genotyping by sequencing is a powerful tool for investigating genetic variation in plants, but many economically important plants are allopolyploids, where homoeologous similarity obscures the subgenomic origin of reads and confounds allelic and homoeologous SNPs. Recent polyploid genotyping methods use allelic frequencies, rate of heterozygosity, parental cross or other information to resolve read assignment, but good subgenomic references offer the most direct information. The typical strategy aligns reads to the joint reference, performs diploid genotyping within each subgenome, and filters the results, but persistent read misassignment results in an excess of false heterozygous calls.

RESULTS

We introduce the Comprehensive Allopolyploid Genotyper (CAPG), which formulates an explicit likelihood to weight read alignments against both subgenomic references and genotype individual allopolyploids from whole-genome resequencing data. We demonstrate CAPG in allotetraploids, where it performs better than Genome Analysis Toolkit's HaplotypeCaller applied to reads aligned to the combined subgenomic references.

AVAILABILITY AND IMPLEMENTATION

Code and tutorials are available at https://github.com/Kkulkarni1/CAPG.git.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

测序基因分型是研究植物遗传变异的有力工具,但许多具有经济重要性的植物都是异源多倍体,其中同源相似性掩盖了读取的亚基因组起源,并混淆了等位基因和同源 SNP。最近的多倍体基因分型方法使用等位基因频率、杂合率、亲本杂交或其他信息来解决读取分配问题,但良好的亚基因组参考提供了最直接的信息。典型的策略是将读取与联合参考对齐,在每个亚基因组内执行二倍体基因分型,并过滤结果,但持续的读取错误分配会导致假杂合调用过多。

结果

我们引入了综合异源多倍体基因分型器 (CAPG),它针对亚基因组参考和从全基因组重测序数据中个体异源多倍体的基因型制定了显式似然性,以对读取进行加权。我们在异源四倍体中展示了 CAPG,它比应用于对齐到组合亚基因组参考的读取的基因组分析工具包的 HaplotypeCaller 表现更好。

可用性和实现

代码和教程可在 https://github.com/Kkulkarni1/CAPG.git 获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6989/9825759/09b6c1d8f7f2/btac729f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6989/9825759/1485a2d0e8b3/btac729f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6989/9825759/ac7f38ce6ea0/btac729f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6989/9825759/51c08293a45b/btac729f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6989/9825759/5a135743c1e0/btac729f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6989/9825759/09b6c1d8f7f2/btac729f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6989/9825759/1485a2d0e8b3/btac729f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6989/9825759/ac7f38ce6ea0/btac729f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6989/9825759/51c08293a45b/btac729f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6989/9825759/5a135743c1e0/btac729f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6989/9825759/09b6c1d8f7f2/btac729f5.jpg

相似文献

1
CAPG: comprehensive allopolyploid genotyper.CAPG:综合异源多倍体基因型分析器。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac729.
2
SVJedi: genotyping structural variations with long reads.使用长读长进行基因分型结构变异。
Bioinformatics. 2020 Nov 1;36(17):4568-4575. doi: 10.1093/bioinformatics/btaa527.
3
SNP genotyping and parameter estimation in polyploids using low-coverage sequencing data.使用低覆盖度测序数据进行多倍体的 SNP 基因分型和参数估计。
Bioinformatics. 2018 Feb 1;34(3):407-415. doi: 10.1093/bioinformatics/btx587.
4
One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies.一刀切并不适用——RefEditor:构建个性化二倍体参考基因组以改善下一代测序研究中的读段映射和基因型调用
PLoS Comput Biol. 2015 Aug 12;11(8):e1004448. doi: 10.1371/journal.pcbi.1004448. eCollection 2015 Aug.
5
Genotyping inversions and tandem duplications.基因分型倒位和串联重复。
Bioinformatics. 2017 Dec 15;33(24):4015-4023. doi: 10.1093/bioinformatics/btx020.
6
QuASAR: quantitative allele-specific analysis of reads.QuASAR:读取的定量等位基因特异性分析。
Bioinformatics. 2015 Apr 15;31(8):1235-42. doi: 10.1093/bioinformatics/btu802. Epub 2014 Dec 4.
7
Seq2Sat and SatAnalyzer toolkit: Towards comprehensive microsatellite genotyping from sequencing data.Seq2Sat 和 SatAnalyzer 工具包:从测序数据中实现全面的微卫星基因分型。
Mol Ecol Resour. 2024 Apr;24(3):e13929. doi: 10.1111/1755-0998.13929. Epub 2024 Jan 30.
8
Simple, rapid and accurate genotyping-by-sequencing from aligned whole genomes with ArrayMaker.使用ArrayMaker从比对的全基因组中进行简单、快速且准确的测序基因分型。
Bioinformatics. 2015 Feb 15;31(4):599-601. doi: 10.1093/bioinformatics/btu691. Epub 2014 Oct 21.
9
Using genotype array data to compare multi- and single-sample variant calls and improve variant call sets from deep coverage whole-genome sequencing data.利用基因型阵列数据比较多样本和单样本变异检测结果,并改进来自深度覆盖全基因组测序数据的变异检测集。
Bioinformatics. 2017 Apr 15;33(8):1147-1153. doi: 10.1093/bioinformatics/btw786.
10
trfermikit: a tool to discover VNTR-associated deletions.trfermikit:一种用于发现 VNTR 相关缺失的工具。
Bioinformatics. 2022 Feb 7;38(5):1231-1234. doi: 10.1093/bioinformatics/btab805.

引用本文的文献

1
Demographic history inference and the polyploid continuum.人口历史推断与多倍体连续统。
Genetics. 2023 Aug 9;224(4). doi: 10.1093/genetics/iyad107.

本文引用的文献

1
Use of Targeted Amplicon Sequencing in Peanut to Generate Allele Information on Allotetraploid Sub-Genomes.利用靶向扩增子测序技术在花生中生成所有四倍体亚基因组的等位基因信息。
Genes (Basel). 2020 Oct 18;11(10):1220. doi: 10.3390/genes11101220.
2
Sequencing Multiple Cotton Genomes Reveals Complex Structures and Lays Foundation for Breeding.多个棉花基因组测序揭示复杂结构并为育种奠定基础。
Front Plant Sci. 2020 Sep 16;11:560096. doi: 10.3389/fpls.2020.560096. eCollection 2020.
3
AmpliCI: a high-resolution model-based approach for denoising Illumina amplicon data.
AmpliCI:一种基于高分辨率模型的 Illumina 扩增子数据去噪方法。
Bioinformatics. 2021 Jan 29;36(21):5151-5158. doi: 10.1093/bioinformatics/btaa648.
4
Comparison of SNP Calling Pipelines and NGS Platforms to Predict the Genomic Regions Harboring Candidate Genes for Nodulation in Cultivated Peanut.用于预测栽培花生中与结瘤相关候选基因所在基因组区域的SNP检测流程和NGS平台的比较
Front Genet. 2020 Mar 24;11:222. doi: 10.3389/fgene.2020.00222. eCollection 2020.
5
The genome sequence of segmental allotetraploid peanut Arachis hypogaea.花生基因组序列:片段异源四倍体 Arachis hypogaea。
Nat Genet. 2019 May;51(5):877-884. doi: 10.1038/s41588-019-0405-z. Epub 2019 May 1.
6
Whole-genome resequencing reveals Brassica napus origin and genetic loci involved in its improvement.全基因组重测序揭示了油菜的起源和参与其改良的遗传位点。
Nat Commun. 2019 Mar 11;10(1):1154. doi: 10.1038/s41467-019-09134-9.
7
polyRAD: Genotype Calling with Uncertainty from Sequencing Data in Polyploids and Diploids.polyRAD:多倍体和二倍体测序数据不确定性下的基因型分型
G3 (Bethesda). 2019 Mar 7;9(3):663-673. doi: 10.1534/g3.118.200913.
8
Reference genome sequences of two cultivated allotetraploid cottons, Gossypium hirsutum and Gossypium barbadense.两个栽培的异源四倍体棉花(陆地棉和海岛棉)的参考基因组序列。
Nat Genet. 2019 Feb;51(2):224-229. doi: 10.1038/s41588-018-0282-x. Epub 2018 Dec 3.
9
Genotyping Polyploids from Messy Sequencing Data.从混杂测序数据中进行多倍体基因型分析。
Genetics. 2018 Nov;210(3):789-807. doi: 10.1534/genetics.118.301468. Epub 2018 Sep 5.
10
Haplotype-Based Genotyping in Polyploids.多倍体中基于单倍型的基因分型
Front Plant Sci. 2018 Apr 26;9:564. doi: 10.3389/fpls.2018.00564. eCollection 2018.