• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用 HetMap 从杂合植物基因组的低覆盖度序列数据中进行基因型调用和单倍型推断。

Genotype calling and haplotype inference from low coverage sequence data in heterozygous plant genome using HetMap.

机构信息

School of Life Science, Huizhou University, Huizhou, 516007, China.

State Key Laboratory of Plant Molecular Genetics, National Center for Gene Research, Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200233, China.

出版信息

Theor Appl Genet. 2022 Jun;135(6):2157-2166. doi: 10.1007/s00122-022-04105-z. Epub 2022 May 3.

DOI:10.1007/s00122-022-04105-z
PMID:35504967
Abstract

This study developed a new genotyping method that can accurately infer heterozygous genotype information from the complex plant genome sequence data, which helped discover new alleles in the association studies. Many software packages and pipelines had been developed to handle the sequence data of the model species. However, Genotyping from complex heterozygous plant genome needs further improvement on the previous methods. Here we present a new pipeline available at https://github.com/Ncgrhg/HetMapv1 ) for variant calling and missing genotype imputation from low coverage sequence data for heterozygous plant genomes. To check the performance of the HetMap on the real sequence data, HetMap was applied to both the F hybrid rice population, which consists of 1495 samples and the wild rice population with 446 samples. The high coverage sequence data of four hybrid rice accessions and two wild rice accessions, which were also included in low coverage sequence data, were used to validate the accuracy of genotype inference. The validation results showed that HetMap archieved significant improvement in heterozygous genotype inference accuracy (13.65% for hybrid rice, 26.05% for wild rice) and total accuracy compared with similar software packages. The application of the new genotype with the genome-wide association study also showed improvement of association power in wild rice awn length phenotype. It could archive high genotype inference accuracy in low sequence coverage in a small population with both the natural and constructed recombination population. HetMap provided a powerful tool for the heterozygous plant genome sequence data analysis, which may help to discover new phenotype regions for the plant species with the complex heterozygous genome.

摘要

本研究开发了一种新的基因分型方法,能够从复杂的植物基因组序列数据中准确推断杂合基因型信息,有助于在关联研究中发现新的等位基因。已经开发了许多软件包和流程来处理模式物种的序列数据。然而,复杂杂合植物基因组的基因分型需要对以前的方法进行进一步改进。我们在这里展示了一个新的流程,可在 https://github.com/Ncgrhg/HetMapv1 上获得,用于从低覆盖度的序列数据中对杂合植物基因组进行变异调用和缺失基因型推断。为了检查 HetMap 在真实序列数据上的性能,我们将 HetMap 应用于由 1495 个样本组成的 F 杂交水稻群体和由 446 个样本组成的野生稻群体。还将高覆盖度的四个杂交水稻品系和两个野生水稻品系的序列数据包含在低覆盖度的序列数据中,用于验证基因型推断的准确性。验证结果表明,与类似的软件包相比,HetMap 在杂合基因型推断准确性(杂交水稻为 13.65%,野生稻为 26.05%)和总体准确性方面都有显著提高。将新基因型应用于全基因组关联研究也显示出对野生稻芒长表型关联能力的提高。HetMap 可以在小群体中对自然和构建的重组群体进行低序列覆盖,实现高基因型推断准确性。HetMap 为复杂杂合基因组植物的序列数据分析提供了一个强大的工具,有助于发现新的表型区域。

相似文献

1
Genotype calling and haplotype inference from low coverage sequence data in heterozygous plant genome using HetMap.使用 HetMap 从杂合植物基因组的低覆盖度序列数据中进行基因型调用和单倍型推断。
Theor Appl Genet. 2022 Jun;135(6):2157-2166. doi: 10.1007/s00122-022-04105-z. Epub 2022 May 3.
2
A comparison of genotyping-by-sequencing analysis methods on low-coverage crop datasets shows advantages of a new workflow, GB-eaSy.对低覆盖作物数据集的测序分析方法的比较表明,一种新的工作流程 GB-eaSy 具有优势。
BMC Bioinformatics. 2017 Dec 28;18(1):586. doi: 10.1186/s12859-017-2000-6.
3
LinkImpute: Fast and Accurate Genotype Imputation for Nonmodel Organisms.LinkImpute:非模式生物的快速准确基因型填充
G3 (Bethesda). 2015 Sep 15;5(11):2383-90. doi: 10.1534/g3.115.021667.
4
Hybrid peeling for fast and accurate calling, phasing, and imputation with sequence data of any coverage in pedigrees.混合去卷积用于在任何覆盖度的家系序列数据中快速准确地进行调用、定相和插补。
Genet Sel Evol. 2018 Dec 18;50(1):67. doi: 10.1186/s12711-018-0438-2.
5
Low-depth genotyping-by-sequencing (GBS) in a bovine population: strategies to maximize the selection of high quality genotypes and the accuracy of imputation.牛群中的低深度测序基因分型(GBS):最大化高质量基因型选择和归因准确性的策略。
BMC Genet. 2017 Apr 5;18(1):32. doi: 10.1186/s12863-017-0501-y.
6
Increasing calling accuracy, coverage, and read-depth in sequence data by the use of haplotype blocks.通过使用单倍型块提高序列数据的呼叫准确率、覆盖率和读深度。
PLoS Genet. 2021 Dec 23;17(12):e1009944. doi: 10.1371/journal.pgen.1009944. eCollection 2021 Dec.
7
Genotype Imputation in Genome-Wide Association Studies.全基因组关联研究中的基因型填充
Curr Protoc Hum Genet. 2019 Jun;102(1):e84. doi: 10.1002/cphg.84.
8
Genotype-Corrector: improved genotype calls for genetic mapping in F and RIL populations.基因型校正器:改进 F 和 RIL 群体中遗传图谱的基因型调用。
Sci Rep. 2018 Jul 4;8(1):10088. doi: 10.1038/s41598-018-28294-0.
9
Fast imputation using medium or low-coverage sequence data.使用中等或低覆盖率序列数据进行快速插补。
BMC Genet. 2015 Jul 14;16:82. doi: 10.1186/s12863-015-0243-7.
10
SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation.SPEARS:通过模拟对祖先单倍型重建进行标准性能评估。
Bioinformatics. 2021 May 5;37(6):868-870. doi: 10.1093/bioinformatics/btaa749.

本文引用的文献

1
Genome-wide identification of agronomically important genes in outcrossing crops using OutcrossSeq.利用杂交测序技术对异交作物中的农艺重要基因进行全基因组鉴定。
Mol Plant. 2021 Apr 5;14(4):556-570. doi: 10.1016/j.molp.2021.01.003. Epub 2021 Jan 8.
2
Breeding crops to feed 10 billion.培育作物以养活 100 亿人。
Nat Biotechnol. 2019 Jul;37(7):744-754. doi: 10.1038/s41587-019-0152-9. Epub 2019 Jun 17.
3
A universal SNP and small-indel variant caller using deep neural networks.使用深度神经网络的通用 SNP 和小插入缺失变体调用器。
Nat Biotechnol. 2018 Nov;36(10):983-987. doi: 10.1038/nbt.4235. Epub 2018 Sep 24.
4
Evaluating Variant Calling Tools for Non-Matched Next-Generation Sequencing Data.评估用于非配对下一代测序数据的变异调用工具。
Sci Rep. 2017 Feb 24;7:43169. doi: 10.1038/srep43169.
5
An-2 Encodes a Cytokinin Synthesis Enzyme that Regulates Awn Length and Grain Production in Rice.An-2 编码一种细胞分裂素合成酶,该酶调控水稻芒长和籽粒产量。
Mol Plant. 2015 Nov 2;8(11):1635-50. doi: 10.1016/j.molp.2015.08.001. Epub 2015 Aug 15.
6
An efficient and scalable analysis framework for variant extraction and refinement from population-scale DNA sequence data.一种用于从群体规模的DNA序列数据中提取和优化变异体的高效且可扩展的分析框架。
Genome Res. 2015 Jun;25(6):918-25. doi: 10.1101/gr.176552.114. Epub 2015 Apr 16.
7
Genomic analysis of hybrid rice varieties reveals numerous superior alleles that contribute to heterosis.杂交水稻品种的基因组分析揭示了众多有助于杂种优势的优良等位基因。
Nat Commun. 2015 Feb 5;6:6258. doi: 10.1038/ncomms7258.
8
An-1 encodes a basic helix-loop-helix protein that regulates awn development, grain size, and grain number in rice.An-1编码一种碱性螺旋-环-螺旋蛋白,该蛋白调控水稻的芒发育、籽粒大小和籽粒数量。
Plant Cell. 2013 Sep;25(9):3360-76. doi: 10.1105/tpc.113.113589. Epub 2013 Sep 27.
9
The genomic signature of crop-wild introgression in maize.玉米作物-野生近缘种渐渗的基因组特征。
PLoS Genet. 2013 May;9(5):e1003477. doi: 10.1371/journal.pgen.1003477. Epub 2013 May 9.
10
An integrative variant analysis pipeline for accurate genotype/haplotype inference in population NGS data.一种整合的变异分析管道,用于准确推断人群 NGS 数据中的基因型/单倍型。
Genome Res. 2013 May;23(5):833-42. doi: 10.1101/gr.146084.112. Epub 2013 Jan 7.