• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LinkImputeR:用于非模式生物的用户引导基因型分型和填补。

LinkImputeR: user-guided genotype calling and imputation for non-model organisms.

作者信息

Money Daniel, Migicovsky Zoë, Gardner Kyle, Myles Sean

机构信息

Department of Plant and Animal Sciences, Faculty of Agriculture, Dalhousie University, Truro, Nova Scotia, Canada.

出版信息

BMC Genomics. 2017 Jul 10;18(1):523. doi: 10.1186/s12864-017-3873-5.

DOI:10.1186/s12864-017-3873-5
PMID:28693460
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5504746/
Abstract

BACKGROUND

Genomic studies such as genome-wide association and genomic selection require genome-wide genotype data. All existing technologies used to create these data result in missing genotypes, which are often then inferred using genotype imputation software. However, existing imputation methods most often make use only of genotypes that are successfully inferred after having passed a certain read depth threshold. Because of this, any read information for genotypes that did not pass the threshold, and were thus set to missing, is ignored. Most genomic studies also choose read depth thresholds and quality filters without investigating their effects on the size and quality of the resulting genotype data. Moreover, almost all genotype imputation methods require ordered markers and are therefore of limited utility in non-model organisms.

RESULTS

Here we introduce LinkImputeR, a software program that exploits the read count information that is normally ignored, and makes use of all available DNA sequence information for the purposes of genotype calling and imputation. It is specifically designed for non-model organisms since it requires neither ordered markers nor a reference panel of genotypes. Using next-generation DNA sequence (NGS) data from apple, cannabis and grape, we quantify the effect of varying read count and missingness thresholds on the quantity and quality of genotypes generated from LinkImputeR. We demonstrate that LinkImputeR can increase the number of genotype calls by more than an order of magnitude, can improve genotyping accuracy by several percent and can thus improve the power of downstream analyses. Moreover, we show that the effects of quality and read depth filters can differ substantially between data sets and should therefore be investigated on a per-study basis.

CONCLUSIONS

By exploiting DNA sequence data that is normally ignored during genotype calling and imputation, LinkImputeR can significantly improve both the quantity and quality of genotype data generated from NGS technologies. It enables the user to quickly and easily examine the effects of varying thresholds and filters on the number and quality of the resulting genotype calls. In this manner, users can decide on thresholds that are most suitable for their purposes. We show that LinkImputeR can significantly augment the value and utility of NGS data sets, especially in non-model organisms with poor genomic resources.

摘要

背景

全基因组关联研究和基因组选择等基因组学研究需要全基因组基因型数据。用于生成这些数据的所有现有技术都会导致基因型缺失,这些缺失的基因型通常随后会使用基因型填充软件进行推断。然而,现有的填充方法通常仅利用在通过特定读取深度阈值后成功推断出的基因型。因此,任何未通过阈值并因此被设为缺失的基因型的读取信息都被忽略了。大多数基因组学研究在选择读取深度阈值和质量过滤器时,也未考察它们对所得基因型数据的规模和质量的影响。此外,几乎所有的基因型填充方法都需要有序的标记,因此在非模式生物中的实用性有限。

结果

在此,我们介绍LinkImputeR,这是一款软件程序,它利用通常被忽略的读取计数信息,并利用所有可用的DNA序列信息进行基因型分型和填充。它是专门为非模式生物设计的,因为它既不需要有序的标记,也不需要基因型参考面板。利用来自苹果、大麻和葡萄的下一代DNA序列(NGS)数据,我们量化了不同读取计数和缺失阈值对LinkImputeR生成的基因型数量和质量的影响。我们证明,LinkImputeR可以将基因型分型的数量增加一个数量级以上,将基因分型准确性提高几个百分点,从而提高下游分析的效能。此外,我们表明,质量和读取深度过滤器的效果在不同数据集之间可能有很大差异,因此应该针对每项研究进行考察。

结论

通过利用在基因型分型和填充过程中通常被忽略的DNA序列数据,LinkImputeR可以显著提高从NGS技术生成的基因型数据的数量和质量。它使用户能够快速轻松地检查不同阈值和过滤器对所得基因型分型数量和质量产生的影响。通过这种方式,用户可以确定最适合其目的的阈值。我们表明,LinkImputeR可以显著提高NGS数据集的价值和实用性,特别是在基因组资源匮乏的非模式生物中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/5504746/811ae55142b5/12864_2017_3873_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/5504746/b03fa0c908a6/12864_2017_3873_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/5504746/5adb662e6882/12864_2017_3873_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/5504746/edc635727b25/12864_2017_3873_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/5504746/811ae55142b5/12864_2017_3873_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/5504746/b03fa0c908a6/12864_2017_3873_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/5504746/5adb662e6882/12864_2017_3873_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/5504746/edc635727b25/12864_2017_3873_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3519/5504746/811ae55142b5/12864_2017_3873_Fig4_HTML.jpg

相似文献

1
LinkImputeR: user-guided genotype calling and imputation for non-model organisms.LinkImputeR:用于非模式生物的用户引导基因型分型和填补。
BMC Genomics. 2017 Jul 10;18(1):523. doi: 10.1186/s12864-017-3873-5.
2
Low-depth genotyping-by-sequencing (GBS) in a bovine population: strategies to maximize the selection of high quality genotypes and the accuracy of imputation.牛群中的低深度测序基因分型(GBS):最大化高质量基因型选择和归因准确性的策略。
BMC Genet. 2017 Apr 5;18(1):32. doi: 10.1186/s12863-017-0501-y.
3
LinkImpute: Fast and Accurate Genotype Imputation for Nonmodel Organisms.LinkImpute:非模式生物的快速准确基因型填充
G3 (Bethesda). 2015 Sep 15;5(11):2383-90. doi: 10.1534/g3.115.021667.
4
Fast imputation using medium or low-coverage sequence data.使用中等或低覆盖率序列数据进行快速插补。
BMC Genet. 2015 Jul 14;16:82. doi: 10.1186/s12863-015-0243-7.
5
Evaluating Imputation Algorithms for Low-Depth Genotyping-By-Sequencing (GBS) Data.评估低深度简化基因组测序(GBS)数据的插补算法
PLoS One. 2016 Aug 18;11(8):e0160733. doi: 10.1371/journal.pone.0160733. eCollection 2016.
6
Whole-genome characterization in pedigreed non-human primates using genotyping-by-sequencing (GBS) and imputation.利用简化基因组测序(GBS)和填充技术对圈养非人灵长类动物进行全基因组特征分析。
BMC Genomics. 2016 Aug 24;17(1):676. doi: 10.1186/s12864-016-2966-x.
7
Reference-free SNP calling: improved accuracy by preventing incorrect calls from repetitive genomic regions.无参考 SNP 调用:通过防止重复基因组区域的错误调用来提高准确性。
Biol Direct. 2012 Jun 8;7:17. doi: 10.1186/1745-6150-7-17.
8
Increasing calling accuracy, coverage, and read-depth in sequence data by the use of haplotype blocks.通过使用单倍型块提高序列数据的呼叫准确率、覆盖率和读深度。
PLoS Genet. 2021 Dec 23;17(12):e1009944. doi: 10.1371/journal.pgen.1009944. eCollection 2021 Dec.
9
Accurate Genotype Imputation in Multiparental Populations from Low-Coverage Sequence.多亲本群体低覆盖度序列下的精确基因型推断。
Genetics. 2018 Sep;210(1):71-82. doi: 10.1534/genetics.118.300885. Epub 2018 Jul 25.
10
Comprehensive Assessment of Genotype Imputation Performance.基因型填充性能的综合评估
Hum Hered. 2018;83(3):107-116. doi: 10.1159/000489758. Epub 2019 Jan 22.

引用本文的文献

1
Chromosome-level haplotype-resolved genome assembly provides insights into the highly heterozygous genome of Italian ryegrass (Lolium multiflorum Lam.).染色体水平单倍型解析的基因组组装为多花黑麦草(Lolium multiflorum Lam.)高度杂合的基因组提供了见解。
Plant Genome. 2025 Sep;18(3):e70079. doi: 10.1002/tpg2.70079.
2
Effect of Chromosomal Localization of NGS-Based Markers on Their Applicability for Analyzing Genetic Variation and Population Structure of Hexaploid Triticale.基于新一代测序(NGS)标记的染色体定位对其分析六倍体小黑麦遗传变异和群体结构适用性的影响。
Int J Mol Sci. 2024 Sep 3;25(17):9568. doi: 10.3390/ijms25179568.
3

本文引用的文献

1
Exploiting Wild Relatives for Genomics-assisted Breeding of Perennial Crops.利用野生近缘种进行多年生作物的基因组辅助育种
Front Plant Sci. 2017 Apr 4;8:460. doi: 10.3389/fpls.2017.00460. eCollection 2017.
2
Assessment of Cultivar Distinctness in Alfalfa: A Comparison of Genotyping-by-Sequencing, Simple-Sequence Repeat Marker, and Morphophysiological Observations.苜蓿品种特异性评估:比较基因型测序、简单序列重复标记和形态生理学观察。
Plant Genome. 2016 Jul;9(2). doi: 10.3835/plantgenome2015.10.0105.
3
Genome to Phenome Mapping in Apple Using Historical Data.
Missing genotype imputation in non-model species using self-organizing maps.
使用自组织映射对非模式物种进行缺失基因型填充
Mol Ecol Resour. 2025 Apr;25(3):e13992. doi: 10.1111/1755-0998.13992. Epub 2024 Jul 6.
4
Unveiling synergistic QTLs associated with slow wilting in soybean (Glycine max [L.] Merr.).揭示与大豆(Glycine max [L.] Merr.)缓慢萎蔫相关的协同 QTL。
Theor Appl Genet. 2024 Mar 19;137(4):85. doi: 10.1007/s00122-024-04585-1.
5
Genome-Wide Association Studies in Sunflower: Towards Sclerotinia sclerotiorum and Diaporthe/Phomopsis Resistance Breeding.向日葵全基因组关联研究:朝着核盘菌和茎点霉/拟茎点霉抗性育种的方向。
Genes (Basel). 2022 Dec 14;13(12):2357. doi: 10.3390/genes13122357.
6
Genotyping-by-sequencing of Canada's apple biodiversity collection.加拿大苹果生物多样性收集品的简化基因组测序
Front Genet. 2022 Aug 25;13:934712. doi: 10.3389/fgene.2022.934712. eCollection 2022.
7
Genotyping, the Usefulness of Imputation to Increase SNP Density, and Imputation Methods and Tools.基因分型、增加单核苷酸多态性(SNP)密度的填充实用性以及填充方法和工具
Methods Mol Biol. 2022;2467:113-138. doi: 10.1007/978-1-0716-2205-6_4.
8
Mapping the sex determination region in the Salix F1 hybrid common parent population confirms a ZW system in six diverse species.在柳属 F1 杂种共同亲本群体中定位性别决定区域,证实了六个不同物种中的 ZW 系统。
G3 (Bethesda). 2022 May 30;12(6). doi: 10.1093/g3journal/jkac071.
9
Multiple evolutionary origins of glyphosate resistance in .草甘膦抗性的多种进化起源于…… (原文表述不完整,此为根据现有内容尽量准确翻译)
Evol Appl. 2022 Feb 8;15(2):316-329. doi: 10.1111/eva.13344. eCollection 2022 Feb.
10
Comparative transcriptomics and eQTL mapping of response to Melampsora americana in selected Salix purpurea F progeny.在选定的紫柳 F 代中,对美洲杨栅锈菌的响应进行比较转录组学和 eQTL 图谱分析。
BMC Genomics. 2022 Jan 22;23(1):71. doi: 10.1186/s12864-021-08254-1.
利用历史数据进行苹果的基因组到表型图谱绘制。
Plant Genome. 2016 Jul;9(2). doi: 10.3835/plantgenome2015.11.0113.
4
Genome-Wide Association Study for Nine Plant Architecture Traits in Sorghum.高粱 9 个株型性状的全基因组关联研究。
Plant Genome. 2016 Jul;9(2). doi: 10.3835/plantgenome2015.06.0044.
5
Rapid genotype imputation from sequence without reference panels.无需参考面板即可从序列中快速进行基因型推算。
Nat Genet. 2016 Aug;48(8):965-969. doi: 10.1038/ng.3594. Epub 2016 Jul 4.
6
Genomic ancestry estimation quantifies use of wild species in grape breeding.基因组血统估计量化了野生种在葡萄育种中的利用情况。
BMC Genomics. 2016 Jun 30;17:478. doi: 10.1186/s12864-016-2834-8.
7
Increasing Genome Sampling and Improving SNP Genotyping for Genotyping-by-Sequencing with New Combinations of Restriction Enzymes.利用限制性内切酶新组合增加基因组采样并改善简化基因组测序中的单核苷酸多态性基因分型
G3 (Bethesda). 2016 Apr 7;6(4):845-56. doi: 10.1534/g3.115.025775.
8
Genotype Imputation with Millions of Reference Samples.使用数百万参考样本进行基因型填充
Am J Hum Genet. 2016 Jan 7;98(1):116-26. doi: 10.1016/j.ajhg.2015.11.020.
9
LinkImpute: Fast and Accurate Genotype Imputation for Nonmodel Organisms.LinkImpute:非模式生物的快速准确基因型填充
G3 (Bethesda). 2015 Sep 15;5(11):2383-90. doi: 10.1534/g3.115.021667.
10
The Genetic Structure of Marijuana and Hemp.大麻和工业大麻的遗传结构。
PLoS One. 2015 Aug 26;10(8):e0133292. doi: 10.1371/journal.pone.0133292. eCollection 2015.