• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因分型阵列中新变体的发现可提高基因型保留率并降低确定偏差。

Discovery of novel variants in genotyping arrays improves genotype retention and reduces ascertainment bias.

机构信息

Department of Genetics, University of North Carolina at Chapel Hill, 27599-7264, USA.

出版信息

BMC Genomics. 2012 Jan 19;13:34. doi: 10.1186/1471-2164-13-34.

DOI:10.1186/1471-2164-13-34
PMID:22260749
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3305361/
Abstract

BACKGROUND

High-density genotyping arrays that measure hybridization of genomic DNA fragments to allele-specific oligonucleotide probes are widely used to genotype single nucleotide polymorphisms (SNPs) in genetic studies, including human genome-wide association studies. Hybridization intensities are converted to genotype calls by clustering algorithms that assign each sample to a genotype class at each SNP. Data for SNP probes that do not conform to the expected pattern of clustering are often discarded, contributing to ascertainment bias and resulting in lost information - as much as 50% in a recent genome-wide association study in dogs.

RESULTS

We identified atypical patterns of hybridization intensities that were highly reproducible and demonstrated that these patterns represent genetic variants that were not accounted for in the design of the array platform. We characterized variable intensity oligonucleotide (VINO) probes that display such patterns and are found in all hybridization-based genotyping platforms, including those developed for human, dog, cattle, and mouse. When recognized and properly interpreted, VINOs recovered a substantial fraction of discarded probes and counteracted SNP ascertainment bias. We developed software (MouseDivGeno) that identifies VINOs and improves the accuracy of genotype calling. MouseDivGeno produced highly concordant genotype calls when compared with other methods but it uniquely identified more than 786000 VINOs in 351 mouse samples. We used whole-genome sequence from 14 mouse strains to confirm the presence of novel variants explaining 28000 VINOs in those strains. We also identified VINOs in human HapMap 3 samples, many of which were specific to an African population. Incorporating VINOs in phylogenetic analyses substantially improved the accuracy of a Mus species tree and local haplotype assignment in laboratory mouse strains.

CONCLUSION

The problems of ascertainment bias and missing information due to genotyping errors are widely recognized as limiting factors in genetic studies. We have conducted the first formal analysis of the effect of novel variants on genotyping arrays, and we have shown that these variants account for a large portion of miscalled and uncalled genotypes. Genetic studies will benefit from substantial improvements in the accuracy of their results by incorporating VINOs in their analyses.

摘要

背景

高密度基因分型芯片通过测量基因组 DNA 片段与等位基因特异性寡核苷酸探针的杂交来广泛用于遗传研究中的单核苷酸多态性(SNP)基因分型,包括人类全基因组关联研究。通过聚类算法将杂交强度转换为基因型,聚类算法将每个样本分配到每个 SNP 的基因型类别。不符合聚类预期模式的 SNP 探针数据通常会被丢弃,导致确定偏差,并导致信息丢失 - 在最近的犬全基因组关联研究中高达 50%。

结果

我们确定了高度可重复的杂交强度异常模式,并证明这些模式代表了在芯片平台设计中未考虑到的遗传变异。我们描述了显示这种模式的可变强度寡核苷酸(VINO)探针,并发现它们存在于所有基于杂交的基因分型平台中,包括为人类、狗、牛和老鼠开发的平台。当被识别并正确解释时,VINOs 恢复了大量被丢弃的探针,并抵消了 SNP 确定偏差。我们开发了一种软件(MouseDivGeno),用于识别 VINOs 并提高基因型调用的准确性。与其他方法相比,MouseDivGeno 产生了高度一致的基因型调用,但它在 351 个老鼠样本中唯一识别了超过 786000 个 VINOs。我们使用来自 14 个老鼠品系的全基因组序列来确认这些品系中存在解释 28000 个 VINOs 的新型变体。我们还在人类 HapMap 3 样本中发现了 VINOs,其中许多是特定于非洲人群的。将 VINOs 纳入系统发育分析极大地提高了 Mus 物种树的准确性,并改进了实验室老鼠品系中的局部单倍型分配。

结论

由于基因分型错误导致的确定偏差和信息缺失问题已被广泛认为是遗传研究的限制因素。我们对新型变体对基因分型芯片的影响进行了首次正式分析,并表明这些变体占误报和未报基因型的很大一部分。通过将 VINOs 纳入其分析,遗传研究将从其结果准确性的重大提高中受益。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/30d7be78789c/1471-2164-13-34-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/0e14694d09c6/1471-2164-13-34-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/6e0271835b61/1471-2164-13-34-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/3ddedfa8edb4/1471-2164-13-34-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/4220286a0924/1471-2164-13-34-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/94f8fad43295/1471-2164-13-34-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/d30fd74ca46d/1471-2164-13-34-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/824a1b323192/1471-2164-13-34-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/30d7be78789c/1471-2164-13-34-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/0e14694d09c6/1471-2164-13-34-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/6e0271835b61/1471-2164-13-34-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/3ddedfa8edb4/1471-2164-13-34-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/4220286a0924/1471-2164-13-34-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/94f8fad43295/1471-2164-13-34-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/d30fd74ca46d/1471-2164-13-34-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/824a1b323192/1471-2164-13-34-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/388c/3305361/30d7be78789c/1471-2164-13-34-8.jpg

相似文献

1
Discovery of novel variants in genotyping arrays improves genotype retention and reduces ascertainment bias.基因分型阵列中新变体的发现可提高基因型保留率并降低确定偏差。
BMC Genomics. 2012 Jan 19;13:34. doi: 10.1186/1471-2164-13-34.
2
A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays.一种用于Affymetrix SNP微阵列的多阵列多SNP基因分型算法。
Bioinformatics. 2007 Jun 15;23(12):1459-67. doi: 10.1093/bioinformatics/btm131. Epub 2007 Apr 25.
3
Assessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samples.使用270个HapMap样本评估基因分型算法BRLMM对Affymetrix GeneChip Human Mapping 500 K芯片组的批次效应。
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S17. doi: 10.1186/1471-2105-9-S9-S17.
4
Vitis phylogenomics: hybridization intensities from a SNP array outperform genotype calls.葡萄系统基因组学:SNP 芯片的杂交强度优于基因型分析。
PLoS One. 2013 Nov 13;8(11):e78680. doi: 10.1371/journal.pone.0078680. eCollection 2013.
5
Generation of SNP datasets for orangutan population genomics using improved reduced-representation sequencing and direct comparisons of SNP calling algorithms.利用改良的简化代表性测序和 SNP 调用算法的直接比较,生成猩猩群体基因组学的 SNP 数据集。
BMC Genomics. 2014 Jan 10;15:16. doi: 10.1186/1471-2164-15-16.
6
Low-depth genotyping-by-sequencing (GBS) in a bovine population: strategies to maximize the selection of high quality genotypes and the accuracy of imputation.牛群中的低深度测序基因分型(GBS):最大化高质量基因型选择和归因准确性的策略。
BMC Genet. 2017 Apr 5;18(1):32. doi: 10.1186/s12863-017-0501-y.
7
MA-SNP--A new genotype calling method for oligonucleotide SNP arrays modeling the batch effect with a normal mixture model.MA-SNP——一种用于寡核苷酸SNP阵列的新基因型分型方法,使用正态混合模型对批次效应进行建模。
Stat Appl Genet Mol Biol. 2011 Aug 30;10(1):/j/sagmb.2011.10.issue-1/sagmb.2011.10.1.1698/sagmb.2011.10.1.1698.xml. doi: 10.2202/1544-6115.1698.
8
Automated SNP genotype clustering algorithm to improve data completeness in high-throughput SNP genotyping datasets from custom arrays.用于提高来自定制阵列的高通量单核苷酸多态性(SNP)基因分型数据集中数据完整性的自动化SNP基因型聚类算法。
Genomics Proteomics Bioinformatics. 2007 Dec;5(3-4):256-9. doi: 10.1016/S1672-0229(08)60014-5.
9
Accuracy of genome-wide imputation of untyped markers and impacts on statistical power for association studies.未分型标记的全基因组推断准确性及其对关联研究统计效能的影响。
BMC Genet. 2009 Jun 16;10:27. doi: 10.1186/1471-2156-10-27.
10
Simultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studies.同时进行基因型调用和单倍型相位分析可提高全基因组关联研究的基因型准确性,并减少假阳性关联。
Am J Hum Genet. 2009 Dec;85(6):847-61. doi: 10.1016/j.ajhg.2009.11.004.

引用本文的文献

1
Leveraging Whole-Genome Resequencing to Uncover Genetic Diversity and Promote Conservation Strategies for Ruminants in Asia.利用全基因组重测序揭示亚洲反刍动物的遗传多样性并促进保护策略
Animals (Basel). 2025 Mar 13;15(6):831. doi: 10.3390/ani15060831.
2
A cost-effective, high-throughput, highly accurate genotyping method for outbred populations.一种用于远交群体的经济高效、高通量、高精度基因分型方法。
G3 (Bethesda). 2025 Feb 5;15(2). doi: 10.1093/g3journal/jkae291.
3
Postnatal Ovarian Transdifferentiation in the Absence of Estrogen Receptor Signaling Is Dependent on Genetic Background.

本文引用的文献

1
Dynamic Visualization and Comparative Analysis of Multiple Collinear Genomic Data.多个共线基因组数据的动态可视化与比较分析
ACM BCB. 2011 Aug;2011:335-339. doi: 10.1145/2147805.2147846.
2
The genome architecture of the Collaborative Cross mouse genetic reference population.合作研究杂交(CC)鼠遗传参考群体的基因组结构。
Genetics. 2012 Feb;190(2):389-401. doi: 10.1534/genetics.111.132639.
3
Microsatellites behaving badly: empirical evaluation of genotyping errors and subsequent impacts on population studies.行为不良的微卫星:基因分型错误的实证评估及其对种群研究的后续影响
在缺乏雌激素受体信号的情况下,产后卵巢转分化依赖于遗传背景。
Endocrinology. 2024 Nov 26;166(1). doi: 10.1210/endocr/bqae157.
4
The updated mouse universal genotyping array bioinformatic pipeline improves genetic QC in laboratory mice.更新后的小鼠全基因组基因分型芯片生物信息学分析流程提高了实验小鼠的遗传质量控制。
G3 (Bethesda). 2024 Oct 7;14(10). doi: 10.1093/g3journal/jkae193.
5
A Cost-effective, High-throughput, Highly Accurate Genotyping Method for Outbred Populations.一种适用于远交群体的经济高效、高通量、高精度基因分型方法。
bioRxiv. 2024 Jul 18:2024.07.17.603984. doi: 10.1101/2024.07.17.603984.
6
Symbiotic Variations among Wheat Genotypes and Detection of Quantitative Trait Loci for Molecular Interaction with Auxin-Producing PGPR.小麦基因型间的共生变异及与产生长素植物根际促生细菌分子互作数量性状位点的检测
Microorganisms. 2023 Jun 19;11(6):1615. doi: 10.3390/microorganisms11061615.
7
Genetic modifiers regulating DNA replication and double-strand break repair are associated with differences in mammary tumors in mouse models of Li-Fraumeni syndrome.调控 DNA 复制和双链断裂修复的遗传修饰物与 Li-Fraumeni 综合征小鼠模型中乳腺肿瘤的差异相关。
Oncogene. 2021 Aug;40(31):5026-5037. doi: 10.1038/s41388-021-01892-5. Epub 2021 Jun 28.
8
Content and Performance of the MiniMUGA Genotyping Array: A New Tool To Improve Rigor and Reproducibility in Mouse Research.MiniMUGA 基因分型阵列的内容和性能:提高小鼠研究严谨性和可重复性的新工具。
Genetics. 2020 Dec;216(4):905-930. doi: 10.1534/genetics.120.303596. Epub 2020 Oct 16.
9
An Axiom SNP genotyping array for Douglas-fir.用于花旗松的公理 SNP 基因分型阵列。
BMC Genomics. 2020 Jan 3;21(1):9. doi: 10.1186/s12864-019-6383-9.
10
High throughput genotyping of structural variations in a complex plant genome using an original Affymetrix® axiom® array.利用原始的 Affymetrix® axiom® 阵列对复杂植物基因组中的结构变异进行高通量基因分型。
BMC Genomics. 2019 Nov 13;20(1):848. doi: 10.1186/s12864-019-6136-9.
Genet Mol Res. 2011 Oct 19;10(4):2534-53. doi: 10.4238/2011.October.19.1.
4
Mouse genomic variation and its effect on phenotypes and gene regulation.小鼠基因组变异及其对表型和基因调控的影响。
Nature. 2011 Sep 14;477(7364):289-94. doi: 10.1038/nature10413.
5
Subspecific origin and haplotype diversity in the laboratory mouse.实验室小鼠的亚种起源和单倍型多样性。
Nat Genet. 2011 May 29;43(7):648-55. doi: 10.1038/ng.847.
6
Genetic analysis of complex traits in the emerging Collaborative Cross.新兴的合作杂交群体中复杂性状的遗传分析。
Genome Res. 2011 Aug;21(8):1213-22. doi: 10.1101/gr.111310.110. Epub 2011 Mar 15.
7
A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。
Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.
8
ALCHEMY: a reliable method for automated SNP genotype calling for small batch sizes and highly homozygous populations.ALCHEMY:一种适用于小批次和高度纯合群体的 SNP 基因型自动调用的可靠方法。
Bioinformatics. 2010 Dec 1;26(23):2952-60. doi: 10.1093/bioinformatics/btq533. Epub 2010 Oct 5.
9
Elusive copy number variation in the mouse genome.难以捉摸的小鼠基因组拷贝数变异。
PLoS One. 2010 Sep 21;5(9):e12839. doi: 10.1371/journal.pone.0012839.
10
Integrating common and rare genetic variation in diverse human populations.整合不同人类群体中的常见和罕见遗传变异。
Nature. 2010 Sep 2;467(7311):52-8. doi: 10.1038/nature09298.