Suppr超能文献

根据成对连锁不平衡估计,对未分配的单核苷酸多态性/支架进行染色体位置分配。

Assignment of chromosomal locations for unassigned SNPs/scaffolds based on pair-wise linkage disequilibrium estimates.

机构信息

Reprogen-Animal Bioscience, Faculty of Veterinary Science, University of Sydney, Camden NSW 2570, Australia.

出版信息

BMC Bioinformatics. 2010 Apr 7;11:171. doi: 10.1186/1471-2105-11-171.

Abstract

BACKGROUND

Recent developments of high-density SNP chips across a number of species require accurate genetic maps. Despite rapid advances in genome sequence assembly and availability of a number of tools for creating genetic maps, the exact genome location for a number of SNPs from these SNP chips still remains unknown. We have developed a locus ordering procedure based on linkage disequilibrium (LODE) which provides estimation of the chromosomal positions of unaligned SNPs and scaffolds. It also provides an alternative means for verification of genetic maps. We exemplified LODE in cattle.

RESULTS

The utility of the LODE procedure was demonstrated using data from 1,943 bulls genotyped for 73,569 SNPs across three different SNP chips. First, the utility of the procedure was tested by analysing the masked positions of 1,500 randomly-chosen SNPs with known locations (50 from each chromosome), representing three classes of minor allele frequencies (MAF), namely >0.05, 0.01<MAF < or = 0.05 and 0.001<MAF < or = 0.01. The efficiency (percentage of masked SNPs that could be assigned a location) was 96.7%, 30.6% and 2.0%; with an accuracy (the percentage of SNPs assigned correctly) of 99.9%, 98.9% and 33.3% in the three classes of MAF, respectively. The average precision for placement of the SNPs was 914, 3,137 and 6,853 kb, respectively. Secondly, 4,688 of 5,314 SNPs unpositioned in the Btau4.0 assembly were positioned using the LODE procedure. Based on these results, the positions of 485 unordered scaffolds were determined. The procedure was also used to validate the genome positions of 53,068 SNPs placed on Btau4.0 bovine assembly, resulting in identification of problem areas in the assembly. Finally, the accuracy of the LODE procedure was independently validated by comparative mapping on the hg18 human assembly.

CONCLUSION

The LODE procedure described in this study is an efficient and accurate method for positioning SNPs (MAF>0.05), for validating and checking the quality of a genome assembly, and offers a means for positioning of unordered scaffolds containing SNPs. The LODE procedure will be helpful in refining genome sequence assemblies, especially those being created from next-generation sequencing where high-throughput SNP discovery and genotyping platforms are integrated components of genome analysis.

摘要

背景

在多个物种中,高密度 SNP 芯片的最新发展需要准确的遗传图谱。尽管基因组序列组装的快速发展和许多用于创建遗传图谱的工具的可用性,这些 SNP 芯片的许多 SNP 的确切基因组位置仍然未知。我们开发了一种基于连锁不平衡(LODE)的基因座排序程序,该程序提供了未对齐 SNP 和支架的染色体位置估计。它还为遗传图谱的验证提供了一种替代方法。我们以牛为例说明了 LODE。

结果

使用来自三个不同 SNP 芯片的 1943 头公牛的 73569 个 SNP 的数据测试了 LODE 程序的实用性。首先,通过分析已知位置的 1500 个随机选择的 SNP(每个染色体 50 个)的掩蔽位置来测试程序的实用性,这些 SNP 代表三个类别的小等位基因频率(MAF),即>0.05、0.01<MAF<=0.05 和 0.001<MAF<=0.01。效率(可以分配位置的掩蔽 SNP 的百分比)分别为 96.7%、30.6%和 2.0%;在三个 MAF 类别中,准确性(正确分配的 SNP 的百分比)分别为 99.9%、98.9%和 33.3%。SNP 放置的平均精度分别为 914、3137 和 6853 kb。其次,使用 LODE 程序对 Btau4.0 组装中未定位的 4688 个 SNP 中的 485 个未排序支架进行了定位。基于这些结果,确定了 485 个未排序支架的位置。该程序还用于验证放置在 Btau4.0 牛组装上的 53068 个 SNP 的基因组位置,从而确定了组装中的问题区域。最后,通过与 hg18 人类组装的比较映射独立验证了 LODE 程序的准确性。

结论

本研究中描述的 LODE 程序是一种高效准确的 SNP 定位方法(MAF>0.05),用于验证和检查基因组组装的质量,并提供了一种定位包含 SNP 的未排序支架的方法。LODE 程序将有助于改进基因组序列组装,特别是那些正在从下一代测序中创建的组装,其中高通量 SNP 发现和基因分型平台是基因组分析的集成组件。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2282/2859757/329362d7cfe4/1471-2105-11-171-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验