Suppr超能文献

通过直接数据挖掘进行基于单倍型的连锁不平衡图谱分析。

Haplotype-based linkage disequilibrium mapping via direct data mining.

作者信息

Li Jing, Jiang Tao

机构信息

Electrical Engineering and Computer Science Department, Case Western Reserve University, Cleveland, OH 44106, USA.

出版信息

Bioinformatics. 2005 Dec 15;21(24):4384-93. doi: 10.1093/bioinformatics/bti732. Epub 2005 Oct 25.

Abstract

MOTIVATION

With the availability of large-scale, high-density single-nucleotide polymorphism markers and information on haplotype structures and frequencies, a great challenge is how to take advantage of haplotype information in the association mapping of complex diseases in case-control studies.

RESULTS

We present a novel approach for association mapping based on directly mining haplotypes (i.e. phased genotype pairs) produced from case-control data or case-parent data via a density-based clustering algorithm, which can be applied to whole-genome screens as well as candidate-gene studies in small genomic regions. The method directly explores the sharing of haplotype segments in affected individuals that are rarely present in normal individuals. The measure of sharing between two haplotypes is defined by a new similarity metric that combines the length of the shared segments and the number of common alleles around any marker position of the haplotypes, which is robust against recent mutations/genotype errors and recombination events. The effectiveness of the approach is demonstrated by using both simulated datasets and real datasets. The results show that the algorithm is accurate for different population models and for different disease models, even for genes with small effects, and it outperforms some recently developed methods.

摘要

动机

随着大规模、高密度单核苷酸多态性标记的可得性以及单倍型结构和频率信息的出现,一个巨大的挑战是如何在病例对照研究中复杂疾病的关联图谱绘制中利用单倍型信息。

结果

我们提出了一种基于直接挖掘通过基于密度的聚类算法从病例对照数据或病例-亲本数据产生的单倍型(即定相基因型对)进行关联图谱绘制的新方法,该方法可应用于全基因组筛选以及小基因组区域的候选基因研究。该方法直接探索患病个体中正常个体很少出现的单倍型片段的共享情况。两种单倍型之间的共享度量由一种新的相似性度量定义,该度量结合了共享片段的长度以及单倍型任何标记位置周围的共同等位基因数量,对近期突变/基因型错误和重组事件具有鲁棒性。通过使用模拟数据集和真实数据集证明了该方法的有效性。结果表明,该算法对于不同的群体模型和不同的疾病模型都是准确的,即使对于效应较小的基因也是如此,并且它优于一些最近开发的方法。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验