• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用混合数据进行个体单倍型推断的新型工具。

A novel tool for individual haplotype inference using mixed data.

作者信息

Lin Chen-Pang, Fann Cathy S J

机构信息

Institute of Public Health, National Yang-Ming University, Taipei, Taiwan.

出版信息

J Biomed Sci. 2009 Jun 2;16(1):52. doi: 10.1186/1423-0127-16-52.

DOI:10.1186/1423-0127-16-52
PMID:19486537
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2711065/
Abstract

BACKGROUND

In many studies, researchers may recruit samples consisting of independent trios and unrelated individuals. However, most of the currently available haplotype inference methods do not cope well with these kinds of mixed data sets.

METHODS

We propose a general and simple methodology using a mixture of weighted multinomial (MIXMUL) approach that combines separate haplotype information from unrelated individuals and independent trios for haplotype inference to the individual level.

RESULTS

The new MIXMUL procedure improves over existing methods in that it can accurately estimate haplotype frequencies from mixed data sets and output probable haplotype pairs in optimized reconstruction outcomes for all subjects that have contributed to estimation. Simulation results showed that this new MIXMUL procedure competes well with the EM-based method, i.e. FAMHAP, under a few assumed scenarios.

CONCLUSION

The results showed that MIXMUL can provide accurate estimates similar to those haplotype frequencies obtained from FAMHAP and output the probable haplotype pairs in the most optimal reconstruction outcome for all subjects that have contributed to estimation. If available data consist of combinations of unrelated individuals and independent trios, the MIXMUL procedure can be used to estimate the haplotype frequencies accurately and output the most likely reconstructed haplotype pairs of each subject in the estimation.

摘要

背景

在许多研究中,研究人员可能会招募由独立三人组和无关个体组成的样本。然而,目前大多数可用的单倍型推断方法并不能很好地处理这类混合数据集。

方法

我们提出了一种通用且简单的方法,即使用加权多项混合(MIXMUL)方法,该方法将来自无关个体和独立三人组的单独单倍型信息结合起来,用于将单倍型推断到个体水平。

结果

新的MIXMUL程序优于现有方法,因为它可以从混合数据集中准确估计单倍型频率,并在为所有参与估计的受试者优化重建结果中输出可能的单倍型对。模拟结果表明,在一些假设情况下,这种新的MIXMUL程序与基于期望最大化(EM)的方法FAMHAP竞争良好。

结论

结果表明,MIXMUL可以提供与从FAMHAP获得的单倍型频率相似的准确估计,并在为所有参与估计的受试者的最优重建结果中输出可能的单倍型对。如果可用数据由无关个体和独立三人组的组合组成,则MIXMUL程序可用于准确估计单倍型频率,并在估计中输出每个受试者最可能重建的单倍型对。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/237d/2711065/37cf77b17bbb/1423-0127-16-52-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/237d/2711065/568749f3da43/1423-0127-16-52-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/237d/2711065/31ea879d9461/1423-0127-16-52-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/237d/2711065/37cf77b17bbb/1423-0127-16-52-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/237d/2711065/568749f3da43/1423-0127-16-52-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/237d/2711065/31ea879d9461/1423-0127-16-52-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/237d/2711065/37cf77b17bbb/1423-0127-16-52-3.jpg

相似文献

1
A novel tool for individual haplotype inference using mixed data.一种使用混合数据进行个体单倍型推断的新型工具。
J Biomed Sci. 2009 Jun 2;16(1):52. doi: 10.1186/1423-0127-16-52.
2
HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination.HAPLORE:一个用于在无重组的一般家系中进行单倍型重建的程序。
Bioinformatics. 2005 Jan 1;21(1):90-103. doi: 10.1093/bioinformatics/bth388. Epub 2004 Jul 1.
3
Maximum-likelihood estimation of haplotype frequencies in nuclear families.核心家庭中单体型频率的最大似然估计。
Genet Epidemiol. 2004 Jul;27(1):21-32. doi: 10.1002/gepi.10323.
4
Estimate haplotype frequencies in pedigrees.估计系谱中的单倍型频率。
BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S5. doi: 10.1186/1471-2105-7-S4-S5.
5
A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals.针对三联体和无关个体的大型数据集进行基因型填充和单倍型相位推断的统一方法。
Am J Hum Genet. 2009 Feb;84(2):210-23. doi: 10.1016/j.ajhg.2009.01.005. Epub 2009 Feb 5.
6
Simple association analysis combining data from trios/sibships and unrelated controls.结合三联体/同胞对数据和无关对照数据的简单关联分析。
Genet Epidemiol. 2008 Sep;32(6):520-7. doi: 10.1002/gepi.20325.
7
Penalized estimation of haplotype frequencies.单倍型频率的惩罚估计
Bioinformatics. 2008 Jul 15;24(14):1596-602. doi: 10.1093/bioinformatics/btn236. Epub 2008 May 16.
8
Estimating haplotype relative risks on human survival in population-based association studies.
Hum Hered. 2005;59(2):88-97. doi: 10.1159/000085223. Epub 2005 Apr 18.
9
Haplotype association analysis of human disease traits using genotype data of unrelated individuals.利用无关个体的基因型数据对人类疾病性状进行单倍型关联分析。
Genet Res. 2005 Dec;86(3):223-31. doi: 10.1017/S0016672305007792.
10
A comparison of several methods for haplotype frequency estimation and haplotype reconstruction for tightly linked markers from general pedigrees.几种用于估计单倍型频率和重建来自一般家系的紧密连锁标记的单倍型的方法的比较。
Genet Epidemiol. 2006 Jul;30(5):423-37. doi: 10.1002/gepi.20154.

本文引用的文献

1
Simultaneously correcting for population stratification and for genotyping error in case-control association studies.在病例对照关联研究中同时校正群体分层和基因分型错误。
Am J Hum Genet. 2007 Oct;81(4):726-43. doi: 10.1086/520962. Epub 2007 Aug 22.
2
Efficient multilocus association testing for whole genome association studies using localized haplotype clustering.利用局部单倍型聚类进行全基因组关联研究的高效多位点关联测试。
Genet Epidemiol. 2007 Jul;31(5):365-75. doi: 10.1002/gepi.20216.
3
A comparison of several methods for haplotype frequency estimation and haplotype reconstruction for tightly linked markers from general pedigrees.
几种用于估计单倍型频率和重建来自一般家系的紧密连锁标记的单倍型的方法的比较。
Genet Epidemiol. 2006 Jul;30(5):423-37. doi: 10.1002/gepi.20154.
4
Family-based designs in the age of large-scale gene-association studies.大规模基因关联研究时代的基于家系的设计。
Nat Rev Genet. 2006 May;7(5):385-94. doi: 10.1038/nrg1839.
5
Characteristics of replicated single-nucleotide polymorphism genotypes from COGA: Affymetrix and Center for Inherited Disease Research.COGA 中复制的单核苷酸多态性基因型的特征:Affymetrix 和遗传性疾病研究中心。
BMC Genet. 2005 Dec 30;6 Suppl 1(Suppl 1):S154. doi: 10.1186/1471-2156-6-S1-S154.
6
Integrating case-control and TDT studies.整合病例对照研究和传递不平衡检验(TDT)研究。
Ann Hum Genet. 2005 May;69(Pt 3):329-35. doi: 10.1046/j.1529-8817.2005.00156.x.
7
Accounting for decay of linkage disequilibrium in haplotype inference and missing-data imputation.在单倍型推断和缺失数据插补中考虑连锁不平衡的衰减。
Am J Hum Genet. 2005 Mar;76(3):449-62. doi: 10.1086/428594. Epub 2005 Jan 31.
8
Combining the transmission disequilibrium test and case-control methodology using generalized logistic regression.结合传递不平衡检验和使用广义逻辑回归的病例对照方法。
Eur J Hum Genet. 2004 Nov;12(11):964-70. doi: 10.1038/sj.ejhg.5201255.
9
Maximum-likelihood estimation of haplotype frequencies in nuclear families.核心家庭中单体型频率的最大似然估计。
Genet Epidemiol. 2004 Jul;27(1):21-32. doi: 10.1002/gepi.10323.
10
Efficiency of haplotype frequency estimation when nuclear family information is included.纳入核心家庭信息时单倍型频率估计的效率。
Hum Hered. 2002;54(1):45-53. doi: 10.1159/000066692.