
Comparison of artificial neural network analysis with other multimarker methods for detecting genetic association.

Author

Curtis David

Affiliation

Academic Centre for Psychiatry, St Bartholomew's and Royal London School of Medicine and Dentistry, Royal London Hospital, Whitechapel, London, UK.

Publication

BMC Genet. 2007 Jul 18;8:49. doi: 10.1186/1471-2156-8-49.

DOI: 10.1186/1471-2156-8-49
PMID: 17640352
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1940019/
Abstract

BACKGROUND

Debate remains as to the optimal method for utilising genotype data obtained from multiple markers in case-control association studies. My colleagues and I have previously described a method of association analysis using artificial neural networks (ANNs), whose performance compared favourably with single-marker methods. Here, the performance of ANN analysis is compared with that of other multi-marker methods, comprising different haplotype-based analyses and locus-based analyses.

RESULTS

Of the several methods applied to simulated SNP datasets, heterogeneity testing of estimated haplotype frequencies using asymptotic p values rather than permutation testing had the lowest power, and ANN analysis had the highest. The difference in power to detect association between these two methods was statistically significant (p = 0.001), but other comparisons between methods were not significant. The raw t statistic obtained from ANN analysis correlated highly with the empirical statistical significance obtained from permutation testing of the ANN results and with the p value obtained from the heterogeneity test.

CONCLUSION

Although ANN analysis was more powerful than the standard haplotype-based test, it is unlikely to be taken up widely. The permutation testing necessary to obtain a valid p value makes it slow to perform, and it is not underpinned by a theoretical model relating marker genotypes to disease phenotype. Nevertheless, the superior performance of this method implies that the widely used haplotype-based methods for detecting association with multiple markers are not optimal and that efforts could be made to improve upon them. The high correlation between the t statistic obtained from ANN analysis and the statistical significance suggests that ANN analysis could be used where large numbers of markers have been genotyped, since the t value could serve as a proxy for the p value in preliminary analyses.
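The permutation step mentioned in the conclusion can be illustrated with a small sketch. This is not the author's implementation: the function names (`t_statistic`, `permutation_p_value`), the use of a Welch-style t statistic on per-subject scores, and the toy data are all assumptions made for illustration. The idea shown is only the general one: shuffle case/control labels many times, recompute the statistic each time, and report the proportion of permuted values at least as extreme as the observed one, using the commonly recommended (r + 1) / (n + 1) estimate. In a real analysis the `statistic` callable would presumably repeat the whole procedure, including fitting the model, on each permuted dataset, which is what makes the approach slow.

```python
import random
import statistics


def t_statistic(scores, labels):
    """Welch-style t statistic comparing scores of cases (1) and controls (0).

    Hypothetical stand-in for the 'raw t statistic' of a model's outputs.
    """
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    var_term = (statistics.variance(cases) / len(cases)
                + statistics.variance(controls) / len(controls))
    return (statistics.mean(cases) - statistics.mean(controls)) / var_term ** 0.5


def permutation_p_value(statistic, data, labels, n_perm=999, seed=0):
    """Empirical two-sided p value from permuting case/control labels."""
    rng = random.Random(seed)
    observed = abs(statistic(data, labels))
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break any real link between data and status
        if abs(statistic(data, shuffled)) >= observed:
            hits += 1
    # (r + 1) / (n + 1): the observed dataset counts as one permutation
    return (hits + 1) / (n_perm + 1)


# Toy data (invented): per-subject scores, first five subjects are cases.
toy_scores = [0.9, 0.8, 0.85, 0.95, 0.7, 0.1, 0.2, 0.15, 0.05, 0.3]
toy_labels = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
toy_p = permutation_p_value(t_statistic, toy_scores, toy_labels)
```

Because each permutation requires recomputing the statistic, the cost grows linearly with `n_perm`, which is the practical motivation for using the raw t value as a preliminary screen before running the full permutation test.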



