• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于情绪障碍全基因组关联研究的数据挖掘方法。

Data mining approaches for genome-wide association of mood disorders.

作者信息

Pirooznia Mehdi, Seifuddin Fayaz, Judy Jennifer, Mahon Pamela B, Potash James B, Zandi Peter P

机构信息

School of Medicine, Johns Hopkins University, Baltimore, MD 21205, USA.

出版信息

Psychiatr Genet. 2012 Apr;22(2):55-61. doi: 10.1097/YPG.0b013e32834dc40d.

DOI:10.1097/YPG.0b013e32834dc40d
PMID:22081063
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3306768/
Abstract

BACKGROUND

Mood disorders are highly heritable forms of major mental illness. A major breakthrough in elucidating the genetic architecture of mood disorders was anticipated with the advent of genome-wide association studies (GWAS). However, to date few susceptibility loci have been conclusively identified. The genetic etiology of mood disorders appears to be quite complex, and as a result, alternative approaches for analyzing GWAS data are needed. Recently, a polygenic scoring approach that captures the effects of alleles across multiple loci was successfully applied to the analysis of GWAS data in schizophrenia and bipolar disorder (BP). However, this method may be overly simplistic in its approach to the complexity of genetic effects. Data mining methods are available that may be applied to analyze the high dimensional data generated by GWAS of complex psychiatric disorders.

RESULTS

We sought to compare the performance of five data mining methods, namely, Bayesian networks, support vector machine, random forest, radial basis function network, and logistic regression, against the polygenic scoring approach in the analysis of GWAS data on BP. The different classification methods were trained on GWAS datasets from the Bipolar Genome Study (2191 cases with BP and 1434 controls) and their ability to accurately classify case/control status was tested on a GWAS dataset from the Wellcome Trust Case Control Consortium.

CONCLUSION

The performance of the classifiers in the test dataset was evaluated by comparing area under the receiver operating characteristic curves. Bayesian networks performed the best of all the data mining classifiers, but none of these did significantly better than the polygenic score approach. We further examined a subset of single-nucleotide polymorphisms (SNPs) in genes that are expressed in the brain, under the hypothesis that these might be most relevant to BP susceptibility, but all the classifiers performed worse with this reduced set of SNPs. The discriminative accuracy of all of these methods is unlikely to be of diagnostic or clinical utility at the present time. Further research is needed to develop strategies for selecting sets of SNPs likely to be relevant to disease susceptibility and to determine if other data mining classifiers that utilize other algorithms for inferring relationships among the sets of SNPs may perform better.

摘要

背景

情绪障碍是主要精神疾病中具有高度遗传性的类型。随着全基因组关联研究(GWAS)的出现,人们预期在阐明情绪障碍的遗传结构方面会取得重大突破。然而,迄今为止,仅有少数易感性位点被明确鉴定。情绪障碍的遗传病因似乎相当复杂,因此,需要采用其他方法来分析GWAS数据。最近,一种捕获多个位点等位基因效应的多基因评分方法已成功应用于精神分裂症和双相情感障碍(BP)的GWAS数据分析。然而,该方法在处理遗传效应的复杂性时可能过于简单。现有的数据挖掘方法可用于分析复杂精神疾病GWAS产生的高维数据。

结果

我们试图在分析BP的GWAS数据时,将五种数据挖掘方法(即贝叶斯网络、支持向量机、随机森林、径向基函数网络和逻辑回归)与多基因评分方法的性能进行比较。不同的分类方法在双相基因组研究的GWAS数据集(2191例BP患者和1434例对照)上进行训练,并在威康信托病例对照协会的GWAS数据集上测试其准确分类病例/对照状态的能力。

结论

通过比较受试者工作特征曲线下的面积来评估测试数据集中分类器的性能。贝叶斯网络在所有数据挖掘分类器中表现最佳,但这些分类器均未显著优于多基因评分方法。我们进一步检查了在大脑中表达的基因中的单核苷酸多态性(SNP)子集,假设这些可能与BP易感性最相关,但使用这一减少的SNP集时,所有分类器的表现都更差。目前,所有这些方法的判别准确性不太可能具有诊断或临床实用性。需要进一步研究来制定选择可能与疾病易感性相关的SNP集的策略,并确定是否有其他利用其他算法推断SNP集之间关系的数据挖掘分类器可能表现得更好。

相似文献

1
Data mining approaches for genome-wide association of mood disorders.用于情绪障碍全基因组关联研究的数据挖掘方法。
Psychiatr Genet. 2012 Apr;22(2):55-61. doi: 10.1097/YPG.0b013e32834dc40d.
2
Genome-wide association study reveals greater polygenic loading for schizophrenia in cases with a family history of illness.全基因组关联研究表明,有家族病史的精神分裂症患者具有更高的多基因负荷。
Am J Med Genet B Neuropsychiatr Genet. 2016 Mar;171B(2):276-89. doi: 10.1002/ajmg.b.32402. Epub 2015 Dec 11.
3
Gene-wide analyses of genome-wide association data sets: evidence for multiple common risk alleles for schizophrenia and bipolar disorder and for overlap in genetic risk.全基因组关联数据集的基因层面分析:精神分裂症和双相情感障碍存在多个常见风险等位基因及遗传风险重叠的证据。
Mol Psychiatry. 2009 Mar;14(3):252-60. doi: 10.1038/mp.2008.133. Epub 2008 Dec 9.
4
Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis.五种主要精神疾病具有共同影响的风险基因座的鉴定:全基因组分析。
Lancet. 2013 Apr 20;381(9875):1371-1379. doi: 10.1016/S0140-6736(12)62129-1. Epub 2013 Feb 28.
5
Genetic relationships between suicide attempts, suicidal ideation and major psychiatric disorders: a genome-wide association and polygenic scoring study.自杀未遂、自杀意念与主要精神障碍之间的遗传关系:全基因组关联和多基因评分研究。
Am J Med Genet B Neuropsychiatr Genet. 2014 Jul;165B(5):428-37. doi: 10.1002/ajmg.b.32247. Epub 2014 Jun 25.
6
Identifying novel chemical-related susceptibility genes for five psychiatric disorders through integrating genome-wide association study and tissue-specific 3'aQTL annotation datasets.通过整合全基因组关联研究和组织特异性3'aQTL注释数据集,鉴定五种精神疾病与化学物质相关的新型易感基因。
Eur Arch Psychiatry Clin Neurosci. 2025 Apr;275(3):851-862. doi: 10.1007/s00406-023-01753-0. Epub 2024 Feb 2.
7
Molecular genetic overlap in bipolar disorder, schizophrenia, and major depressive disorder.双相情感障碍、精神分裂症和重度抑郁症中的分子遗传学重叠。
World J Biol Psychiatry. 2014 Apr;15(3):200-8. doi: 10.3109/15622975.2012.662282. Epub 2012 Mar 9.
8
Common variants at 2q11.2, 8q21.3, and 11q13.2 are associated with major mood disorders.常见变异位于 2q11.2、8q21.3 和 11q13.2 与主要心境障碍有关。
Transl Psychiatry. 2017 Dec 11;7(12):1273. doi: 10.1038/s41398-017-0019-0.
9
GWAS of Suicide Attempt in Psychiatric Disorders and Association With Major Depression Polygenic Risk Scores.精神障碍自杀未遂的全基因组关联研究及与重度抑郁症多基因风险评分的关联
Am J Psychiatry. 2019 Aug 1;176(8):651-660. doi: 10.1176/appi.ajp.2019.18080957. Epub 2019 Jun 5.
10
Genome-wide Association for Major Depression Through Age at Onset Stratification: Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium.通过发病年龄分层进行的全基因组关联研究:精神基因组学联盟重度抑郁症工作组
Biol Psychiatry. 2017 Feb 15;81(4):325-335. doi: 10.1016/j.biopsych.2016.05.010. Epub 2016 May 24.

引用本文的文献

1
A Systematic Review of Genetics- and Molecular-Pathway-Based Machine Learning Models for Neurological Disorder Diagnosis.基于遗传学和分子途径的机器学习模型在神经障碍诊断中的系统评价
Int J Mol Sci. 2024 Jun 11;25(12):6422. doi: 10.3390/ijms25126422.
2
Deep Learning Framework for Complex Disease Risk Prediction Using Genomic Variations.深度学习框架用于使用基因组变异预测复杂疾病风险。
Sensors (Basel). 2023 May 1;23(9):4439. doi: 10.3390/s23094439.
3
Machine learning algorithm performance evaluation in structural magnetic resonance imaging-based classification of pediatric bipolar disorders type I patients.

本文引用的文献

1
Biomedical text mining and its applications.生物医学文本挖掘及其应用。
PLoS Comput Biol. 2009 Dec;5(12):e1000597. doi: 10.1371/journal.pcbi.1000597. Epub 2009 Dec 24.
2
Simultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studies.同时进行基因型调用和单倍型相位分析可提高全基因组关联研究的基因型准确性,并减少假阳性关联。
Am J Hum Genet. 2009 Dec;85(6):847-61. doi: 10.1016/j.ajhg.2009.11.004.
3
A comparison of classification methods for predicting Chronic Fatigue Syndrome based on genetic data.
基于结构磁共振成像的I型小儿双相情感障碍患者分类中机器学习算法性能评估
Front Comput Neurosci. 2022 Aug 23;16:915477. doi: 10.3389/fncom.2022.915477. eCollection 2022.
4
Prediction of Mental Health in Medical Workers During COVID-19 Based on Machine Learning.基于机器学习的新冠疫情期间医护人员心理健康预测
Front Public Health. 2021 Sep 7;9:697850. doi: 10.3389/fpubh.2021.697850. eCollection 2021.
5
Machine learning for genetic prediction of psychiatric disorders: a systematic review.机器学习在精神障碍遗传预测中的应用:系统综述。
Mol Psychiatry. 2021 Jan;26(1):70-79. doi: 10.1038/s41380-020-0825-2. Epub 2020 Jun 26.
6
An evaluation of machine-learning for predicting phenotype: studies in yeast, rice, and wheat.用于预测表型的机器学习评估:酵母、水稻和小麦的研究
Mach Learn. 2020;109(2):251-277. doi: 10.1007/s10994-019-05848-5. Epub 2019 Oct 23.
7
Data Mining Algorithms and Techniques in Mental Health: A Systematic Review.数据挖掘算法和技术在精神健康中的应用:系统评价。
J Med Syst. 2018 Jul 21;42(9):161. doi: 10.1007/s10916-018-1018-2.
8
Evaluation of potential novel variations and their interactions related to bipolar disorders: analysis of genome-wide association study data.双相情感障碍相关潜在新变异及其相互作用的评估:全基因组关联研究数据分析
Neuropsychiatr Dis Treat. 2016 Nov 24;12:2997-3004. doi: 10.2147/NDT.S112558. eCollection 2016.
9
Influence of Feature Encoding and Choice of Classifier on Disease Risk Prediction in Genome-Wide Association Studies.全基因组关联研究中特征编码和分类器选择对疾病风险预测的影响
PLoS One. 2015 Aug 18;10(8):e0135832. doi: 10.1371/journal.pone.0135832. eCollection 2015.
10
The genetic interacting landscape of 63 candidate genes in Major Depressive Disorder: an explorative study.63 个候选基因在重度抑郁症中的遗传相互作用景观:一项探索性研究。
BioData Min. 2014 Sep 9;7:19. doi: 10.1186/1756-0381-7-19. eCollection 2014.
基于基因数据预测慢性疲劳综合征的分类方法比较。
J Transl Med. 2009 Sep 22;7:81. doi: 10.1186/1479-5876-7-81.
4
Genome-wide linkage and follow-up association study of postpartum mood symptoms.产后情绪症状的全基因组连锁与后续关联研究。
Am J Psychiatry. 2009 Nov;166(11):1229-37. doi: 10.1176/appi.ajp.2009.09030417. Epub 2009 Sep 15.
5
Assessing the druggability of protein-protein interactions by a supervised machine-learning method.运用有监督机器学习方法评估蛋白-蛋白相互作用的成药性。
BMC Bioinformatics. 2009 Aug 25;10:263. doi: 10.1186/1471-2105-10-263.
6
Common polygenic variation contributes to risk of schizophrenia and bipolar disorder.常见的多基因变异会增加患精神分裂症和双相情感障碍的风险。
Nature. 2009 Aug 6;460(7256):748-52. doi: 10.1038/nature08185. Epub 2009 Jul 1.
7
Genome-wide association study of bipolar disorder in European American and African American individuals.欧美裔和非裔美国人双相情感障碍的全基因组关联研究。
Mol Psychiatry. 2009 Aug;14(8):755-63. doi: 10.1038/mp.2009.43. Epub 2009 Jun 2.
8
Functional and evolutionary insights into human brain development through global transcriptome analysis.通过全球转录组分析对人类大脑发育的功能和进化洞察
Neuron. 2009 May 28;62(4):494-509. doi: 10.1016/j.neuron.2009.03.027.
9
A hybrid approach for biomarker discovery from microarray gene expression data for cancer classification.一种用于从微阵列基因表达数据中发现生物标志物以进行癌症分类的混合方法。
Cancer Inform. 2007 Feb 22;2:301-11.
10
Advances and pitfalls in protein structure prediction.蛋白质结构预测的进展与陷阱
Curr Protein Pept Sci. 2008 Dec;9(6):567-77. doi: 10.2174/138920308786733958.