• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于两阶段互信息的贝叶斯套索算法用于多位点全基因组关联研究

A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies.

作者信息

Guo Hongping, Yu Zuguo, An Jiyuan, Han Guosheng, Ma Yuanlin, Tang Runbin

机构信息

Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan 411105, China.

School of Mathematics and Computer Science, Hanjiang Normal University, Shiyan 442000, China.

出版信息

Entropy (Basel). 2020 Mar 13;22(3):329. doi: 10.3390/e22030329.

DOI:10.3390/e22030329
PMID:33286103
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7516787/
Abstract

Genome-wide association study (GWAS) has turned out to be an essential technology for exploring the genetic mechanism of complex traits. To reduce the complexity of computation, it is well accepted to remove unrelated single nucleotide polymorphisms (SNPs) before GWAS, e.g., by using iterative sure independence screening expectation-maximization Bayesian Lasso (ISIS EM-BLASSO) method. In this work, a modified version of ISIS EM-BLASSO is proposed, which reduces the number of SNPs by a screening methodology based on Pearson correlation and mutual information, then estimates the effects via EM-Bayesian Lasso (EM-BLASSO), and finally detects the true quantitative trait nucleotides (QTNs) through likelihood ratio test. We call our method a two-stage mutual information based Bayesian Lasso (MBLASSO). Under three simulation scenarios, MBLASSO improves the statistical power and retains the higher effect estimation accuracy when comparing with three other algorithms. Moreover, MBLASSO performs best on model fitting, the accuracy of detected associations is the highest, and 21 genes can only be detected by MBLASSO in datasets.

摘要

全基因组关联研究(GWAS)已成为探索复杂性状遗传机制的一项重要技术。为降低计算复杂度,在GWAS之前去除不相关的单核苷酸多态性(SNP)已被广泛接受,例如,通过使用迭代确定独立筛选期望最大化贝叶斯套索(ISIS EM-BLASSO)方法。在这项工作中,提出了一种ISIS EM-BLASSO的改进版本,该版本通过基于皮尔逊相关性和互信息的筛选方法减少SNP的数量,然后通过EM-贝叶斯套索(EM-BLASSO)估计效应,最后通过似然比检验检测真正的数量性状核苷酸(QTN)。我们将我们的方法称为基于两阶段互信息的贝叶斯套索(MBLASSO)。在三种模拟场景下,与其他三种算法相比,MBLASSO提高了统计功效并保持了较高的效应估计准确性。此外,MBLASSO在模型拟合方面表现最佳,检测到的关联准确性最高,并且在数据集中只有MBLASSO能检测到21个基因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1791/7516787/b78bb1e0dc46/entropy-22-00329-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1791/7516787/e4da63314467/entropy-22-00329-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1791/7516787/c18996d20bad/entropy-22-00329-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1791/7516787/1dbc4a5e2995/entropy-22-00329-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1791/7516787/b78bb1e0dc46/entropy-22-00329-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1791/7516787/e4da63314467/entropy-22-00329-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1791/7516787/c18996d20bad/entropy-22-00329-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1791/7516787/1dbc4a5e2995/entropy-22-00329-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1791/7516787/b78bb1e0dc46/entropy-22-00329-g004.jpg

相似文献

1
A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies.一种基于两阶段互信息的贝叶斯套索算法用于多位点全基因组关联研究
Entropy (Basel). 2020 Mar 13;22(3):329. doi: 10.3390/e22030329.
2
Iterative sure independence screening EM-Bayesian LASSO algorithm for multi-locus genome-wide association studies.用于多位点全基因组关联研究的迭代确定独立筛选EM-贝叶斯套索算法
PLoS Comput Biol. 2017 Jan 31;13(1):e1005357. doi: 10.1371/journal.pcbi.1005357. eCollection 2017 Jan.
3
An Efficient Score Test Integrated with Empirical Bayes for Genome-Wide Association Studies.一种用于全基因组关联研究的与经验贝叶斯相结合的高效计分检验。
Front Genet. 2021 Oct 1;12:742752. doi: 10.3389/fgene.2021.742752. eCollection 2021.
4
A fast algorithm for Bayesian multi-locus model in genome-wide association studies.全基因组关联研究中贝叶斯多位点模型的快速算法。
Mol Genet Genomics. 2017 Aug;292(4):923-934. doi: 10.1007/s00438-017-1322-4. Epub 2017 May 22.
5
The Application of Multi-Locus GWAS for the Detection of Salt-Tolerance Loci in Rice.多位点全基因组关联研究在水稻耐盐位点检测中的应用
Front Plant Sci. 2018 Oct 4;9:1464. doi: 10.3389/fpls.2018.01464. eCollection 2018.
6
Single-Locus and Multi-Locus Genome-Wide Association Studies for Intramuscular Fat in Duroc Pigs.杜洛克猪肌内脂肪的单基因座和多基因座全基因组关联研究
Front Genet. 2019 Jun 28;10:619. doi: 10.3389/fgene.2019.00619. eCollection 2019.
7
Multi-Locus Genome-Wide Association Study and Genomic Selection of Kernel Moisture Content at the Harvest Stage in Maize.玉米收获期籽粒含水量的多位点全基因组关联研究及基因组选择
Front Plant Sci. 2021 Jul 9;12:697688. doi: 10.3389/fpls.2021.697688. eCollection 2021.
8
How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures?基于汇总数据的方法在不同遗传结构下识别表达性状关联的能力有多强?
Pac Symp Biocomput. 2018;23:228-239.
9
Genome-Wide Association Mapping of Starch Pasting Properties in Maize Using Single-Locus and Multi-Locus Models.利用单基因座和多基因座模型对玉米淀粉糊化特性进行全基因组关联定位
Front Plant Sci. 2018 Sep 5;9:1311. doi: 10.3389/fpls.2018.01311. eCollection 2018.
10
TSLRF: Two-Stage Algorithm Based on Least Angle Regression and Random Forest in genome-wide association studies.TSLRF:基于最小角回归和随机森林的全基因组关联研究两阶段算法。
Sci Rep. 2019 Dec 2;9(1):18034. doi: 10.1038/s41598-019-54519-x.

引用本文的文献

1
Improving genetic variant identification for quantitative traits using ensemble learning-based approaches.使用基于集成学习的方法改进数量性状的遗传变异识别。
BMC Genomics. 2025 Mar 12;26(1):237. doi: 10.1186/s12864-025-11443-x.
2
Explainable artificial intelligence for genotype-to-phenotype prediction in plant breeding: a case study with a dataset from an almond germplasm collection.用于植物育种中基因型到表型预测的可解释人工智能:以杏仁种质资源库数据集为例的研究
Front Plant Sci. 2024 Sep 9;15:1434229. doi: 10.3389/fpls.2024.1434229. eCollection 2024.
3
MIDESP: Mutual Information-Based Detection of Epistatic SNP Pairs for Qualitative and Quantitative Phenotypes.

本文引用的文献

1
AraPheno and the AraGWAS Catalog 2020: a major database update including RNA-Seq and knockout mutation data for Arabidopsis thaliana.AraPheno 和 AraGWAS 目录 2020:一个主要的数据库更新,包括拟南芥的 RNA-Seq 和敲除突变数据。
Nucleic Acids Res. 2020 Jan 8;48(D1):D1063-D1068. doi: 10.1093/nar/gkz925.
2
The Application of Multi-Locus GWAS for the Detection of Salt-Tolerance Loci in Rice.多位点全基因组关联研究在水稻耐盐位点检测中的应用
Front Plant Sci. 2018 Oct 4;9:1464. doi: 10.3389/fpls.2018.01464. eCollection 2018.
3
pKWmEB: integration of Kruskal-Wallis test with empirical Bayes under polygenic background control for multi-locus genome-wide association study.
MIDESP:基于互信息的定性和定量表型上位性SNP对检测
Biology (Basel). 2021 Sep 16;10(9):921. doi: 10.3390/biology10090921.
4
Ensemble Linear Subspace Analysis of High-Dimensional Data.高维数据的集成线性子空间分析
Entropy (Basel). 2021 Mar 9;23(3):324. doi: 10.3390/e23030324.
pKWmEB:多基因背景控制下的 Kruskal-Wallis 检验与经验贝叶斯的整合,用于多基因座全基因组关联研究。
Heredity (Edinb). 2018 Mar;120(3):208-218. doi: 10.1038/s41437-017-0007-4. Epub 2017 Dec 13.
4
Two-stage identification of SNP effects on dynamic poplar growth.基于两阶段方法鉴定 SNP 对杨树生长动态的影响。
Plant J. 2018 Jan;93(2):286-296. doi: 10.1111/tpj.13777. Epub 2017 Dec 28.
5
Variable Selection via Partial Correlation.通过偏相关进行变量选择。
Stat Sin. 2017 Jul;27(3):983-996. doi: 10.5705/ss.202015.0473.
6
pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies.pLARmEB:用于多位点全基因组关联研究的最小角回归与经验贝叶斯方法的整合
Heredity (Edinb). 2017 Jun;118(6):517-524. doi: 10.1038/hdy.2017.8. Epub 2017 Mar 15.
7
Iterative sure independence screening EM-Bayesian LASSO algorithm for multi-locus genome-wide association studies.用于多位点全基因组关联研究的迭代确定独立筛选EM-贝叶斯套索算法
PLoS Comput Biol. 2017 Jan 31;13(1):e1005357. doi: 10.1371/journal.pcbi.1005357. eCollection 2017 Jan.
8
Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology.通过多基因座混合线性模型方法提高全基因组关联研究的效能和准确性。
Sci Rep. 2016 Jan 20;6:19444. doi: 10.1038/srep19444.
9
2HiGWAS: a unifying high-dimensional platform to infer the global genetic architecture of trait development.2HiGWAS:一个用于推断性状发育的全球遗传结构的统一高维平台。
Brief Bioinform. 2015 Nov;16(6):905-11. doi: 10.1093/bib/bbv002. Epub 2015 Feb 19.
10
Challenges of Big Data Analysis.大数据分析的挑战
Natl Sci Rev. 2014 Jun;1(2):293-314. doi: 10.1093/nsr/nwt032.