• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用基于熵的核密度估计识别基因与数量性状之间的关联。

Identification of the associations between genes and quantitative traits using entropy-based kernel density estimation.

作者信息

Yee Jaeyong, Park Taesung, Park Mira

机构信息

Department of Physiology and Biophysics, Eulji University, Daejeon 34824, Korea.

Department of Statistics, Seoul National University, Seoul 08826, Korea.

出版信息

Genomics Inform. 2022 Jun;20(2):e17. doi: 10.5808/gi.22033. Epub 2022 Jun 30.

DOI:10.5808/gi.22033
PMID:35794697
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9299569/
Abstract

Genetic associations have been quantified using a number of statistical measures. Entropy-based mutual information may be one of the more direct ways of estimating the association, in the sense that it does not depend on the parametrization. For this purpose, both the entropy and conditional entropy of the phenotype distribution should be obtained. Quantitative traits, however, do not usually allow an exact evaluation of entropy. The estimation of entropy needs a probability density function, which can be approximated by kernel density estimation. We have investigated the proper sequence of procedures for combining the kernel density estimation and entropy estimation with a probability density function in order to calculate mutual information. Genotypes and their interactions were constructed to set the conditions for conditional entropy. Extensive simulation data created using three types of generating functions were analyzed using two different kernels as well as two types of multifactor dimensionality reduction and another probability density approximation method called m-spacing. The statistical power in terms of correct detection rates was compared. Using kernels was found to be most useful when the trait distributions were more complex than simple normal or gamma distributions. A full-scale genomic dataset was explored to identify associations using the 2-h oral glucose tolerance test results and γ-glutamyl transpeptidase levels as phenotypes. Clearly distinguishable single-nucleotide polymorphisms (SNPs) and interacting SNP pairs associated with these phenotypes were found and listed with empirical p-values.

摘要

已经使用多种统计方法对基因关联进行了量化。基于熵的互信息可能是估计关联的更直接方法之一,因为它不依赖于参数化。为此,应该获得表型分布的熵和条件熵。然而,数量性状通常不允许对熵进行精确评估。熵的估计需要一个概率密度函数,它可以通过核密度估计来近似。我们研究了将核密度估计和熵估计与概率密度函数相结合以计算互信息的正确程序顺序。构建基因型及其相互作用以设定条件熵的条件。使用两种不同的核以及两种类型的多因素降维方法和另一种称为m间距的概率密度近似方法,对使用三种类型生成函数创建的大量模拟数据进行了分析。比较了正确检测率方面的统计功效。当性状分布比简单的正态或伽马分布更复杂时,发现使用核最为有用。使用2小时口服葡萄糖耐量试验结果和γ-谷氨酰转肽酶水平作为表型,探索了一个完整规模的基因组数据集以识别关联。发现了与这些表型相关的明显可区分的单核苷酸多态性(SNP)和相互作用的SNP对,并列出了经验p值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eea4/9299569/4741aa307656/gi-22033f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eea4/9299569/d606d6a93737/gi-22033f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eea4/9299569/1a3952be2a07/gi-22033f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eea4/9299569/6c7e1e7d44dd/gi-22033f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eea4/9299569/2d696e72a851/gi-22033f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eea4/9299569/4741aa307656/gi-22033f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eea4/9299569/d606d6a93737/gi-22033f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eea4/9299569/1a3952be2a07/gi-22033f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eea4/9299569/6c7e1e7d44dd/gi-22033f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eea4/9299569/2d696e72a851/gi-22033f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eea4/9299569/4741aa307656/gi-22033f5.jpg

相似文献

1
Identification of the associations between genes and quantitative traits using entropy-based kernel density estimation.使用基于熵的核密度估计识别基因与数量性状之间的关联。
Genomics Inform. 2022 Jun;20(2):e17. doi: 10.5808/gi.22033. Epub 2022 Jun 30.
2
Detecting Genetic Interactions for Quantitative Traits Using m-Spacing Entropy Measure.使用m间距熵测度检测数量性状的基因相互作用。
Biomed Res Int. 2015;2015:523641. doi: 10.1155/2015/523641. Epub 2015 Aug 3.
3
A modified entropy-based approach for identifying gene-gene interactions in case-control study.基于改进的熵方法的病例对照研究中基因-基因交互作用的识别。
PLoS One. 2013 Jul 18;8(7):e69321. doi: 10.1371/journal.pone.0069321. Print 2013.
4
Lower-order effects adjustment in quantitative traits model-based multifactor dimensionality reduction.基于数量性状模型的多因子降维的低阶效应调整。
PLoS One. 2012;7(1):e29594. doi: 10.1371/journal.pone.0029594. Epub 2012 Jan 5.
5
Nonparametric entropy estimation using kernel densities.使用核密度的非参数熵估计
Methods Enzymol. 2009;467:531-546. doi: 10.1016/S0076-6879(09)67020-8.
6
Learning Gaussian mixture models with entropy-based criteria.使用基于熵的准则学习高斯混合模型。
IEEE Trans Neural Netw. 2009 Nov;20(11):1756-71. doi: 10.1109/TNN.2009.2030190. Epub 2009 Sep 18.
7
Prioritizing individual genetic variants after kernel machine testing using variable selection.在使用变量选择的核机器测试后对个体遗传变异进行优先级排序。
Genet Epidemiol. 2016 Dec;40(8):722-731. doi: 10.1002/gepi.21993. Epub 2016 Aug 3.
8
A comparative study on the unified model based multifactor dimensionality reduction methods for identifying gene-gene interactions associated with the survival phenotype.基于统一模型的多因素降维方法识别与生存表型相关的基因-基因相互作用的比较研究。
BioData Min. 2021 Mar 1;14(1):17. doi: 10.1186/s13040-021-00248-9.
9
A Fast Multiple-Kernel Method With Applications to Detect Gene-Environment Interaction.一种应用于检测基因-环境相互作用的快速多核方法。
Genet Epidemiol. 2015 Sep;39(6):456-68. doi: 10.1002/gepi.21909. Epub 2015 Jul 3.
10
Extraction of configurational entropy from molecular simulations via an expansion approximation.通过展开近似从分子模拟中提取构型熵
J Chem Phys. 2007 Jul 14;127(2):024107. doi: 10.1063/1.2746329.

本文引用的文献

1
A Genome-Wide Association Study of a Korean Population Identifies Genetic Susceptibility to Hypertension Based on Sex-Specific Differences.一项基于韩国人群的全基因组关联研究,鉴定了性别特异性差异与高血压遗传易感性的关系。
Genes (Basel). 2021 Nov 16;12(11):1804. doi: 10.3390/genes12111804.
2
Information Theory in Computational Biology: Where We Stand Today.计算生物学中的信息论:我们如今的现状
Entropy (Basel). 2020 Jun 6;22(6):627. doi: 10.3390/e22060627.
3
Benefits and limitations of genome-wide association studies.全基因组关联研究的优势和局限性。
Nat Rev Genet. 2019 Aug;20(8):467-484. doi: 10.1038/s41576-019-0127-1.
4
Effects of diabetic ketoacidosis in the respiratory system.糖尿病酮症酸中毒对呼吸系统的影响。
World J Diabetes. 2019 Jan 15;10(1):16-22. doi: 10.4239/wjd.v10.i1.16.
5
Genome-wide association study of high-altitude pulmonary edema in a Han Chinese population.汉族人群高原肺水肿的全基因组关联研究
Oncotarget. 2017 May 9;8(19):31568-31580. doi: 10.18632/oncotarget.16362.
6
Serum gamma-glutamyl transferase and risk of type 2 diabetes in the general Korean population: a Mendelian randomization study.韩国普通人群中血清γ-谷氨酰转移酶与2型糖尿病风险:一项孟德尔随机化研究
Hum Mol Genet. 2016 Sep 1;25(17):3877-3886. doi: 10.1093/hmg/ddw226. Epub 2016 Jul 27.
7
No Interaction with Alcohol Consumption, but Independent Effect of C12orf51 (HECTD4) on Type 2 Diabetes Mellitus in Korean Adults Aged 40-69 Years: The KoGES_Ansan and Ansung Study.与饮酒无相互作用,但C12orf51(HECTD4)对40 - 69岁韩国成年人2型糖尿病有独立影响:韩国基因组与流行病学研究(KoGES)之安山和安城研究
PLoS One. 2016 Feb 18;11(2):e0149321. doi: 10.1371/journal.pone.0149321. eCollection 2016.
8
Detecting Genetic Interactions for Quantitative Traits Using m-Spacing Entropy Measure.使用m间距熵测度检测数量性状的基因相互作用。
Biomed Res Int. 2015;2015:523641. doi: 10.1155/2015/523641. Epub 2015 Aug 3.
9
Mutual information between discrete and continuous data sets.离散数据集与连续数据集之间的互信息。
PLoS One. 2014 Feb 19;9(2):e87357. doi: 10.1371/journal.pone.0087357. eCollection 2014.
10
A modified entropy-based approach for identifying gene-gene interactions in case-control study.基于改进的熵方法的病例对照研究中基因-基因交互作用的识别。
PLoS One. 2013 Jul 18;8(7):e69321. doi: 10.1371/journal.pone.0069321. Print 2013.