
A U-Statistic-based random Forest approach for genetic association study.

Authors

Li Ming, Peng Ruo-Sin, Wei Changshuai, Lu Qing

Affiliation

Department of Epidemiology, Michigan State University, East Lansing, MI 48824, USA.

Publication information

Front Biosci (Elite Ed). 2012 Jun 1;4(7):2607-2617. doi: 10.2741/e576.

DOI:10.2741/e576
PMID:22652671
Abstract

Variations in complex traits are influenced by multiple genetic variants, environmental risk factors, and their interactions. Though substantial progress has been made in identifying single genetic variants associated with complex traits, detecting the gene-gene and gene-environment interactions remains a great challenge. When a large number of genetic variants and environmental risk factors are involved, searching for interactions is limited to pair-wise interactions due to the exponentially increased feature space and computational intensity. Alternatively, recursive partitioning approaches, such as random forests, have gained popularity in high-dimensional genetic association studies. In this article, we propose a U-Statistic-based random forest approach, referred to as Forest U-Test, for genetic association studies with quantitative traits. Through simulation studies, we showed that the Forest U-Test outperformed existing methods. The proposed method was also applied to study Cannabis Dependence (CD), using three independent datasets from the Study of Addiction: Genetics and Environment. A significant joint association was detected with an empirical p-value less than 0.001. The finding was also replicated in two independent datasets with p-values of 5.93e-19 and 4.70e-17, respectively.
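
As a rough illustration of two ideas the abstract mentions, a U-statistic for a quantitative trait and an empirical p-value, the following minimal sketch may help. It is not the Forest U-Test itself: the abstract does not specify the kernel, the tree construction, or the resampling scheme, so the kernel `h(a, b) = a - b`, the permutation procedure, and the function names `u_statistic` and `empirical_p_value` are all assumptions made for illustration only.

```python
import random
from itertools import product


def u_statistic(group_a, group_b):
    """Two-sample U-statistic with the (assumed) kernel h(a, b) = a - b,
    averaged over all cross-group pairs of trait values."""
    pairs = list(product(group_a, group_b))
    return sum(a - b for a, b in pairs) / len(pairs)


def empirical_p_value(group_a, group_b, n_perm=1000, seed=0):
    """Permutation-based empirical p-value for |U| (illustrative scheme,
    not taken from the paper)."""
    rng = random.Random(seed)
    observed = abs(u_statistic(group_a, group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    hits = 0
    for _ in range(n_perm):
        # Shuffle group labels by shuffling the pooled trait values.
        rng.shuffle(pooled)
        if abs(u_statistic(pooled[:n_a], pooled[n_a:])) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)
```

Here `group_a` and `group_b` would hold trait values for individuals in two hypothetical genotype groups. With 1000 permutations, the smallest attainable p-value under this scheme is 1/1001, which is one way a result can be reported as "less than 0.001".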

Similar articles

1
A U-Statistic-based random Forest approach for genetic association study.
Front Biosci (Elite Ed). 2012 Jun 1;4(7):2607-2617. doi: 10.2741/e576.
2
Detecting genetic interactions for quantitative traits with U-statistics.
Genet Epidemiol. 2011 Sep;35(6):457-68. doi: 10.1002/gepi.20594. Epub 2011 May 26.
3
Random forests on Hadoop for genome-wide association studies of multivariate neuroimaging phenotypes.
BMC Bioinformatics. 2013;14 Suppl 16(Suppl 16):S6. doi: 10.1186/1471-2105-14-S16-S6. Epub 2013 Oct 22.
4
Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2164-16-S2-S5. Epub 2015 Jan 21.
5
SNP selection and classification of genome-wide SNP data using stratified sampling random forests.
IEEE Trans Nanobioscience. 2012 Sep;11(3):216-27. doi: 10.1109/TNB.2012.2214232.
6
Trees Assembling Mann-Whitney approach for detecting genome-wide joint association among low-marginal-effect loci.
Genet Epidemiol. 2013 Jan;37(1):84-91. doi: 10.1002/gepi.21693. Epub 2012 Nov 7.
7
Detecting Gene-Gene Interactions Associated with Multiple Complex Traits with U-Statistics.
Curr Genomics. 2016 Oct;17(5):403-415. doi: 10.2174/1389202917666160513100946.
8
Family-based gene-environment interaction using sequence kernel association test (FGE-SKAT) for complex quantitative traits.
Sci Rep. 2021 Apr 1;11(1):7431. doi: 10.1038/s41598-021-86871-2.
9
An optimal kernel-based U-statistic method for quantitative gene-set association analysis.
Genet Epidemiol. 2019 Mar;43(2):137-149. doi: 10.1002/gepi.22170. Epub 2018 Nov 19.
10
Identifying Genetic Variants for Addiction via Propensity Score Adjusted Generalized Kendall's Tau.
J Am Stat Assoc. 2014;109(507):905-930. doi: 10.1080/01621459.2014.901223.

Cited by

1
Inter-ethnic differences in genetic polymorphisms of xenobiotic-metabolizing enzymes (CYP1A1, CYP2D6, NAT1 and NAT2) in healthy populations: correlation with the functional in silico prediction.
Mol Biol Rep. 2014 Sep;41(9):5735-43. doi: 10.1007/s11033-014-3445-6. Epub 2014 Jun 17.