
Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models.

Authors

Liu Dawei, Lin Xihong, Ghosh Debashis

Affiliation

Center for Statistical Sciences, Brown University, Providence, Rhode Island 02912, USA.

Publication

Biometrics. 2007 Dec;63(4):1079-88. doi: 10.1111/j.1541-0420.2007.00799.x.

DOI: 10.1111/j.1541-0420.2007.00799.x
PMID: 18078480
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2665800/
Abstract

We consider a semiparametric regression model that relates a normal outcome to covariates and a genetic pathway, where the covariate effects are modeled parametrically and the pathway effect of multiple gene expressions is modeled parametrically or nonparametrically using least-squares kernel machines (LSKMs). This unified framework allows a flexible function for the joint effect of multiple genes within a pathway by specifying a kernel function and allows for the possibility that each gene expression effect might be nonlinear and the genes within the same pathway are likely to interact with each other in a complicated way. This semiparametric model also makes it possible to test for the overall genetic pathway effect. We show that the LSKM semiparametric regression can be formulated using a linear mixed model. Estimation and inference hence can proceed within the linear mixed model framework using standard mixed model software. Both the regression coefficients of the covariate effects and the LSKM estimator of the genetic pathway effect can be obtained using the best linear unbiased predictor in the corresponding linear mixed model formulation. The smoothing parameter and the kernel parameter can be estimated as variance components using restricted maximum likelihood. A score test is developed to test for the genetic pathway effect. Model/variable selection within the LSKM framework is discussed. The methods are illustrated using a prostate cancer data set and evaluated using simulations.
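The mixed-model view described in the abstract can be illustrated with a small numerical sketch. It assumes the model y = Xβ + h(Z) + e with a Gaussian kernel of the form K(z, z') = exp(-||z - z'||² / ρ), and it treats the smoothing parameter λ and the kernel parameter ρ as fixed inputs rather than estimating them by restricted maximum likelihood as the paper does. The function names (`gaussian_kernel`, `lskm_fit`), the simulated data, and the parameter values are hypothetical and chosen only for illustration. Under these assumptions, the penalized least-squares solution is β̂ = (XᵀA⁻¹X)⁻¹XᵀA⁻¹y and ĥ = K A⁻¹(y - Xβ̂) with A = K + λI, which has the same form as the BLUP equations of a linear mixed model with random effect h ~ N(0, τK) and τ = σ²/λ.

```python
import numpy as np

def gaussian_kernel(Z, rho):
    """Gaussian kernel matrix K[i, j] = exp(-||z_i - z_j||^2 / rho) (assumed form)."""
    sq = np.sum(Z**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * Z @ Z.T
    # Clip tiny negative values caused by floating-point rounding.
    return np.exp(-np.maximum(d2, 0.0) / rho)

def lskm_fit(y, X, Z, lam, rho):
    """Least-squares kernel machine fit for fixed lam and rho (illustrative only).

    Minimizes ||y - X beta - K alpha||^2 + lam * alpha' K alpha and returns
    beta_hat and h_hat = K alpha_hat.
    """
    n = len(y)
    K = gaussian_kernel(Z, rho)
    A = K + lam * np.eye(n)              # proportional to the marginal covariance of y
    AinvX = np.linalg.solve(A, X)
    Ainvy = np.linalg.solve(A, y)
    beta = np.linalg.solve(X.T @ AinvX, X.T @ Ainvy)   # generalized least squares
    alpha = np.linalg.solve(A, y - X @ beta)
    h = K @ alpha                        # predictor of the pathway effect
    return beta, h

# Simulated example: two covariates (including an intercept) and five "genes".
rng = np.random.default_rng(0)
n = 60
X = np.column_stack([np.ones(n), rng.normal(size=n)])
Z = rng.normal(size=(n, 5))
y = X @ np.array([1.0, 2.0]) + np.sin(Z[:, 0]) * Z[:, 1] + 0.1 * rng.normal(size=n)

beta_hat, h_hat = lskm_fit(y, X, Z, lam=0.5, rho=5.0)
```

In practice, λ and ρ would not be fixed by hand; the abstract indicates they can be estimated as variance components with standard mixed-model software.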


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7b8/2665800/5479b6dc5aea/nihms-58177-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a7b8/2665800/235ea9c52581/nihms-58177-f0001.jpg

Similar articles

1
Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models.
Biometrics. 2007 Dec;63(4):1079-88. doi: 10.1111/j.1541-0420.2007.00799.x.
2
Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models.
BMC Bioinformatics. 2008 Jun 24;9:292. doi: 10.1186/1471-2105-9-292.
3
Testing and estimation in marker-set association study using semiparametric quantile regression kernel machine.
Biometrics. 2016 Jun;72(2):364-71. doi: 10.1111/biom.12438. Epub 2015 Nov 17.
4
Semiparametric estimation in generalized linear mixed models with auxiliary covariates: a pairwise likelihood approach.
Biometrics. 2014 Dec;70(4):910-9. doi: 10.1111/biom.12208. Epub 2014 Sep 23.
5
Bayesian inference in semiparametric mixed models for longitudinal data.
Biometrics. 2010 Mar;66(1):70-8. doi: 10.1111/j.1541-0420.2009.01227.x. Epub 2009 May 7.
6
Variable selection for semiparametric mixed models in longitudinal studies.
Biometrics. 2010 Mar;66(1):79-88. doi: 10.1111/j.1541-0420.2009.01240.x. Epub 2009 Apr 13.
7
Adjustment for missingness using auxiliary information in semiparametric regression.
Biometrics. 2010 Mar;66(1):115-22. doi: 10.1111/j.1541-0420.2009.01231.x. Epub 2009 May 7.
8
Robust alternatives to the F-Test in mixed linear models based on MM-estimates.
Biometrics. 2007 Dec;63(4):1045-52. doi: 10.1111/j.1541-0420.2007.00804.x. Epub 2007 May 2.
9
Semiparametric regression in size-biased sampling.
Biometrics. 2010 Mar;66(1):149-58. doi: 10.1111/j.1541-0420.2009.01260.x. Epub 2009 May 4.
10
Systematic benchmarking of microarray data classification: assessing the role of non-linearity and dimensionality reduction.
Bioinformatics. 2004 Nov 22;20(17):3185-95. doi: 10.1093/bioinformatics/bth383. Epub 2004 Jul 1.

Cited by

1
Weighted overlapping group lasso for integrating prior network knowledge into gene set analysis.
BMC Bioinformatics. 2025 Sep 1;26(1):226. doi: 10.1186/s12859-025-06170-9.
2
Assessing the impact of air pollution on lung function in South Korea using Bayesian kernel machine regression.
Sci Rep. 2025 Sep 1;15(1):32138. doi: 10.1038/s41598-025-17352-z.
3
SpaceBF: Spatial coexpression analysis using Bayesian Fused approaches in spatial omics datasets.
bioRxiv. 2025 Jun 22:2025.03.29.646124. doi: 10.1101/2025.03.29.646124.
4
STANCE: a unified statistical model to detect cell-type-specific spatially variable genes in spatial transcriptomics.
Nat Commun. 2025 Feb 20;16(1):1793. doi: 10.1038/s41467-025-57117-w.
5
Detecting Clinically Relevant Topological Structures in Multiplexed Spatial Proteomics Imaging Using TopKAT.
bioRxiv. 2024 Dec 21:2024.12.18.628976. doi: 10.1101/2024.12.18.628976.
6
Kernel machine tests of association using extrinsic and intrinsic cluster evaluation metrics.
PLoS Comput Biol. 2024 Nov 11;20(11):e1012524. doi: 10.1371/journal.pcbi.1012524. eCollection 2024 Nov.
7
Integrating Multimodal Neuroimaging and Genetics: A Structurally-Linked Sparse Canonical Correlation Analysis Approach.
IEEE J Transl Eng Health Med. 2024 Sep 19;12:659-667. doi: 10.1109/JTEHM.2024.3463720. eCollection 2024.
8
Kernel-based hierarchical structural component models for pathway analysis on survival phenotype.
Genes Genomics. 2024 Dec;46(12):1415-1421. doi: 10.1007/s13258-024-01569-9. Epub 2024 Sep 26.
9
Analysis of Microbiome Data.
Annu Rev Stat Appl. 2024 Apr;11(1):483-504. doi: 10.1146/annurev-statistics-040522-120734. Epub 2023 Oct 13.
10
BayesKAT: bayesian optimal kernel-based test for genetic association studies reveals joint genetic effects in complex diseases.
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae182.

References

1
Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.
Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50. doi: 10.1073/pnas.0506580102. Epub 2005 Sep 30.
2
Testing association of a pathway with survival using gene expression data.
Bioinformatics. 2005 May 1;21(9):1950-7. doi: 10.1093/bioinformatics/bti267. Epub 2005 Jan 18.
3
Random effects selection in linear mixed models.
Biometrics. 2003 Dec;59(4):762-9. doi: 10.1111/j.0006-341x.2003.00089.x.
4
Comment on " 'Stemness': transcriptional profiling of embryonic and adult stem cells" and "a stem cell molecular signature".
Science. 2003 Oct 17;302(5644):393; author reply 393. doi: 10.1126/science.1086384.
5
Hypothesis testing in semiparametric additive mixed models.
Biostatistics. 2003 Jan;4(1):57-74. doi: 10.1093/biostatistics/4.1.57.
6
PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes.
Nat Genet. 2003 Jul;34(3):267-73. doi: 10.1038/ng1180.
7
Delineation of prognostic biomarkers in prostate cancer.
Nature. 2001 Aug 23;412(6849):822-6. doi: 10.1038/35090585.
8
Significance analysis of microarrays applied to the ionizing radiation response.
Proc Natl Acad Sci U S A. 2001 Apr 24;98(9):5116-21. doi: 10.1073/pnas.091062498. Epub 2001 Apr 17.
9
Random-effects models for longitudinal data.
Biometrics. 1982 Dec;38(4):963-74.