• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

2型糖尿病病例对照研究中高维单核苷酸多态性数据的基于网络的正则化方法

Network-based regularization for high dimensional SNP data in the case-control study of Type 2 diabetes.

作者信息

Ren Jie, He Tao, Li Ye, Liu Sai, Du Yinhao, Jiang Yu, Wu Cen

机构信息

Department of Statistics, Kansas State University, 1116 Mid-Campus Drive N., 66506, Manhattan, KS, USA.

Department of Mathematics, San Francisco State University, San Francisco, CA, USA.

出版信息

BMC Genet. 2017 May 16;18(1):44. doi: 10.1186/s12863-017-0495-5.

DOI:10.1186/s12863-017-0495-5
PMID:28511641
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5434559/
Abstract

BACKGROUND

Over the past decades, the prevalence of type 2 diabetes mellitus (T2D) has been steadily increasing around the world. Despite large efforts devoted to better understand the genetic basis of the disease, the identified susceptibility loci can only account for a small portion of the T2D heritability. Some of the existing approaches proposed for the high dimensional genetic data from the T2D case-control study are limited by analyzing a few number of SNPs at a time from a large pool of SNPs, by ignoring the correlations among SNPs and by adopting inefficient selection techniques.

METHODS

We propose a network constrained regularization method to select important SNPs by taking the linkage disequilibrium into account. To accomodate the case control study, an iteratively reweighted least square algorithm has been developed within the coordinate descent framework where optimization of the regularized logistic loss function is performed with respect to one parameter at a time and iteratively cycle through all the parameters until convergence.

RESULTS

In this article, a novel approach is developed to identify important SNPs more effectively through incorporating the interconnections among them in the regularized selection. A coordinate descent based iteratively reweighed least squares (IRLS) algorithm has been proposed.

CONCLUSIONS

Both the simulation study and the analysis of the Nurses's Health Study, a case-control study of type 2 diabetes data with high dimensional SNP measurements, demonstrate the advantage of the network based approach over the competing alternatives.

摘要

背景

在过去几十年中,2型糖尿病(T2D)在全球的患病率一直在稳步上升。尽管人们付出了巨大努力来更好地理解该疾病的遗传基础,但已确定的易感基因座仅占T2D遗传力的一小部分。一些针对T2D病例对照研究的高维遗传数据提出的现有方法存在局限性,例如一次仅从大量单核苷酸多态性(SNP)中分析少数几个SNP、忽略SNP之间的相关性以及采用低效的选择技术。

方法

我们提出一种网络约束正则化方法,通过考虑连锁不平衡来选择重要的SNP。为适应病例对照研究,在坐标下降框架内开发了一种迭代加权最小二乘算法,其中正则化逻辑损失函数的优化是一次针对一个参数进行,并对所有参数进行迭代循环直至收敛。

结果

在本文中,开发了一种新颖的方法,通过在正则化选择中纳入SNP之间的相互联系来更有效地识别重要的SNP。提出了一种基于坐标下降的迭代加权最小二乘(IRLS)算法。

结论

模拟研究以及对护士健康研究(一项具有高维SNP测量的2型糖尿病数据的病例对照研究)的分析均表明,基于网络的方法优于其他竞争方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe6a/5434559/556634d9a600/12863_2017_495_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe6a/5434559/e04e572c4f82/12863_2017_495_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe6a/5434559/0ec8e0756e84/12863_2017_495_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe6a/5434559/8985c8d9427f/12863_2017_495_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe6a/5434559/7d32e6059c04/12863_2017_495_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe6a/5434559/556634d9a600/12863_2017_495_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe6a/5434559/e04e572c4f82/12863_2017_495_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe6a/5434559/0ec8e0756e84/12863_2017_495_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe6a/5434559/8985c8d9427f/12863_2017_495_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe6a/5434559/7d32e6059c04/12863_2017_495_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe6a/5434559/556634d9a600/12863_2017_495_Fig5_HTML.jpg

相似文献

1
Network-based regularization for high dimensional SNP data in the case-control study of Type 2 diabetes.2型糖尿病病例对照研究中高维单核苷酸多态性数据的基于网络的正则化方法
BMC Genet. 2017 May 16;18(1):44. doi: 10.1186/s12863-017-0495-5.
2
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
3
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
4
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
5
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
7
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
8
Interventions for central serous chorioretinopathy: a network meta-analysis.中心性浆液性脉络膜视网膜病变的干预措施:一项网状Meta分析
Cochrane Database Syst Rev. 2025 Jun 16;6(6):CD011841. doi: 10.1002/14651858.CD011841.pub3.
9
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
10
The effectiveness and cost-effectiveness of carmustine implants and temozolomide for the treatment of newly diagnosed high-grade glioma: a systematic review and economic evaluation.卡莫司汀植入剂与替莫唑胺治疗新诊断的高级别胶质瘤的有效性和成本效益:一项系统评价与经济学评估
Health Technol Assess. 2007 Nov;11(45):iii-iv, ix-221. doi: 10.3310/hta11450.

引用本文的文献

1
Towards the genome-scale discovery of bivariate monotonic classifiers.迈向双变量单调分类器的全基因组规模发现。
BMC Bioinformatics. 2025 Sep 2;26(1):228. doi: 10.1186/s12859-025-06253-7.
2
TransGeneSelector: using a transformer approach to mine key genes from small transcriptomic datasets in plant responses to various environments.转基因选择器:利用一种Transformer方法从小型转录组数据集中挖掘植物对各种环境响应中的关键基因。
BMC Genomics. 2025 Mar 17;26(1):259. doi: 10.1186/s12864-025-11434-y.
3
A prognostic signature for lung adenocarcinoma by five genes associated with chemotherapy in lung adenocarcinoma.

本文引用的文献

1
The Great Migration and African-American Genomic Diversity.大迁徙与非裔美国人的基因组多样性
PLoS Genet. 2016 May 27;12(5):e1006059. doi: 10.1371/journal.pgen.1006059. eCollection 2016 May.
2
2HiGWAS: a unifying high-dimensional platform to infer the global genetic architecture of trait development.2HiGWAS:一个用于推断性状发育的全球遗传结构的统一高维平台。
Brief Bioinform. 2015 Nov;16(6):905-11. doi: 10.1093/bib/bbv002. Epub 2015 Feb 19.
3
Integrative analysis of gene-environment interactions under a multi-response partially linear varying coefficient model.
通过与肺腺癌化疗相关的五个基因构建肺腺癌的预后特征。
Clin Respir J. 2023 Dec;17(12):1349-1360. doi: 10.1111/crj.13723. Epub 2023 Dec 10.
4
Identification of ferroptosis-related genes as potential diagnostic biomarkers for diabetic nephropathy based on bioinformatics.基于生物信息学鉴定铁死亡相关基因作为糖尿病肾病潜在的诊断生物标志物
Front Mol Biosci. 2023 Aug 1;10:1183530. doi: 10.3389/fmolb.2023.1183530. eCollection 2023.
5
Springer: An R package for bi-level variable selection of high-dimensional longitudinal data.施普林格:用于高维纵向数据双层变量选择的R包。
Front Genet. 2023 Apr 6;14:1088223. doi: 10.3389/fgene.2023.1088223. eCollection 2023.
6
Integrating Multi-Omics Data for Gene-Environment Interactions.整合多组学数据以研究基因-环境相互作用
BioTech (Basel). 2021 Jan 29;10(1):3. doi: 10.3390/biotech10010003.
7
Sparse group variable selection for gene-environment interactions in the longitudinal study.稀疏群组变量选择在纵向研究中的基因-环境交互作用。
Genet Epidemiol. 2022 Jul;46(5-6):317-340. doi: 10.1002/gepi.22461. Epub 2022 Jun 29.
8
Interep: An R Package for High-Dimensional Interaction Analysis of the Repeated Measurement Data.Interep:一个用于重复测量数据高维交互分析的R软件包。
Genes (Basel). 2022 Mar 19;13(3):544. doi: 10.3390/genes13030544.
9
Computational Modeling of Gene-Specific Transcriptional Repression, Activation and Chromatin Interactions in Leukemogenesis by LASSO-Regularized Logistic Regression.通过 LASSO 正则化逻辑回归对白血病发生过程中基因特异性转录抑制、激活和染色质相互作用的计算建模。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2109-2122. doi: 10.1109/TCBB.2021.3078128. Epub 2021 Dec 8.
10
A maximum flow-based network approach for identification of stable noncoding biomarkers associated with the multigenic neurological condition, autism.一种基于最大流的网络方法,用于识别与多基因神经疾病——自闭症相关的稳定非编码生物标志物。
BioData Min. 2021 May 3;14(1):28. doi: 10.1186/s13040-021-00262-x.
多响应部分线性可变系数模型下基因-环境相互作用的综合分析
Stat Med. 2014 Dec 10;33(28):4988-98. doi: 10.1002/sim.6287. Epub 2014 Aug 21.
4
A novel method for identifying nonlinear gene-environment interactions in case-control association studies.一种在病例对照关联研究中识别非线性基因-环境相互作用的新方法。
Hum Genet. 2013 Dec;132(12):1413-25. doi: 10.1007/s00439-013-1350-z. Epub 2013 Aug 24.
5
Haplotype association analysis of genes within the WNT signalling pathways in diabetic nephropathy.WNT 信号通路相关基因单体型与糖尿病肾病的关联分析。
BMC Nephrol. 2013 Jun 18;14:126. doi: 10.1186/1471-2369-14-126.
6
Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes.大规模的关联分析为 2 型糖尿病的遗传结构和病理生理学提供了深入了解。
Nat Genet. 2012 Sep;44(9):981-90. doi: 10.1038/ng.2383. Epub 2012 Aug 12.
7
The Sparse Laplacian Shrinkage Estimator for High-Dimensional Regression.用于高维回归的稀疏拉普拉斯收缩估计器
Ann Stat. 2011;39(4):2021-2046. doi: 10.1214/11-aos897.
8
DNA methylation profiling identifies epigenetic differences between diabetes patients with ESRD and diabetes patients without nephropathy.DNA 甲基化分析鉴定出 ESRD 糖尿病患者和无肾病糖尿病患者之间的表观遗传差异。
Epigenetics. 2011 Jan;6(1):20-8. doi: 10.4161/epi.6.1.13362. Epub 2011 Jan 1.
9
The chaperone proteins HSP70, HSP40/DnaJ and GRP78/BiP suppress misfolding and formation of β-sheet-containing aggregates by human amylin: a potential role for defective chaperone biology in Type 2 diabetes.伴侣蛋白 HSP70、HSP40/DnaJ 和 GRP78/BiP 抑制人胰岛淀粉样多肽的错误折叠和形成β-折叠含有的聚集体:2 型糖尿病中伴侣蛋白生物学缺陷的潜在作用。
Biochem J. 2010 Nov 15;432(1):113-21. doi: 10.1042/BJ20100434.
10
The Gene, Environment Association Studies consortium (GENEVA): maximizing the knowledge obtained from GWAS by collaboration across studies of multiple conditions.基因-环境关联研究联盟(GENEVA):通过跨多种疾病研究的合作,最大化从 GWAS 中获得的知识。
Genet Epidemiol. 2010 May;34(4):364-72. doi: 10.1002/gepi.20492.