• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用套索回归在基因研究中检测预测性总体效应。

Using LASSO regression to detect predictive aggregate effects in genetic studies.

作者信息

Fontanarosa Joel B, Dai Yang

机构信息

Bioinformatics Program, Department of Bioengineering (MC 063), University of Illinois at Chicago, 851 S, Morgan Street, 218 SEO, Chicago, IL 60607-7052, USA.

出版信息

BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S69. doi: 10.1186/1753-6561-5-S9-S69.

DOI:10.1186/1753-6561-5-S9-S69
PMID:22373537
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3287908/
Abstract

We use least absolute shrinkage and selection operator (LASSO) regression to select genetic markers and phenotypic features that are most informative with respect to a trait of interest. We compare several strategies for applying LASSO methods in risk prediction models, using the Genetic Analysis Workshop 17 exome simulation data consisting of 697 individuals with information on genotypic and phenotypic features (smoking, age, sex) in 5-fold cross-validated fashion. The cross-validated averages of the area under the receiver operating curve range from 0.45 to 0.63 for different strategies using only genotypic markers. The same values are improved to 0.69-0.87 when both genotypic and phenotypic information are used. The ability of the LASSO method to find true causal markers is limited, but the method was able to discover several common variants (e.g., FLT1) under certain conditions.

摘要

我们使用最小绝对收缩和选择算子(LASSO)回归来选择与感兴趣的性状最相关的遗传标记和表型特征。我们使用遗传分析研讨会17的外显子组模拟数据(包含697名个体的基因型和表型特征信息,如吸烟、年龄、性别),以5折交叉验证的方式比较了在风险预测模型中应用LASSO方法的几种策略。仅使用基因型标记的不同策略,其受试者工作特征曲线下面积的交叉验证平均值在0.45至0.63之间。当同时使用基因型和表型信息时,相同的值提高到了0.69 - 0.87。LASSO方法找到真正因果标记的能力有限,但该方法在某些条件下能够发现一些常见变异(如FLT1)。

相似文献

1
Using LASSO regression to detect predictive aggregate effects in genetic studies.使用套索回归在基因研究中检测预测性总体效应。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S69. doi: 10.1186/1753-6561-5-S9-S69.
2
Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions.使用正则化线性回归模型的基因组选择:岭回归、套索回归、弹性网络及其扩展。
BMC Proc. 2012 May 21;6 Suppl 2(Suppl 2):S10. doi: 10.1186/1753-6561-6-S2-S10.
3
Least absolute shrinkage and selection operator type methods for the identification of serum biomarkers of overweight and obesity: simulation and application.用于识别超重和肥胖血清生物标志物的最小绝对收缩和选择算子类型方法:模拟与应用
BMC Med Res Methodol. 2016 Nov 14;16(1):154. doi: 10.1186/s12874-016-0254-8.
4
Comparison of three statistical approaches for feature selection for fine-scale genetic population assignment in four pig breeds.四种猪品种中用于精细尺度遗传群体分配的特征选择的三种统计方法的比较。
Trop Anim Health Prod. 2021 Jul 10;53(3):395. doi: 10.1007/s11250-021-02824-x.
5
Resampling procedures to identify important SNPs using a consensus approach.使用共识方法识别重要单核苷酸多态性(SNP)的重采样程序。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S59. doi: 10.1186/1753-6561-5-S9-S59.
6
Preventing Heterotopic Ossification in Combat Casualties-Which Models Are Best Suited for Clinical Use?预防战斗伤员的异位骨化——哪些模型最适合临床应用?
Clin Orthop Relat Res. 2015 Sep;473(9):2807-13. doi: 10.1007/s11999-015-4302-1.
7
Evaluation of a LASSO regression approach on the unrelated samples of Genetic Analysis Workshop 17.对遗传分析研讨会17的非相关样本进行套索回归方法评估。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S12. doi: 10.1186/1753-6561-5-S9-S12.
8
Predicting radiation-induced valvular heart damage.预测辐射诱发的心脏瓣膜损伤。
Acta Oncol. 2015 Nov;54(10):1796-804. doi: 10.3109/0284186X.2015.1016624. Epub 2015 Mar 24.
9
Identification of clinically relevant features in hypertensive patients using penalized regression: a case study of cardiovascular events.使用惩罚回归识别高血压患者的临床相关特征:心血管事件的案例研究。
Med Biol Eng Comput. 2019 Sep;57(9):2011-2026. doi: 10.1007/s11517-019-02007-9. Epub 2019 Jul 25.
10
Comparison between the stochastic search variable selection and the least absolute shrinkage and selection operator for genome-wide association studies of rheumatoid arthritis.类风湿关节炎全基因组关联研究中随机搜索变量选择法与最小绝对收缩与选择算子法的比较
BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S21. doi: 10.1186/1753-6561-3-S7-S21.

引用本文的文献

1
Integration of autophagy-related genes and immune dysregulation reveals a prognostic landscape in multiple myeloma.自噬相关基因与免疫失调的整合揭示了多发性骨髓瘤的预后格局。
Front Oncol. 2025 Sep 17;15:1635596. doi: 10.3389/fonc.2025.1635596. eCollection 2025.
2
What matters most? Identifying key resource gain and loss factors affecting nurses' work engagement and job burnout in digital healthcare contexts.最重要的是什么?识别影响数字医疗环境中护士工作投入和职业倦怠的关键资源获取和损失因素。
BMC Nurs. 2025 Jul 29;24(1):985. doi: 10.1186/s12912-025-03586-9.
3
Prognostic Value of Ubiquitination-Related Genes in Ovarian Cancer and Their Correlation With Tumor Immunity.泛素化相关基因在卵巢癌中的预后价值及其与肿瘤免疫的相关性
Hum Mutat. 2025 Jul 15;2025:8369299. doi: 10.1155/humu/8369299. eCollection 2025.
4
A nomogram for predicting poor sleep quality in patients with systemic lupus erythematosus.一种用于预测系统性红斑狼疮患者睡眠质量差的列线图。
Front Neurol. 2025 May 21;16:1562949. doi: 10.3389/fneur.2025.1562949. eCollection 2025.
5
Development and preliminary validation of five miRNAs for lung adenocarcinoma prognostic model associated with immune infiltration.与免疫浸润相关的肺腺癌预后模型的5种微小RNA的开发及初步验证
Sci Rep. 2025 Jan 2;15(1):528. doi: 10.1038/s41598-024-84128-2.
6
A telomere-related gene risk model for predicting prognosis and treatment response in acute myeloid leukemia.一种用于预测急性髓系白血病预后和治疗反应的端粒相关基因风险模型。
Heliyon. 2024 May 22;10(11):e31705. doi: 10.1016/j.heliyon.2024.e31705. eCollection 2024 Jun 15.
7
A novel NET-related gene signature for predicting DLBCL prognosis.一种用于预测弥漫性大 B 细胞淋巴瘤预后的新型 NET 相关基因特征。
J Transl Med. 2023 Sep 16;21(1):630. doi: 10.1186/s12967-023-04494-9.
8
Machine Learning Model Based on Insulin Resistance Metagenes Underpins Genetic Basis of Type 2 Diabetes.基于胰岛素抵抗宏基因组的机器学习模型揭示了 2 型糖尿病的遗传基础。
Biomolecules. 2023 Feb 24;13(3):432. doi: 10.3390/biom13030432.
9
Establishment and validation of a carbohydrate metabolism-related gene signature for prognostic model and immune response in acute myeloid leukemia.建立和验证一个与碳水化合物代谢相关的基因特征,用于急性髓系白血病的预后模型和免疫反应。
Front Immunol. 2022 Dec 5;13:1038570. doi: 10.3389/fimmu.2022.1038570. eCollection 2022.
10
A novel signature derived from metabolism-related genes GPT and SMS to predict prognosis of laryngeal squamous cell carcinoma.一种源自代谢相关基因GPT和SMS的新型标志物,用于预测喉鳞状细胞癌的预后。
Cancer Cell Int. 2022 Jul 8;22(1):226. doi: 10.1186/s12935-022-02647-2.

本文引用的文献

1
Identification of multiple rare variants associated with a disease.与一种疾病相关的多个罕见变异的鉴定。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S103. doi: 10.1186/1753-6561-5-S9-S103.
2
A LASSO-based approach to analyzing rare variants in genetic association studies.一种基于套索算法的方法用于分析基因关联研究中的罕见变异。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S100. doi: 10.1186/1753-6561-5-S9-S100.
3
Genetic Analysis Workshop 17 mini-exome simulation.遗传分析研讨会17小型外显子模拟
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S2. doi: 10.1186/1753-6561-5-S9-S2.
4
Brief review of regression-based and machine learning methods in genetic epidemiology: the Genetic Analysis Workshop 17 experience.基于回归和机器学习方法在遗传流行病学中的简要综述:遗传分析研讨会 17 的经验。
Genet Epidemiol. 2011;35 Suppl 1(Suppl 1):S5-11. doi: 10.1002/gepi.20642.
5
Risk prediction using genome-wide association studies.基于全基因组关联研究的风险预测。
Genet Epidemiol. 2010 Nov;34(7):643-52. doi: 10.1002/gepi.20509.
6
Regularization Paths for Generalized Linear Models via Coordinate Descent.基于坐标下降法的广义线性模型正则化路径
J Stat Softw. 2010;33(1):1-22.
7
Association screening of common and rare genetic variants by penalized regression.通过惩罚回归进行常见和罕见遗传变异的关联筛选。
Bioinformatics. 2010 Oct 1;26(19):2375-82. doi: 10.1093/bioinformatics/btq448. Epub 2010 Aug 6.
8
Biological, clinical and population relevance of 95 loci for blood lipids.95 个与血脂相关的生物学、临床和人群相关性位点。
Nature. 2010 Aug 5;466(7307):707-13. doi: 10.1038/nature09270.
9
Uncovering the roles of rare variants in common disease through whole-genome sequencing.通过全基因组测序揭示常见疾病中罕见变异的作用。
Nat Rev Genet. 2010 Jun;11(6):415-25. doi: 10.1038/nrg2779.
10
Machine learning in genome-wide association studies.全基因组关联研究中的机器学习。
Genet Epidemiol. 2009;33 Suppl 1:S51-7. doi: 10.1002/gepi.20473.