• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多响应模型的同步参数学习与双聚类

Simultaneous Parameter Learning and Bi-clustering for Multi-Response Models.

作者信息

Yu Ming, Natesan Ramamurthy Karthikeyan, Thompson Addie, Lozano Aurélie C

机构信息

Booth School of Business, The University of Chicago, Chicago, IL, United States.

IBM Research, Yorktown Heights, NY, United States.

出版信息

Front Big Data. 2019 Aug 14;2:27. doi: 10.3389/fdata.2019.00027. eCollection 2019.

DOI:10.3389/fdata.2019.00027
PMID:33693350
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7931892/
Abstract

We consider multi-response and multi-task regression models, where the parameter matrix to be estimated is expected to have an unknown grouping structure. The groupings can be along tasks, or features, or both, the last one indicating a bi-cluster or "checkerboard" structure. Discovering this grouping structure along with parameter inference makes sense in several applications, such as multi-response Genome-Wide Association Studies (GWAS). By inferring this additional structure we can obtain valuable information on the underlying data mechanisms (e.g., relationships among genotypes and phenotypes in GWAS). In this paper, we propose two formulations to simultaneously learn the parameter matrix and its group structures, based on convex regularization penalties. We present optimization approaches to solve the resulting problems and provide numerical convergence guarantees. Extensive experiments demonstrate much better clustering quality compared to other methods, and our approaches are also validated on real datasets concerning phenotypes and genotypes of plant varieties.

摘要

我们考虑多响应和多任务回归模型,其中待估计的参数矩阵预计具有未知的分组结构。分组可以沿着任务、特征或两者进行,最后一种情况表示双聚类或“棋盘”结构。在诸如多响应全基因组关联研究(GWAS)等多种应用中,发现这种分组结构并进行参数推断是有意义的。通过推断这种额外的结构,我们可以获得关于潜在数据机制的有价值信息(例如,GWAS中基因型和表型之间的关系)。在本文中,我们基于凸正则化惩罚提出了两种公式,以同时学习参数矩阵及其分组结构。我们提出了优化方法来解决由此产生的问题,并提供数值收敛保证。大量实验表明,与其他方法相比,我们的聚类质量要好得多,并且我们的方法也在关于植物品种表型和基因型的真实数据集上得到了验证。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/948df95ae4c9/fdata-02-00027-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/d10ab48c1076/fdata-02-00027-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/beb0998aae1e/fdata-02-00027-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/4e31c15e02ee/fdata-02-00027-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/11fba1109aae/fdata-02-00027-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/cb9e30652fd1/fdata-02-00027-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/db24c3864621/fdata-02-00027-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/6e0519a65450/fdata-02-00027-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/fdac9056a592/fdata-02-00027-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/948df95ae4c9/fdata-02-00027-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/d10ab48c1076/fdata-02-00027-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/beb0998aae1e/fdata-02-00027-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/4e31c15e02ee/fdata-02-00027-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/11fba1109aae/fdata-02-00027-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/cb9e30652fd1/fdata-02-00027-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/db24c3864621/fdata-02-00027-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/6e0519a65450/fdata-02-00027-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/fdac9056a592/fdata-02-00027-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2b8/7931892/948df95ae4c9/fdata-02-00027-g0009.jpg

相似文献

1
Simultaneous Parameter Learning and Bi-clustering for Multi-Response Models.多响应模型的同步参数学习与双聚类
Front Big Data. 2019 Aug 14;2:27. doi: 10.3389/fdata.2019.00027. eCollection 2019.
2
Convex Discriminative Multitask Clustering.凸判别式多任务聚类。
IEEE Trans Pattern Anal Mach Intell. 2015 Jan;37(1):28-40. doi: 10.1109/TPAMI.2014.2343221.
3
Hierarchical Clustering Multi-Task Learning for Joint Human Action Grouping and Recognition.层次聚类多任务学习用于联合人体动作分组和识别。
IEEE Trans Pattern Anal Mach Intell. 2017 Jan;39(1):102-114. doi: 10.1109/TPAMI.2016.2537337. Epub 2016 Mar 2.
4
Multi-Stage Multi-Task Feature Learning.多阶段多任务特征学习
Adv Neural Inf Process Syst. 2013 Oct;14:2979-3010.
5
Efficient multi-task learning with adaptive temporal structure for progression prediction.用于进展预测的具有自适应时间结构的高效多任务学习。
Neural Comput Appl. 2023 May 10:1-16. doi: 10.1007/s00521-023-08461-9.
6
Multitask spectral clustering by exploring intertask correlation.通过探索任务间相关性实现多任务谱聚类。
IEEE Trans Cybern. 2015 May;45(5):1069-80. doi: 10.1109/TCYB.2014.2344015. Epub 2014 Sep 18.
7
Adaptive Weighted Graph Fusion Incomplete Multi-View Subspace Clustering.自适应加权图融合不完全多视图子空间聚类
Sensors (Basel). 2020 Oct 10;20(20):5755. doi: 10.3390/s20205755.
8
Network inference with ensembles of bi-clustering trees.基于二部聚类树集成的网络推断。
BMC Bioinformatics. 2019 Oct 28;20(1):525. doi: 10.1186/s12859-019-3104-y.
9
Multilinear Multitask Learning by Rank-Product Regularization.基于秩积正则化的多线性多任务学习
IEEE Trans Neural Netw Learn Syst. 2020 Apr;31(4):1336-1350. doi: 10.1109/TNNLS.2019.2919774. Epub 2019 Jun 24.
10
Multi-view projected clustering with graph learning.基于图学习的多视图投影聚类。
Neural Netw. 2020 Jun;126:335-346. doi: 10.1016/j.neunet.2020.03.020. Epub 2020 Apr 1.

引用本文的文献

1
Regularized multi-trait multi-locus linear mixed models for genome-wide association studies and genomic selection in crops.作物全基因组关联研究和基因组选择的正则化多性状多基因座线性混合模型。
BMC Bioinformatics. 2023 Oct 26;24(1):399. doi: 10.1186/s12859-023-05519-2.

本文引用的文献

1
Sorghum Dw2 Encodes a Protein Kinase Regulator of Stem Internode Length.高粱 Dw2 编码一个调控茎节间长度的蛋白激酶调节剂。
Sci Rep. 2017 Jul 4;7(1):4616. doi: 10.1038/s41598-017-04609-5.
2
Network Lasso: Clustering and Optimization in Large Graphs.网络套索:大型图中的聚类与优化
KDD. 2015 Aug;2015:387-396. doi: 10.1145/2783258.2783313.
3
Convex biclustering.凸双聚类
Biometrics. 2017 Mar;73(1):10-19. doi: 10.1111/biom.12540. Epub 2016 May 10.
4
Identification of Dw1, a Regulator of Sorghum Stem Internode Length.高粱茎节间长度调控因子Dw1的鉴定
PLoS One. 2016 Mar 10;11(3):e0151271. doi: 10.1371/journal.pone.0151271. eCollection 2016.
5
Genome-wide association analysis for multiple continuous secondary phenotypes.全基因组关联分析多个连续次要表型。
Am J Hum Genet. 2013 May 2;92(5):744-59. doi: 10.1016/j.ajhg.2013.04.004.
6
Loss of an MDR transporter in compact stalks of maize br2 and sorghum dw3 mutants.玉米br2和高粱dw3突变体紧密茎杆中多药耐药转运蛋白的缺失。
Science. 2003 Oct 3;302(5642):81-4. doi: 10.1126/science.1086072.