• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无标记数据情况下潜在结构化预测分数的排序与合并

Ranking and Combining Latent Structured Predictive Scores without Labeled Data.

作者信息

Afshar Shiva, Chen Yinghan, Han Shizhong, Lin Ying

机构信息

Department of Neurology, Emory University, Atlanta, GA, 30322, USA.

Department of Mathematics and Statistics, University of Nevada, Reno, NV, 89557, USA.

出版信息

IISE Trans. 2024 Dec 4. doi: 10.1080/24725854.2024.2417258.

DOI:10.1080/24725854.2024.2417258
PMID:40857441
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12345621/
Abstract

Combining multiple predictors obtained from distributed data sources to an accurate meta-learner is promising to achieve enhanced performance in lots of prediction problems. As the accuracy of each predictor is usually unknown, integrating the predictors to achieve better performance is challenging. Conventional ensemble learning methods assess the accuracy of predictors based on extensive labeled data. In practical applications, however, the acquisition of such labeled data can prove to be an arduous task. Furthermore, the predictors under consideration may exhibit high degrees of correlation, particularly when similar data sources or machine learning algorithms were employed during their model training. In response to these challenges, this paper introduces a novel structured unsupervised ensemble learning model (SUEL) to exploit the dependency between a set of predictors with continuous predictive scores, rank the predictors without labeled data and combine them to an ensembled score with weights. Two novel correlation-based decomposition algorithms are further proposed to estimate the SUEL model, constrained quadratic optimization (SUEL.CQO) and matrix-factorization-based (SUEL.MF) approaches. The efficacy of the proposed methods is rigorously assessed through both simulation studies and real-world application of risk genes discovery. The results compellingly demonstrate that the proposed methods can efficiently integrate the dependent predictors to an ensemble model without the need of ground truth data.

摘要

将从分布式数据源获得的多个预测器组合成一个准确的元学习器,有望在许多预测问题中实现更高的性能。由于每个预测器的准确性通常是未知的,因此将这些预测器集成以实现更好的性能具有挑战性。传统的集成学习方法基于大量的标记数据来评估预测器的准确性。然而,在实际应用中,获取此类标记数据可能是一项艰巨的任务。此外,所考虑的预测器可能表现出高度的相关性,特别是当在其模型训练期间采用了相似的数据源或机器学习算法时。针对这些挑战,本文引入了一种新颖的结构化无监督集成学习模型(SUEL),以利用具有连续预测分数的一组预测器之间的依赖性,在没有标记数据的情况下对预测器进行排序,并将它们组合成一个带有权重的集成分数。进一步提出了两种基于相关性的新颖分解算法来估计SUEL模型,即约束二次优化(SUEL.CQO)和基于矩阵分解的(SUEL.MF)方法。通过模拟研究和风险基因发现的实际应用,对所提出方法的有效性进行了严格评估。结果有力地证明,所提出的方法可以有效地将相关的预测器集成到一个集成模型中,而无需真实数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/684c6dc62df8/nihms-2031599-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/fc3d554c9317/nihms-2031599-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/81033e2e0f42/nihms-2031599-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/36494e3b1422/nihms-2031599-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/0275539fb61f/nihms-2031599-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/acb959e7505c/nihms-2031599-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/7c95a6c4699d/nihms-2031599-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/c0e18b733df7/nihms-2031599-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/a910e74ffc4f/nihms-2031599-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/ae023893acd9/nihms-2031599-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/1b3966353023/nihms-2031599-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/fdf47d3cb8b4/nihms-2031599-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/16487227d1bd/nihms-2031599-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/684c6dc62df8/nihms-2031599-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/fc3d554c9317/nihms-2031599-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/81033e2e0f42/nihms-2031599-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/36494e3b1422/nihms-2031599-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/0275539fb61f/nihms-2031599-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/acb959e7505c/nihms-2031599-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/7c95a6c4699d/nihms-2031599-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/c0e18b733df7/nihms-2031599-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/a910e74ffc4f/nihms-2031599-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/ae023893acd9/nihms-2031599-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/1b3966353023/nihms-2031599-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/fdf47d3cb8b4/nihms-2031599-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/16487227d1bd/nihms-2031599-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4eef/12345621/684c6dc62df8/nihms-2031599-f0013.jpg

相似文献

1
Ranking and Combining Latent Structured Predictive Scores without Labeled Data.无标记数据情况下潜在结构化预测分数的排序与合并
IISE Trans. 2024 Dec 4. doi: 10.1080/24725854.2024.2417258.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Short-Term Memory Impairment短期记忆障碍
6
Sexual Harassment and Prevention Training性骚扰与预防培训
7
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
8
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
9
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
10
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.

本文引用的文献

1
Gradient Boosted Trees for Spatial Data and Its Application to Medical Imaging Data.用于空间数据的梯度提升树及其在医学成像数据中的应用。
IISE Trans Healthc Syst Eng. 2022;12(3):165-179. doi: 10.1080/24725579.2021.1995536. Epub 2021 Nov 9.
2
Disease category-specific annotation of variants using an ensemble learning framework.基于集成学习框架的疾病类别特异性变异注释。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab438.
3
A Machine Learning Approach to Predicting Autism Risk Genes: Validation of Known Genes and Discovery of New Candidates.
一种预测自闭症风险基因的机器学习方法:已知基因的验证与新候选基因的发现
Front Genet. 2020 Sep 10;11:500064. doi: 10.3389/fgene.2020.500064. eCollection 2020.
4
Forecasting risk gene discovery in autism with machine learning and genome-scale data.利用机器学习和全基因组数据预测自闭症风险基因。
Sci Rep. 2020 Mar 12;10(1):4569. doi: 10.1038/s41598-020-61288-5.
5
Genomic Patterns of De Novo Mutation in Simplex Autism.单纯性自闭症的新生突变基因组模式
Cell. 2017 Oct 19;171(3):710-722.e12. doi: 10.1016/j.cell.2017.08.047. Epub 2017 Sep 28.
6
Polygenic transmission disequilibrium confirms that common and rare variation act additively to create risk for autism spectrum disorders.多基因传递不平衡证实,常见变异和罕见变异以累加方式作用,增加患自闭症谱系障碍的风险。
Nat Genet. 2017 Jul;49(7):978-985. doi: 10.1038/ng.3863. Epub 2017 May 15.
7
Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder.全基因组测序资源鉴定出18个自闭症谱系障碍的新候选基因。
Nat Neurosci. 2017 Apr;20(4):602-611. doi: 10.1038/nn.4524. Epub 2017 Mar 6.
8
A Cell Type-Specific Expression Signature Predicts Haploinsufficient Autism-Susceptibility Genes.一种细胞类型特异性表达特征可预测单倍剂量不足的自闭症易感基因。
Hum Mutat. 2017 Feb;38(2):204-215. doi: 10.1002/humu.23147. Epub 2016 Dec 5.
9
Genome-wide prediction and functional characterization of the genetic basis of autism spectrum disorder.自闭症谱系障碍遗传基础的全基因组预测与功能表征
Nat Neurosci. 2016 Nov;19(11):1454-1462. doi: 10.1038/nn.4353. Epub 2016 Aug 1.
10
A spectral approach integrating functional genomic annotations for coding and noncoding variants.一种整合编码和非编码变异功能基因组注释的光谱方法。
Nat Genet. 2016 Feb;48(2):214-20. doi: 10.1038/ng.3477. Epub 2016 Jan 4.