• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过对比学习进行表型亚型分析。

Phenotypic subtyping via contrastive learning.

作者信息

Gorla Aditya, Sankararaman Sriram, Burchard Esteban, Flint Jonathan, Zaitlen Noah, Rahmani Elior

机构信息

Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA.

Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA.

出版信息

bioRxiv. 2023 Jan 6:2023.01.05.522921. doi: 10.1101/2023.01.05.522921.

DOI:10.1101/2023.01.05.522921
PMID:36711575
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9881932/
Abstract

Defining and accounting for subphenotypic structure has the potential to increase statistical power and provide a deeper understanding of the heterogeneity in the molecular basis of complex disease. Existing phenotype subtyping methods primarily rely on clinically observed heterogeneity or metadata clustering. However, they generally tend to capture the dominant sources of variation in the data, which often originate from variation that is not descriptive of the mechanistic heterogeneity of the phenotype of interest; in fact, such dominant sources of variation, such as population structure or technical variation, are, in general, expected to be independent of subphenotypic structure. We instead aim to find a subspace with signal that is unique to a group of samples for which we believe that subphenotypic variation exists (e.g., cases of a disease). To that end, we introduce Phenotype Aware Components Analysis (PACA), a contrastive learning approach leveraging canonical correlation analysis to robustly capture weak sources of subphenotypic variation. In the context of disease, PACA learns a gradient of variation unique to cases in a given dataset, while leveraging control samples for accounting for variation and imbalances of biological and technical confounders between cases and controls. We evaluated PACA using an extensive simulation study, as well as on various subtyping tasks using genotypes, transcriptomics, and DNA methylation data. Our results provide multiple strong evidence that PACA allows us to robustly capture weak unknown variation of interest while being calibrated and well-powered, far superseding the performance of alternative methods. This renders PACA as a state-of-the-art tool for defining subtypes that are more likely to reflect molecular heterogeneity, especially in challenging cases where the phenotypic heterogeneity may be masked by a myriad of strong unrelated effects in the data.

摘要

定义和考量亚表型结构有可能提高统计功效,并能更深入地理解复杂疾病分子基础中的异质性。现有的表型分型方法主要依赖临床观察到的异质性或元数据聚类。然而,它们通常倾向于捕捉数据中的主要变异来源,而这些来源往往并非源自对感兴趣表型的机制异质性具有描述性的变异;事实上,诸如群体结构或技术变异等主要变异来源,通常预期与亚表型结构无关。相反,我们旨在找到一个具有信号的子空间,该子空间对于我们认为存在亚表型变异的一组样本(例如某种疾病的病例)而言是独特的。为此,我们引入了表型感知成分分析(PACA),这是一种对比学习方法,利用典型相关分析来稳健地捕捉亚表型变异的微弱来源。在疾病背景下,PACA学习给定数据集中病例所特有的变异梯度,同时利用对照样本考量病例与对照之间生物学和技术混杂因素的变异及不平衡情况。我们使用广泛的模拟研究以及基于基因型、转录组学和DNA甲基化数据的各种分型任务对PACA进行了评估。我们的结果提供了多个有力证据,表明PACA能够在经过校准且功效良好的情况下,稳健地捕捉感兴趣的未知微弱变异,远远超越了其他方法的性能。这使得PACA成为定义更有可能反映分子异质性的亚型的先进工具,尤其是在具有挑战性的情况下,即表型异质性可能被数据中众多强烈的无关效应所掩盖。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c776/9881932/b69fbdb0a195/nihpp-2023.01.05.522921v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c776/9881932/b0198ff9a425/nihpp-2023.01.05.522921v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c776/9881932/b311391eb29a/nihpp-2023.01.05.522921v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c776/9881932/b69fbdb0a195/nihpp-2023.01.05.522921v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c776/9881932/b0198ff9a425/nihpp-2023.01.05.522921v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c776/9881932/b311391eb29a/nihpp-2023.01.05.522921v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c776/9881932/b69fbdb0a195/nihpp-2023.01.05.522921v1-f0005.jpg

相似文献

1
Phenotypic subtyping via contrastive learning.通过对比学习进行表型亚型分析。
bioRxiv. 2023 Jan 6:2023.01.05.522921. doi: 10.1101/2023.01.05.522921.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Erratum: High-Throughput Identification of Resistance to Pseudomonas syringae pv. Tomato in Tomato using Seedling Flood Assay.勘误:利用幼苗浸没法高通量鉴定番茄对丁香假单胞菌 pv.番茄的抗性。
J Vis Exp. 2023 Oct 18(200). doi: 10.3791/6576.
4
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
5
Epigenetic patient stratification via contrastive machine learning refines hallmark biomarkers in minoritized children with asthma.通过对比机器学习进行表观遗传学患者分层可优化哮喘少数族裔儿童的标志性生物标志物。
Res Sq. 2024 Sep 13:rs.3.rs-5066762. doi: 10.21203/rs.3.rs-5066762/v1.
6
Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and PM2.5 in the eastern United States.美国东部地区遥感气溶胶光学厚度与PM2.5之间关系的评估及统计建模
Res Rep Health Eff Inst. 2012 May(167):5-83; discussion 85-91.
7
Qualitative Study定性研究
8
Impact of summer programmes on the outcomes of disadvantaged or 'at risk' young people: A systematic review.暑期项目对处境不利或“有风险”的年轻人的影响:一项系统综述。
Campbell Syst Rev. 2024 Jun 13;20(2):e1406. doi: 10.1002/cl2.1406. eCollection 2024 Jun.
9
SpaGIC: graph-informed clustering in spatial transcriptomics via self-supervised contrastive learning.SpaGIC:基于自监督对比学习的空间转录组学图信息聚类。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae578.
10
Reducing annotation burden in MR: A novel MR-contrast guided contrastive learning approach for image segmentation.减少磁共振成像中的标注负担:一种新的基于磁共振对比引导的对比学习方法用于图像分割。
Med Phys. 2024 Apr;51(4):2707-2720. doi: 10.1002/mp.16820. Epub 2023 Nov 13.

引用本文的文献

1
Using Alternative Definitions of Controls to Increase Statistical Power in GWAS.利用对照的替代定义提高 GWAS 的统计功效。
Behav Genet. 2024 Jul;54(4):353-366. doi: 10.1007/s10519-024-10187-w. Epub 2024 Jun 13.