• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于绘制基因型-表型动态图谱的多模态学习

Multimodal Learning for Mapping the Genotype-Phenotype Dynamics.

作者信息

Khodaee Farhan, Zandie Rohola, Edelman Elazer R

机构信息

Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, 02139, MA, USA.

Department of Medicine (Cardiovascular Medicine), Brigham and Women's Hospital, Boston, 02115, MA, USA.

出版信息

Res Sq. 2024 May 16:rs.3.rs-4355413. doi: 10.21203/rs.3.rs-4355413/v1.

DOI:10.21203/rs.3.rs-4355413/v1
PMID:38798675
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11118704/
Abstract

How complex phenotypes emerge from intricate gene expression patterns is a fundamental question in biology. Quantitative characterization of this relationship, however, is challenging due to the vast combinatorial possibilities and dynamic interplay between genotype and phenotype landscapes. Integrating high-content genotyping approaches such as single-cell RNA sequencing and advanced learning methods such as language models offers an opportunity for dissecting this complex relationship. Here, we present a computational integrated genetics framework designed to analyze and interpret the high-dimensional landscape of genotypes and their associated phenotypes simultaneously. We applied this approach to develop a multimodal foundation model to explore the genotype-phenotype relationship manifold for human transcriptomics at the cellular level. Analyzing this joint manifold showed a refined resolution of cellular heterogeneity, enhanced precision in phenotype annotating, and uncovered potential cross-tissue biomarkers that are undetectable through conventional gene expression analysis alone. Moreover, our results revealed that the gene networks are characterized by scale-free patterns and show context-dependent gene-gene interactions, both of which result in significant variations in the topology of the gene network, particularly evident during aging. Finally, utilizing contextualized embeddings, we investigated gene polyfunctionality which illustrates the multifaceted roles that genes play in different biological processes, and demonstrated that for VWF gene in endothelial cells. Overall, this study advances our understanding of the dynamic interplay between gene expression and phenotypic manifestation and demonstrates the potential of integrated genetics in uncovering new dimensions of cellular function and complexity.

摘要

复杂的表型如何从错综复杂的基因表达模式中产生,这是生物学中的一个基本问题。然而,由于基因型和表型格局之间存在巨大的组合可能性和动态相互作用,对这种关系进行定量表征具有挑战性。整合诸如单细胞RNA测序等高内涵基因分型方法和诸如语言模型等先进学习方法,为剖析这种复杂关系提供了一个契机。在此,我们提出了一个计算整合遗传学框架,旨在同时分析和解释基因型及其相关表型的高维格局。我们应用这种方法开发了一个多模态基础模型,以在细胞水平上探索人类转录组学的基因型-表型关系流形。对这个联合流形的分析显示了细胞异质性的精细分辨率、表型注释的更高精度,并发现了仅通过传统基因表达分析无法检测到的潜在跨组织生物标志物。此外,我们的结果表明,基因网络具有无标度模式的特征,并表现出依赖于上下文的基因-基因相互作用,这两者都会导致基因网络拓扑结构的显著变化,在衰老过程中尤为明显。最后,利用上下文嵌入,我们研究了基因多功能性,它说明了基因在不同生物过程中所起的多方面作用,并以内皮细胞中的VWF基因为例进行了证明。总体而言,这项研究推进了我们对基因表达与表型表现之间动态相互作用的理解,并证明了整合遗传学在揭示细胞功能和复杂性新维度方面的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/06e17968fef5/nihpp-rs4355413v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/25439090a564/nihpp-rs4355413v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/43e9bdf47552/nihpp-rs4355413v1-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/7ab8edc6c4a9/nihpp-rs4355413v1-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/d2310c3ac3ed/nihpp-rs4355413v1-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/86c0e24db9ab/nihpp-rs4355413v1-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/439689965ee2/nihpp-rs4355413v1-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/406a694cc8e3/nihpp-rs4355413v1-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/b428cda87e73/nihpp-rs4355413v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/f1985d5a3b7c/nihpp-rs4355413v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/4fb475ed89a6/nihpp-rs4355413v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/15c46a93fc79/nihpp-rs4355413v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/901b1f8b8e5d/nihpp-rs4355413v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/06e17968fef5/nihpp-rs4355413v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/25439090a564/nihpp-rs4355413v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/43e9bdf47552/nihpp-rs4355413v1-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/7ab8edc6c4a9/nihpp-rs4355413v1-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/d2310c3ac3ed/nihpp-rs4355413v1-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/86c0e24db9ab/nihpp-rs4355413v1-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/439689965ee2/nihpp-rs4355413v1-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/406a694cc8e3/nihpp-rs4355413v1-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/b428cda87e73/nihpp-rs4355413v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/f1985d5a3b7c/nihpp-rs4355413v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/4fb475ed89a6/nihpp-rs4355413v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/15c46a93fc79/nihpp-rs4355413v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/901b1f8b8e5d/nihpp-rs4355413v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6655/11118704/06e17968fef5/nihpp-rs4355413v1-f0006.jpg

相似文献

1
Multimodal Learning for Mapping the Genotype-Phenotype Dynamics.用于绘制基因型-表型动态图谱的多模态学习
Res Sq. 2024 May 16:rs.3.rs-4355413. doi: 10.21203/rs.3.rs-4355413/v1.
2
Multimodal learning for mapping genotype-phenotype dynamics.用于绘制基因型-表型动态图谱的多模态学习
Nat Comput Sci. 2025 Apr;5(4):333-344. doi: 10.1038/s43588-024-00765-7. Epub 2025 Jan 28.
3
DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype-phenotype prediction.DeepGAMI:基于生物学的深度辅助学习的多模态整合与插补方法,以提高基因型-表型预测。
Genome Med. 2023 Oct 31;15(1):88. doi: 10.1186/s13073-023-01248-6.
4
SGTB: A graph representation learning model combining transformer and BERT for optimizing gene expression analysis in spatial transcriptomics data.SGTB:一种结合Transformer和BERT的图表示学习模型,用于优化空间转录组学数据中的基因表达分析。
Comput Biol Chem. 2025 Apr 21;118:108482. doi: 10.1016/j.compbiolchem.2025.108482.
5
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
6
Erratum: High-Throughput Identification of Resistance to Pseudomonas syringae pv. Tomato in Tomato using Seedling Flood Assay.勘误:利用幼苗浸没法高通量鉴定番茄对丁香假单胞菌 pv.番茄的抗性。
J Vis Exp. 2023 Oct 18(200). doi: 10.3791/6576.
7
Computational solutions for spatial transcriptomics.空间转录组学的计算解决方案。
Comput Struct Biotechnol J. 2022 Sep 1;20:4870-4884. doi: 10.1016/j.csbj.2022.08.043. eCollection 2022.
8
Mapping information-rich genotype-phenotype landscapes with genome-scale Perturb-seq.利用全基因组 Perturb-seq 技术绘制富含信息的基因型-表型图谱。
Cell. 2022 Jul 7;185(14):2559-2575.e28. doi: 10.1016/j.cell.2022.05.013. Epub 2022 Jun 9.
9
Foodomics as part of the host-microbiota-exposome interplay.食物组学作为宿主-微生物群-暴露组相互作用的一部分。
J Proteomics. 2016 Sep 16;147:3-20. doi: 10.1016/j.jprot.2016.04.033. Epub 2016 Apr 26.
10
Analysis of Single-Cell Gene Pair Coexpression Landscapes by Stochastic Kinetic Modeling Reveals Gene-Pair Interactions in Development.通过随机动力学建模分析单细胞基因对共表达景观揭示发育中的基因对相互作用
Front Genet. 2020 Jan 31;10:1387. doi: 10.3389/fgene.2019.01387. eCollection 2019.

本文引用的文献

1
scGPT: toward building a foundation model for single-cell multi-omics using generative AI.scGPT:迈向使用生成式人工智能构建单细胞多组学基础模型
Nat Methods. 2024 Aug;21(8):1470-1480. doi: 10.1038/s41592-024-02201-0. Epub 2024 Feb 26.
2
Inhibition of MEG3 ameliorates cardiomyocyte apoptosis and autophagy by regulating the expression of miRNA-129-5p in a mouse model of heart failure.抑制 MEG3 通过调节心力衰竭小鼠模型中 miRNA-129-5p 的表达来改善心肌细胞凋亡和自噬。
Redox Rep. 2023 Dec;28(1):2224607. doi: 10.1080/13510002.2023.2224607.
3
Transfer learning enables predictions in network biology.
迁移学习可实现网络生物学预测。
Nature. 2023 Jun;618(7965):616-624. doi: 10.1038/s41586-023-06139-9. Epub 2023 May 31.
4
Multimodal machine learning in precision health: A scoping review.精准健康中的多模态机器学习:一项范围综述。
NPJ Digit Med. 2022 Nov 7;5(1):171. doi: 10.1038/s41746-022-00712-8.
5
Mapping information-rich genotype-phenotype landscapes with genome-scale Perturb-seq.利用全基因组 Perturb-seq 技术绘制富含信息的基因型-表型图谱。
Cell. 2022 Jul 7;185(14):2559-2575.e28. doi: 10.1016/j.cell.2022.05.013. Epub 2022 Jun 9.
6
The Tabula Sapiens: A multiple-organ, single-cell transcriptomic atlas of humans.智慧人图谱:人类多器官单细胞转录组图谱。
Science. 2022 May 13;376(6594):eabl4896. doi: 10.1126/science.abl4896.
7
The tongue features associated with chronic kidney disease.与慢性肾脏病相关的舌象特征。
Medicine (Baltimore). 2021 Mar 5;100(9):e25037. doi: 10.1097/MD.0000000000025037.
8
Exploring genetic interaction manifolds constructed from rich single-cell phenotypes.探索由丰富的单细胞表型构建的遗传相互作用流形。
Science. 2019 Aug 23;365(6455):786-793. doi: 10.1126/science.aax4438. Epub 2019 Aug 8.
9
Innate lymphoid cells support regulatory T cells in the intestine through interleukin-2.先天淋巴细胞通过白细胞介素-2支持肠道中的调节性 T 细胞。
Nature. 2019 Apr;568(7752):405-409. doi: 10.1038/s41586-019-1082-x. Epub 2019 Apr 3.
10
STAT3-induced upregulation of lncRNA MEG3 regulates the growth of cardiac hypertrophy through miR-361-5p/HDAC9 axis.STAT3 诱导的长链非编码 RNA MEG3 通过 miR-361-5p/HDAC9 轴调节心肌肥厚的生长。
Sci Rep. 2019 Jan 24;9(1):460. doi: 10.1038/s41598-018-36369-1.