• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于单细胞多组学整合的无监督神经网络(UMINT):在健康与疾病中的应用

Unsupervised neural network for single cell Multi-omics INTegration (UMINT): an application to health and disease.

作者信息

Maitra Chayan, Seal Dibyendu B, Das Vivek, De Rajat K

机构信息

Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India.

Tatras Data Services Pvt. Ltd., New Delhi, India.

出版信息

Front Mol Biosci. 2023 May 24;10:1184748. doi: 10.3389/fmolb.2023.1184748. eCollection 2023.

DOI:10.3389/fmolb.2023.1184748
PMID:37293552
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10244650/
Abstract

Multi-omics studies have enabled us to understand the mechanistic drivers behind complex disease states and progressions, thereby providing novel and actionable biological insights into health status. However, integrating data from multiple modalities is challenging due to high dimensionality and diverse nature of data, and noise associated with each platform. Sparsity in data, non-overlapping features and technical batch effects make the task of learning more complicated. Conventional machine learning (ML) tools are not quite effective against such data integration hazards due to their simplistic nature with less capacity. In addition, existing methods for single cell multi-omics integration are computationally expensive. Therefore, in this work, we have introduced a novel Unsupervised neural network for single cell Multi-omics INTegration (UMINT). UMINT serves as a promising model for integrating variable number of single cell omics layers with high dimensions. It has a light-weight architecture with substantially reduced number of parameters. The proposed model is capable of learning a latent low-dimensional embedding that can extract useful features from the data facilitating further downstream analyses. UMINT has been applied to integrate healthy and disease CITE-seq (paired RNA and surface proteins) datasets including a rare disease Mucosa-Associated Lymphoid Tissue (MALT) tumor. It has been benchmarked against existing state-of-the-art methods for single cell multi-omics integration. Furthermore, UMINT is capable of integrating paired single cell gene expression and ATAC-seq (Transposase-Accessible Chromatin) assays as well.

摘要

多组学研究使我们能够理解复杂疾病状态和进展背后的机制驱动因素,从而为健康状况提供新的、可付诸行动的生物学见解。然而,由于数据的高维度、多样性质以及与每个平台相关的噪声,整合来自多种模态的数据具有挑战性。数据的稀疏性、不重叠特征和技术批次效应使得学习任务更加复杂。传统的机器学习(ML)工具由于其简单的性质和较小的容量,对这种数据整合风险不太有效。此外,现有的单细胞多组学整合方法计算成本高昂。因此,在这项工作中,我们引入了一种用于单细胞多组学整合(UMINT)的新型无监督神经网络。UMINT作为一种有前途的模型,用于整合可变数量的高维单细胞组学层。它具有轻量级架构,参数数量大幅减少。所提出的模型能够学习潜在的低维嵌入,从数据中提取有用特征,便于进一步的下游分析。UMINT已应用于整合健康和疾病的CITE-seq(配对的RNA和表面蛋白)数据集,包括一种罕见疾病黏膜相关淋巴组织(MALT)肿瘤。它已与现有的单细胞多组学整合的先进方法进行了基准测试。此外,UMINT还能够整合配对的单细胞基因表达和ATAC-seq(转座酶可及染色质)分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/986bcfa24882/fmolb-10-1184748-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/a3e781bc74d4/fmolb-10-1184748-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/43c6a35ee27c/fmolb-10-1184748-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/a7d3fe265394/fmolb-10-1184748-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/60204d158abd/fmolb-10-1184748-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/359cdd411ac6/fmolb-10-1184748-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/0dafda9dd2f1/fmolb-10-1184748-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/5ec0e89427ab/fmolb-10-1184748-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/dc129bddb2b5/fmolb-10-1184748-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/12054ef65d7e/fmolb-10-1184748-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/986bcfa24882/fmolb-10-1184748-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/a3e781bc74d4/fmolb-10-1184748-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/43c6a35ee27c/fmolb-10-1184748-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/a7d3fe265394/fmolb-10-1184748-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/60204d158abd/fmolb-10-1184748-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/359cdd411ac6/fmolb-10-1184748-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/0dafda9dd2f1/fmolb-10-1184748-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/5ec0e89427ab/fmolb-10-1184748-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/dc129bddb2b5/fmolb-10-1184748-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/12054ef65d7e/fmolb-10-1184748-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/88ad/10244650/986bcfa24882/fmolb-10-1184748-g010.jpg

相似文献

1
Unsupervised neural network for single cell Multi-omics INTegration (UMINT): an application to health and disease.用于单细胞多组学整合的无监督神经网络(UMINT):在健康与疾病中的应用
Front Mol Biosci. 2023 May 24;10:1184748. doi: 10.3389/fmolb.2023.1184748. eCollection 2023.
2
Multimodal deep learning approaches for single-cell multi-omics data integration.多模态深度学习方法在单细胞多组学数据整合中的应用。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad313.
3
A Cell Cycle-Aware Network for Data Integration and Label Transferring of Single-Cell RNA-Seq and ATAC-Seq.一种细胞周期感知网络,用于整合单细胞 RNA-Seq 和 ATAC-Seq 数据并进行标签转移。
Adv Sci (Weinh). 2024 Aug;11(31):e2401815. doi: 10.1002/advs.202401815. Epub 2024 Jun 17.
4
A Cell Cycle-aware Network for Data Integration and Label Transferring of Single-cell RNA-seq and ATAC-seq.一种用于单细胞RNA测序和转座酶可及染色质测序数据整合与标签转移的细胞周期感知网络。
bioRxiv. 2024 Feb 2:2024.01.31.578213. doi: 10.1101/2024.01.31.578213.
5
MODILM: towards better complex diseases classification using a novel multi-omics data integration learning model.MODILM:一种利用新型多组学数据整合学习模型改善复杂疾病分类的方法。
BMC Med Inform Decis Mak. 2023 May 5;23(1):82. doi: 10.1186/s12911-023-02173-9.
6
Single-cell multi-omics integration for unpaired data by a siamese network with graph-based contrastive loss.基于图对比损失的孪生网络对非配对数据进行单细胞多组学整合。
BMC Bioinformatics. 2023 Jan 4;24(1):5. doi: 10.1186/s12859-022-05126-7.
7
scAWMV: an adaptively weighted multi-view learning framework for the integrative analysis of parallel scRNA-seq and scATAC-seq data.scAWMV:一种自适应加权多视图学习框架,用于平行 scRNA-seq 和 scATAC-seq 数据的综合分析。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac739.
8
A multi-use deep learning method for CITE-seq and single-cell RNA-seq data integration with cell surface protein prediction and imputation.一种用于CITE-seq和单细胞RNA-seq数据整合以及细胞表面蛋白预测与插补的多用途深度学习方法。
Nat Mach Intell. 2022 Nov;4(11):940-952. doi: 10.1038/s42256-022-00545-w. Epub 2022 Oct 27.
9
moSCminer: a cell subtype classification framework based on the attention neural network integrating the single-cell multi-omics dataset on the cloud.moSCminer:一种基于注意力神经网络的细胞亚型分类框架,集成了基于云的单细胞多组学数据集。
PeerJ. 2024 Feb 26;12:e17006. doi: 10.7717/peerj.17006. eCollection 2024.
10
Estimating gene expression from DNA methylation and copy number variation: A deep learning regression model for multi-omics integration.从 DNA 甲基化和拷贝数变异估计基因表达:一种用于多组学整合的深度学习回归模型。
Genomics. 2020 Jul;112(4):2833-2841. doi: 10.1016/j.ygeno.2020.03.021. Epub 2020 Mar 29.

引用本文的文献

1
A Map of the Lipid-Metabolite-Protein Network to Aid Multi-Omics Integration.脂质-代谢物-蛋白质网络图谱助力多组学整合
Biomolecules. 2025 Mar 26;15(4):484. doi: 10.3390/biom15040484.
2
Data stewardship and curation practices in AI-based genomics and automated microscopy image analysis for high-throughput screening studies: promoting robust and ethical AI applications.基于人工智能的基因组学和用于高通量筛选研究的自动显微镜图像分析中的数据管理与整理实践:推动可靠且符合伦理的人工智能应用。
Hum Genomics. 2025 Feb 23;19(1):16. doi: 10.1186/s40246-025-00716-x.
3
Bootstrap Evaluation of Association Matrices (BEAM) for Integrating Multiple Omics Profiles with Multiple Outcomes.

本文引用的文献

1
SIMBA: single-cell embedding along with features.SIMBA:单细胞特征嵌入。
Nat Methods. 2024 Jun;21(6):1003-1013. doi: 10.1038/s41592-023-01899-8. Epub 2023 May 29.
2
scMoC: single-cell multi-omics clustering.scMoC:单细胞多组学聚类
Bioinform Adv. 2022 Feb 15;2(1):vbac011. doi: 10.1093/bioadv/vbac011. eCollection 2022.
3
Computational Methods for Single-cell Multi-omics Integration and Alignment.单细胞多组学整合与对齐的计算方法。
用于整合具有多个结果的多个组学概况的关联矩阵的自助评估(BEAM)
bioRxiv. 2024 Aug 3:2024.07.31.605805. doi: 10.1101/2024.07.31.605805.
4
Multi-task learning from multimodal single-cell omics with Matilda.多模态单细胞组学的 Matilda 多任务学习。
Nucleic Acids Res. 2023 May 8;51(8):e45. doi: 10.1093/nar/gkad157.
Genomics Proteomics Bioinformatics. 2022 Oct;20(5):836-849. doi: 10.1016/j.gpb.2022.11.013. Epub 2022 Dec 26.
4
Deep learning for rare disease: A scoping review.深度学习在罕见病中的应用:一项范围综述。
J Biomed Inform. 2022 Nov;135:104227. doi: 10.1016/j.jbi.2022.104227. Epub 2022 Oct 17.
5
Multi-omics single-cell data integration and regulatory inference with graph-linked embedding.基于图链接嵌入的多组学单细胞数据整合与调控推断。
Nat Biotechnol. 2022 Oct;40(10):1458-1466. doi: 10.1038/s41587-022-01284-4. Epub 2022 May 2.
6
Improved integration of single-cell transcriptome and surface protein expression by LinQ-View.通过 LinQ-View 提高单细胞转录组和表面蛋白表达的整合。
Cell Rep Methods. 2021 Jul 23;1(4):100056. doi: 10.1016/j.crmeth.2021.100056. eCollection 2021 Aug 23.
7
UINMF performs mosaic integration of single-cell multi-omic datasets using nonnegative matrix factorization.UINMF 通过非负矩阵分解对单细胞多组学数据集进行镶嵌式整合。
Nat Commun. 2022 Feb 9;13(1):780. doi: 10.1038/s41467-022-28431-4.
8
MUON: multimodal omics analysis framework.MUON:多模态组学分析框架。
Genome Biol. 2022 Feb 1;23(1):42. doi: 10.1186/s13059-021-02577-8.
9
Statistics or biology: the zero-inflation controversy about scRNA-seq data.统计学还是生物学:关于 scRNA-seq 数据的零膨胀争议。
Genome Biol. 2022 Jan 21;23(1):31. doi: 10.1186/s13059-022-02601-5.
10
scJoint integrates atlas-scale single-cell RNA-seq and ATAC-seq data with transfer learning.scJoint 集成了图谱尺度单细胞 RNA-seq 和 ATAC-seq 数据,并结合了迁移学习。
Nat Biotechnol. 2022 May;40(5):703-710. doi: 10.1038/s41587-021-01161-6. Epub 2022 Jan 20.