• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

FAVA:从 scRNA-seq 和蛋白质组学数据中推断出的高质量功能关联网络。

FAVA: high-quality functional association networks inferred from scRNA-seq and proteomics data.

机构信息

Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen N, Denmark.

VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium.

出版信息

Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae010.

DOI:10.1093/bioinformatics/btae010
PMID:38192003
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10868155/
Abstract

MOTIVATION

Protein networks are commonly used for understanding how proteins interact. However, they are typically biased by data availability, favoring well-studied proteins with more interactions. To uncover functions of understudied proteins, we must use data that are not affected by this literature bias, such as single-cell RNA-seq and proteomics. Due to data sparseness and redundancy, functional association analysis becomes complex.

RESULTS

To address this, we have developed FAVA (Functional Associations using Variational Autoencoders), which compresses high-dimensional data into a low-dimensional space. FAVA infers networks from high-dimensional omics data with much higher accuracy than existing methods, across a diverse collection of real as well as simulated datasets. FAVA can process large datasets with over 0.5 million conditions and has predicted 4210 interactions between 1039 understudied proteins. Our findings showcase FAVA's capability to offer novel perspectives on protein interactions. FAVA functions within the scverse ecosystem, employing AnnData as its input source.

AVAILABILITY AND IMPLEMENTATION

Source code, documentation, and tutorials for FAVA are accessible on GitHub at https://github.com/mikelkou/fava. FAVA can also be installed and used via pip/PyPI as well as via the scverse ecosystem https://github.com/scverse/ecosystem-packages/tree/main/packages/favapy.

摘要

动机

蛋白质网络常用于理解蛋白质之间的相互作用。然而,它们通常受到数据可用性的影响,偏向于研究较多、相互作用较多的蛋白质。为了揭示研究较少的蛋白质的功能,我们必须使用不受文献偏差影响的数据,如单细胞 RNA-seq 和蛋白质组学。由于数据稀疏和冗余,功能关联分析变得复杂。

结果

为了解决这个问题,我们开发了 FAVA(使用变分自动编码器的功能关联),它将高维数据压缩到低维空间。FAVA 比现有方法从高维组学数据中推断网络的准确性要高得多,涵盖了真实和模拟数据集的多样化集合。FAVA 可以处理超过 0.5 万个条件的大型数据集,并预测了 1039 个研究较少的蛋白质之间的 4210 个相互作用。我们的研究结果展示了 FAVA 提供蛋白质相互作用新视角的能力。FAVA 在 scverse 生态系统中运行,使用 AnnData 作为其输入源。

可用性和实现

FAVA 的源代码、文档和教程可在 GitHub 上获得,网址为 https://github.com/mikelkou/fava。FAVA 也可以通过 pip/PyPI以及通过 scverse 生态系统 https://github.com/scverse/ecosystem-packages/tree/main/packages/favapy 进行安装和使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c4d/10868155/194021e67383/btae010f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c4d/10868155/8dac9af2db85/btae010f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c4d/10868155/3871b8e73ef9/btae010f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c4d/10868155/9ea1c5a581c8/btae010f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c4d/10868155/5af91b40990e/btae010f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c4d/10868155/194021e67383/btae010f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c4d/10868155/8dac9af2db85/btae010f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c4d/10868155/3871b8e73ef9/btae010f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c4d/10868155/9ea1c5a581c8/btae010f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c4d/10868155/5af91b40990e/btae010f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c4d/10868155/194021e67383/btae010f5.jpg

相似文献

1
FAVA: high-quality functional association networks inferred from scRNA-seq and proteomics data.FAVA:从 scRNA-seq 和蛋白质组学数据中推断出的高质量功能关联网络。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae010.
2
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
3
SPANN: annotating single-cell resolution spatial transcriptome data with scRNA-seq data.SPANN:利用单细胞RNA测序数据注释单细胞分辨率空间转录组数据。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbad533.
4
Clustering scRNA-seq data with the cross-view collaborative information fusion strategy.使用跨视图协同信息融合策略对 scRNA-seq 数据进行聚类。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae511.
5
ResPAN: a powerful batch correction model for scRNA-seq data through residual adversarial networks.ResPAN:通过残差对抗网络对 scRNA-seq 数据进行强大的批量校正模型。
Bioinformatics. 2022 Aug 10;38(16):3942-3949. doi: 10.1093/bioinformatics/btac427.
6
CIForm as a Transformer-based model for cell-type annotation of large-scale single-cell RNA-seq data.CIForm 作为一种基于 Transformer 的模型,用于大规模单细胞 RNA-seq 数据的细胞类型注释。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad195.
7
FlowGrid enables fast clustering of very large single-cell RNA-seq data.FlowGrid能够对非常大的单细胞RNA测序数据进行快速聚类。
Bioinformatics. 2021 Dec 22;38(1):282-283. doi: 10.1093/bioinformatics/btab521.
8
scMAGS: Marker gene selection from scRNA-seq data for spatial transcriptomics studies.scMAGS:从单细胞RNA测序数据中选择标记基因用于空间转录组学研究。
Comput Biol Med. 2023 Mar;155:106634. doi: 10.1016/j.compbiomed.2023.106634. Epub 2023 Feb 9.
9
Transfer learning for clustering single-cell RNA-seq data crossing-species and batch, case on uterine fibroids.跨物种和批次的单细胞 RNA-seq 数据聚类的迁移学习:以子宫肌瘤为例。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad426.
10
scAWMV: an adaptively weighted multi-view learning framework for the integrative analysis of parallel scRNA-seq and scATAC-seq data.scAWMV:一种自适应加权多视图学习框架,用于平行 scRNA-seq 和 scATAC-seq 数据的综合分析。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac739.

引用本文的文献

1
Recent Advances in Mass Spectrometry-Based Bottom-Up Proteomics.基于质谱的自下而上蛋白质组学的最新进展
Anal Chem. 2025 Mar 11;97(9):4728-4749. doi: 10.1021/acs.analchem.4c06750. Epub 2025 Feb 25.
2
The STRING database in 2025: protein networks with directionality of regulation.2025年的STRING数据库:具有调控方向性的蛋白质网络。
Nucleic Acids Res. 2025 Jan 6;53(D1):D730-D737. doi: 10.1093/nar/gkae1113.
3
Heterogeneous network approaches to protein pathway prediction.用于蛋白质通路预测的异构网络方法。

本文引用的文献

1
Massively parallel sample preparation for multiplexed single-cell proteomics using nPOP.使用nPOP进行多重单细胞蛋白质组学的大规模平行样本制备。
Nat Protoc. 2024 Dec;19(12):3750-3776. doi: 10.1038/s41596-024-01033-8. Epub 2024 Aug 8.
2
hdWGCNA identifies co-expression networks in high-dimensional transcriptomics data.hdWGCNA 鉴定高维转录组学数据中的共表达网络。
Cell Rep Methods. 2023 Jun 12;3(6):100498. doi: 10.1016/j.crmeth.2023.100498. eCollection 2023 Jun 26.
3
Prioritized mass spectrometry increases the depth, sensitivity and data completeness of single-cell proteomics.
Comput Struct Biotechnol J. 2024 Jun 27;23:2727-2739. doi: 10.1016/j.csbj.2024.06.022. eCollection 2024 Dec.
4
Multi-layered genetic approaches to identify approved drug targets.用于识别已批准药物靶点的多层基因方法。
Cell Genom. 2023 Jun 15;3(7):100341. doi: 10.1016/j.xgen.2023.100341. eCollection 2023 Jul 12.
5
Toward an Integrated Machine Learning Model of a Proteomics Experiment.迈向蛋白质组学实验的集成机器学习模型。
J Proteome Res. 2023 Mar 3;22(3):681-696. doi: 10.1021/acs.jproteome.2c00711. Epub 2023 Feb 6.
6
The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest.2023 年的 STRING 数据库:针对任何感兴趣的测序基因组的蛋白质-蛋白质关联网络和功能富集分析。
Nucleic Acids Res. 2023 Jan 6;51(D1):D638-D646. doi: 10.1093/nar/gkac1000.
优先质谱分析提高了单细胞蛋白质组学的深度、灵敏度和数据完整性。
Nat Methods. 2023 May;20(5):714-722. doi: 10.1038/s41592-023-01830-1. Epub 2023 Apr 3.
4
Recent advances in the field of single-cell proteomics.单细胞蛋白质组学领域的最新进展。
Transl Oncol. 2023 Jan;27:101556. doi: 10.1016/j.tranon.2022.101556. Epub 2022 Oct 19.
5
Understudied proteins: opportunities and challenges for functional proteomics.研究不足的蛋白质:功能蛋白质组学面临的机遇与挑战
Nat Methods. 2022 Jul;19(7):774-779. doi: 10.1038/s41592-022-01454-x.
6
An open invitation to the Understudied Proteins Initiative.向未充分研究蛋白质计划发出的公开邀请。
Nat Biotechnol. 2022 Jun;40(6):815-817. doi: 10.1038/s41587-022-01316-z.
7
Ultra-high sensitivity mass spectrometry quantifies single-cell proteome changes upon perturbation.超高灵敏度质谱定量分析扰动后单细胞蛋白质组的变化。
Mol Syst Biol. 2022 Mar;18(3):e10798. doi: 10.15252/msb.202110798.
8
The reactome pathway knowledgebase 2022.反应体通路知识库2022版。
Nucleic Acids Res. 2022 Jan 7;50(D1):D687-D692. doi: 10.1093/nar/gkab1028.
9
A single-cell type transcriptomics map of human tissues.人类组织单细胞转录组图谱。
Sci Adv. 2021 Jul 28;7(31). doi: 10.1126/sciadv.abh2169. Print 2021 Jul.
10
scLink: Inferring Sparse Gene Co-expression Networks from Single-cell Expression Data.scLink:从单细胞表达数据推断稀疏基因共表达网络。
Genomics Proteomics Bioinformatics. 2021 Jun;19(3):475-492. doi: 10.1016/j.gpb.2020.11.006. Epub 2021 Jul 10.