• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞 RNA 测序数据中基因互作网络的收缩估计。

Shrinkage estimation of gene interaction networks in single-cell RNA sequencing data.

机构信息

Computer Science Research Centre, University of Surrey, Guildford, UK.

出版信息

BMC Bioinformatics. 2024 Oct 26;25(1):339. doi: 10.1186/s12859-024-05946-9.

DOI:10.1186/s12859-024-05946-9
PMID:39462345
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11515282/
Abstract

BACKGROUND

Gene interaction networks are graphs in which nodes represent genes and edges represent functional interactions between them. These interactions can be at multiple levels, for instance, gene regulation, protein-protein interaction, or metabolic pathways. To analyse gene interaction networks at a large scale, gene co-expression network analysis is often applied on high-throughput gene expression data such as RNA sequencing data. With the advance in sequencing technology, expression of genes can be measured in individual cells. Single-cell RNA sequencing (scRNAseq) provides insights of cellular development, differentiation and characteristics at the transcriptomic level. High sparsity and high-dimensional data structures pose challenges in scRNAseq data analysis.

RESULTS

In this study, a sparse inverse covariance matrix estimation framework for scRNAseq data is developed to capture direct functional interactions between genes. Comparative analyses highlight high performance and fast computation of Stein-type shrinkage in high-dimensional data using simulated scRNAseq data. Data transformation approaches also show improvement in performance of shrinkage methods in non-Gaussian distributed data. Zero-inflated modelling of scRNAseq data based on a negative binomial distribution enhances shrinkage performance in zero-inflated data without interference on non zero-inflated count data.

CONCLUSION

The proposed framework broadens application of graphical model in scRNAseq analysis with flexibility in sparsity of count data resulting from dropout events, high performance, and fast computational time. Implementation of the framework is in a reproducible Snakemake workflow https://github.com/calathea24/ZINBGraphicalModel and R package ZINBStein https://github.com/calathea24/ZINBStein .

摘要

背景

基因交互网络是一种图,其中节点表示基因,边表示它们之间的功能交互。这些交互可以在多个层次上进行,例如基因调控、蛋白质-蛋白质相互作用或代谢途径。为了在大规模上分析基因交互网络,通常在高通量基因表达数据(例如 RNA 测序数据)上应用基因共表达网络分析。随着测序技术的进步,可以在单个细胞中测量基因的表达。单细胞 RNA 测序 (scRNAseq) 提供了细胞发育、分化和转录组水平特征的见解。高稀疏性和高维数据结构给 scRNAseq 数据分析带来了挑战。

结果

本研究开发了一种用于 scRNAseq 数据的稀疏逆协方差矩阵估计框架,以捕获基因之间的直接功能交互。比较分析突出了使用模拟 scRNAseq 数据在高维数据中使用 Stein 型收缩的高性能和快速计算。数据转换方法还表明,在非高斯分布数据中,收缩方法的性能在数据转换后得到了提高。基于负二项分布的 scRNAseq 数据的零膨胀建模增强了零膨胀数据中收缩性能,而不会干扰非零膨胀计数数据。

结论

该框架拓宽了图形模型在 scRNAseq 分析中的应用,具有灵活性,可以处理由于辍学事件导致的计数数据的稀疏性,具有高性能和快速计算时间。该框架的实现是在可重复的 Snakemake 工作流程 https://github.com/calathea24/ZINBGraphicalModel 和 R 包 ZINBStein https://github.com/calathea24/ZINBStein 中进行的。

相似文献

1
Shrinkage estimation of gene interaction networks in single-cell RNA sequencing data.单细胞 RNA 测序数据中基因互作网络的收缩估计。
BMC Bioinformatics. 2024 Oct 26;25(1):339. doi: 10.1186/s12859-024-05946-9.
2
Gene regulatory network reconstruction using single-cell RNA sequencing of barcoded genotypes in diverse environments.使用在不同环境中进行条码基因型单细胞 RNA 测序进行基因调控网络重建。
Elife. 2020 Jan 27;9:e51254. doi: 10.7554/eLife.51254.
3
SFINN: inferring gene regulatory network from single-cell and spatial transcriptomic data with shared factor neighborhood and integrated neural network.SFINN:利用共享因子邻域和集成神经网络从单细胞和空间转录组数据推断基因调控网络。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae433.
4
C-ziptf: stable tensor factorization for zero-inflated multi-dimensional genomics data.C-ziptf:用于零膨胀多维基因组学数据的稳定张量分解。
BMC Bioinformatics. 2024 Oct 5;25(1):323. doi: 10.1186/s12859-024-05886-4.
5
SigEMD: A powerful method for differential gene expression analysis in single-cell RNA sequencing data.SigEMD:一种用于单细胞 RNA 测序数据分析中差异基因表达分析的强大方法。
Methods. 2018 Aug 1;145:25-32. doi: 10.1016/j.ymeth.2018.04.017. Epub 2018 Apr 24.
6
A fast and efficient count-based matrix factorization method for detecting cell types from single-cell RNAseq data.一种基于计数的快速高效矩阵分解方法,用于从单细胞RNA测序数据中检测细胞类型。
BMC Syst Biol. 2019 Apr 5;13(Suppl 2):28. doi: 10.1186/s12918-019-0699-6.
7
Joint gene network construction by single-cell RNA sequencing data.基于单细胞 RNA 测序数据的基因共网络构建
Biometrics. 2023 Jun;79(2):915-925. doi: 10.1111/biom.13645. Epub 2022 Apr 15.
8
Leveraging gene correlations in single cell transcriptomic data.利用单细胞转录组数据中的基因相关性。
BMC Bioinformatics. 2024 Sep 18;25(1):305. doi: 10.1186/s12859-024-05926-z.
9
Predicting gene regulatory links from single-cell RNA-seq data using graph neural networks.利用图神经网络从单细胞 RNA-seq 数据中预测基因调控关系。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad414.
10
Comparative analysis of differential gene expression analysis tools for single-cell RNA sequencing data.单细胞 RNA 测序数据分析工具的差异基因表达分析比较。
BMC Bioinformatics. 2019 Jan 18;20(1):40. doi: 10.1186/s12859-019-2599-6.

引用本文的文献

1
Utilizing RNA-seq data in monotone iterative generalized linear model to elevate prior knowledge quality of the circRNA-miRNA-mRNA regulatory axis.在单调迭代广义线性模型中利用RNA测序数据提高环状RNA-微小RNA-信使RNA调控轴的先验知识质量。
BMC Bioinformatics. 2025 May 27;26(1):139. doi: 10.1186/s12859-025-06161-w.

本文引用的文献

1
GeneNetTools: tests for Gaussian graphical models with shrinkage.GeneNetTools:具有收缩的高斯图形模型检验。
Bioinformatics. 2022 Nov 15;38(22):5049-5054. doi: 10.1093/bioinformatics/btac657.
2
TFLink: an integrated gateway to access transcription factor-target gene interactions for multiple species.TFLink:一个集成的网关,用于访问多个物种的转录因子-靶基因相互作用。
Database (Oxford). 2022 Sep 16;2022. doi: 10.1093/database/baac083.
3
Distance correlation application to gene co-expression network analysis.距离相关系数在基因共表达网络分析中的应用。
BMC Bioinformatics. 2022 Feb 21;23(1):81. doi: 10.1186/s12859-022-04609-x.
4
Statistics or biology: the zero-inflation controversy about scRNA-seq data.统计学还是生物学:关于 scRNA-seq 数据的零膨胀争议。
Genome Biol. 2022 Jan 21;23(1):31. doi: 10.1186/s13059-022-02601-5.
5
Constructing local cell-specific networks from single-cell data.从单细胞数据构建局部细胞特异性网络。
Proc Natl Acad Sci U S A. 2021 Dec 21;118(51). doi: 10.1073/pnas.2113178118.
6
scLink: Inferring Sparse Gene Co-expression Networks from Single-cell Expression Data.scLink:从单细胞表达数据推断稀疏基因共表达网络。
Genomics Proteomics Bioinformatics. 2021 Jun;19(3):475-492. doi: 10.1016/j.gpb.2020.11.006. Epub 2021 Jul 10.
7
Separating measurement and expression models clarifies confusion in single-cell RNA sequencing analysis.分离测量和表达模型可澄清单细胞 RNA 测序分析中的混淆。
Nat Genet. 2021 Jun;53(6):770-777. doi: 10.1038/s41588-021-00873-4. Epub 2021 May 24.
8
ESCO: single cell expression simulation incorporating gene co-expression.ESCO:整合基因共表达的单细胞表达模拟
Bioinformatics. 2021 Aug 25;37(16):2374-2381. doi: 10.1093/bioinformatics/btab116.
9
The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets.2021 年的 STRING 数据库:可定制的蛋白质-蛋白质网络,以及用户上传的基因/测量集的功能特征分析。
Nucleic Acids Res. 2021 Jan 8;49(D1):D605-D612. doi: 10.1093/nar/gkaa1074.
10
The BioGRID database: A comprehensive biomedical resource of curated protein, genetic, and chemical interactions.The BioGRID 数据库:一个经过精心整理的生物医学资源,包含蛋白质、遗传和化学相互作用。
Protein Sci. 2021 Jan;30(1):187-200. doi: 10.1002/pro.3978. Epub 2020 Nov 23.