• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用单细胞转录组数据中的基因相关性。

Leveraging gene correlations in single cell transcriptomic data.

机构信息

Center for Complex Biological Systems, University of California, Irvine, Irvine, CA, USA.

Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA.

出版信息

BMC Bioinformatics. 2024 Sep 18;25(1):305. doi: 10.1186/s12859-024-05926-z.

DOI:10.1186/s12859-024-05926-z
PMID:39294560
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11411778/
Abstract

BACKGROUND

Many approaches have been developed to overcome technical noise in single cell RNA-sequencing (scRNAseq). As researchers dig deeper into data-looking for rare cell types, subtleties of cell states, and details of gene regulatory networks-there is a growing need for algorithms with controllable accuracy and fewer ad hoc parameters and thresholds. Impeding this goal is the fact that an appropriate null distribution for scRNAseq cannot simply be extracted from data in which ground truth about biological variation is unknown (i.e., usually).

RESULTS

We approach this problem analytically, assuming that scRNAseq data reflect only cell heterogeneity (what we seek to characterize), transcriptional noise (temporal fluctuations randomly distributed across cells), and sampling error (i.e., Poisson noise). We analyze scRNAseq data without normalization-a step that skews distributions, particularly for sparse data-and calculate p values associated with key statistics. We develop an improved method for selecting features for cell clustering and identifying gene-gene correlations, both positive and negative. Using simulated data, we show that this method, which we call BigSur (Basic Informatics and Gene Statistics from Unnormalized Reads), captures even weak yet significant correlation structures in scRNAseq data. Applying BigSur to data from a clonal human melanoma cell line, we identify thousands of correlations that, when clustered without supervision into gene communities, align with known cellular components and biological processes, and highlight potentially novel cell biological relationships.

CONCLUSIONS

New insights into functionally relevant gene regulatory networks can be obtained using a statistically grounded approach to the identification of gene-gene correlations.

摘要

背景

为了克服单细胞 RNA 测序(scRNAseq)中的技术噪声,已经开发了许多方法。随着研究人员深入挖掘数据——寻找罕见的细胞类型、细胞状态的细微差别和基因调控网络的细节——需要越来越多的算法,这些算法具有可控的准确性和较少的特殊参数和阈值。阻碍这一目标的是,由于关于生物变异的真实情况未知(即通常情况下),因此不能简单地从数据中提取 scRNAseq 的适当零分布。

结果

我们从分析的角度来解决这个问题,假设 scRNAseq 数据仅反映细胞异质性(我们试图描述的)、转录噪声(随机分布在细胞之间的时间波动)和抽样误差(即泊松噪声)。我们对未进行归一化的 scRNAseq 数据进行分析——这一步骤会扭曲分布,尤其是对于稀疏数据——并计算与关键统计数据相关的 p 值。我们开发了一种改进的方法,用于选择细胞聚类和识别正相关和负相关基因的特征。使用模拟数据,我们表明,这种方法(我们称之为 BigSur,即未归一化读取的基本信息学和基因统计)可以捕获 scRNAseq 数据中即使是微弱但却显著的相关结构。将 BigSur 应用于克隆人黑色素瘤细胞系的数据,我们鉴定了数千个相关性,当无监督地将这些相关性聚类为基因社区时,这些相关性与已知的细胞成分和生物过程相吻合,并突出了潜在的新的细胞生物学关系。

结论

使用基于统计学的方法识别基因-基因相关性,可以深入了解功能相关的基因调控网络。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/e1042ee2ec22/12859_2024_5926_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/dcd1e0cdb1ea/12859_2024_5926_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/ae767f898213/12859_2024_5926_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/29068e8d9d39/12859_2024_5926_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/8e7cd2d1d2f7/12859_2024_5926_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/4087ad9df315/12859_2024_5926_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/5890faf12bf6/12859_2024_5926_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/e1042ee2ec22/12859_2024_5926_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/dcd1e0cdb1ea/12859_2024_5926_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/ae767f898213/12859_2024_5926_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/29068e8d9d39/12859_2024_5926_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/8e7cd2d1d2f7/12859_2024_5926_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/4087ad9df315/12859_2024_5926_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/5890faf12bf6/12859_2024_5926_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/377c/11411778/e1042ee2ec22/12859_2024_5926_Fig7_HTML.jpg

相似文献

1
Leveraging gene correlations in single cell transcriptomic data.利用单细胞转录组数据中的基因相关性。
BMC Bioinformatics. 2024 Sep 18;25(1):305. doi: 10.1186/s12859-024-05926-z.
2
Leveraging gene correlations in single cell transcriptomic data.利用单细胞转录组数据中的基因相关性。
bioRxiv. 2023 Nov 1:2023.03.14.532643. doi: 10.1101/2023.03.14.532643.
3
Shrinkage estimation of gene interaction networks in single-cell RNA sequencing data.单细胞 RNA 测序数据中基因互作网络的收缩估计。
BMC Bioinformatics. 2024 Oct 26;25(1):339. doi: 10.1186/s12859-024-05946-9.
4
TENET: gene network reconstruction using transfer entropy reveals key regulatory factors from single cell transcriptomic data.TENET:利用传递熵重建基因网络,从单细胞转录组数据中揭示关键调控因子。
Nucleic Acids Res. 2021 Jan 11;49(1):e1. doi: 10.1093/nar/gkaa1014.
5
Gene regulatory network reconstruction using single-cell RNA sequencing of barcoded genotypes in diverse environments.使用在不同环境中进行条码基因型单细胞 RNA 测序进行基因调控网络重建。
Elife. 2020 Jan 27;9:e51254. doi: 10.7554/eLife.51254.
6
Gene selection for optimal prediction of cell position in tissues from single-cell transcriptomics data.从单细胞转录组学数据中选择基因以最优地预测细胞在组织中的位置。
Life Sci Alliance. 2020 Sep 24;3(11). doi: 10.26508/lsa.202000867. Print 2020 Nov.
7
Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data.从单细胞 RNA-seq 数据中计算推断网络的方法的优势和劣势分析。
G3 (Bethesda). 2023 Mar 9;13(3). doi: 10.1093/g3journal/jkad004.
8
Identifying cell populations with scRNASeq.单细胞 RNA 测序鉴定细胞群体。
Mol Aspects Med. 2018 Feb;59:114-122. doi: 10.1016/j.mam.2017.07.002. Epub 2017 Jul 25.
9
Inference of Gene Co-expression Networks from Single-Cell RNA-Sequencing Data.从单细胞RNA测序数据推断基因共表达网络
Methods Mol Biol. 2019;1935:141-153. doi: 10.1007/978-1-4939-9057-3_10.
10
Single-Cell Transcriptome Profiling.单细胞转录组谱分析。
Methods Mol Biol. 2021;2243:311-325. doi: 10.1007/978-1-0716-1103-6_16.

引用本文的文献

1
Uncovering minimal pathways in melanoma initiation.揭示黑色素瘤起始中的最小通路。
Nat Commun. 2025 Jun 26;16(1):5369. doi: 10.1038/s41467-025-60742-0.
2
SciGeneX: enhancing transcriptional analysis through gene module detection in single-cell and spatial transcriptomics data.SciGeneX:通过在单细胞和空间转录组学数据中进行基因模块检测来增强转录分析。
NAR Genom Bioinform. 2025 Apr 17;7(2):lqaf043. doi: 10.1093/nargab/lqaf043. eCollection 2025 Jun.
3
Statistically principled feature selection for single cell transcriptomics.单细胞转录组学的统计学原理特征选择

本文引用的文献

1
Anti-correlated feature selection prevents false discovery of subpopulations in scRNAseq.抗相关特征选择可防止 scRNAseq 中亚群的假发现。
Nat Commun. 2024 Jan 24;15(1):699. doi: 10.1038/s41467-023-43406-9.
2
hdWGCNA identifies co-expression networks in high-dimensional transcriptomics data.hdWGCNA 鉴定高维转录组学数据中的共表达网络。
Cell Rep Methods. 2023 Jun 12;3(6):100498. doi: 10.1016/j.crmeth.2023.100498. eCollection 2023 Jun 26.
3
SEACells infers transcriptional and epigenomic cellular states from single-cell genomics data.
bioRxiv. 2024 Oct 15:2024.10.11.617709. doi: 10.1101/2024.10.11.617709.
SEACells 从单细胞基因组学数据推断转录和表观基因组细胞状态。
Nat Biotechnol. 2023 Dec;41(12):1746-1757. doi: 10.1038/s41587-023-01716-9. Epub 2023 Mar 27.
4
Paralog Explorer: A resource for mining information about paralogs in common research organisms.旁系同源物浏览器:一个挖掘常见研究生物体中旁系同源物信息的资源库。
Comput Struct Biotechnol J. 2022 Nov 24;20:6570-6577. doi: 10.1016/j.csbj.2022.11.041. eCollection 2022.
5
spliceJAC: transition genes and state-specific gene regulation from single-cell transcriptome data.拼接 JAC:从单细胞转录组数据中过渡基因和状态特异性基因调控。
Mol Syst Biol. 2022 Nov;18(11):e11176. doi: 10.15252/msb.202211176.
6
The Non Catalytic Protein ERG28 has a Functional Role in Cholesterol Synthesis and is Coregulated Transcriptionally.非催化蛋白 ERG28 在胆固醇合成中具有功能作用,并在转录水平上共同调节。
J Lipid Res. 2022 Dec;63(12):100295. doi: 10.1016/j.jlr.2022.100295. Epub 2022 Oct 8.
7
Inferring gene regulation from stochastic transcriptional variation across single cells at steady state.从稳态下单细胞的随机转录变化推断基因调控。
Proc Natl Acad Sci U S A. 2022 Aug 23;119(34):e2207392119. doi: 10.1073/pnas.2207392119. Epub 2022 Aug 15.
8
Metacells untangle large and complex single-cell transcriptome networks.类细胞可梳理大型且复杂的单细胞转录组网络。
BMC Bioinformatics. 2022 Aug 13;23(1):336. doi: 10.1186/s12859-022-04861-1.
9
Learning discriminative and structural samples for rare cell types with deep generative model.利用深度生成模型学习罕见细胞类型的判别和结构样本。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac317.
10
Differential Expression Analysis of Single-Cell RNA-Seq Data: Current Statistical Approaches and Outstanding Challenges.单细胞RNA测序数据的差异表达分析:当前的统计方法与突出挑战
Entropy (Basel). 2022 Jul 18;24(7):995. doi: 10.3390/e24070995.