• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用单细胞转录组数据中的基因相关性。

Leveraging gene correlations in single cell transcriptomic data.

作者信息

Silkwood Kai, Dollinger Emmanuel, Gervin Josh, Atwood Scott, Nie Qing, Lander Arthur D

机构信息

Center for Complex Biological Systems, University of California, Irvine, Irvine CA.

Department of Developmental and Cell Biology, University of California, Irvine, Irvine CA.

出版信息

bioRxiv. 2023 Nov 1:2023.03.14.532643. doi: 10.1101/2023.03.14.532643.

DOI:10.1101/2023.03.14.532643
PMID:36993765
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10055147/
Abstract

BACKGROUND

Many approaches have been developed to overcome technical noise in single cell RNA-sequencing (scRNAseq). As researchers dig deeper into data-looking for rare cell types, subtleties of cell states, and details of gene regulatory networks-there is a growing need for algorithms with controllable accuracy and fewer parameters and thresholds. Impeding this goal is the fact that an appropriate null distribution for scRNAseq cannot simply be extracted from data when ground truth about biological variation is unknown (i.e., usually).

RESULTS

We approach this problem analytically, assuming that scRNAseq data reflect only cell heterogeneity (what we seek to characterize), transcriptional noise (temporal fluctuations randomly distributed across cells), and sampling error (i.e., Poisson noise). We analyze scRNAseq data without normalization-a step that skews distributions, particularly for sparse data-and calculate -values associated with key statistics. We develop an improved method for selecting features for cell clustering and identifying gene-gene correlations, both positive and negative. Using simulated data, we show that this method, which we call BigSur (Basic Informatics and Gene Statistics from Unnormalized Reads), captures even weak yet significant correlation structures in scRNAseq data. Applying BigSur to data from a clonal human melanoma cell line, we identify thousands of correlations that, when clustered without supervision into gene communities, align with known cellular components and biological processes, and highlight potentially novel cell biological relationships.

CONCLUSIONS

New insights into functionally relevant gene regulatory networks can be obtained using a statistically grounded approach to the identification of gene-gene correlations.

摘要

背景

已经开发出许多方法来克服单细胞RNA测序(scRNAseq)中的技术噪声。随着研究人员更深入地挖掘数据,寻找稀有细胞类型、细胞状态的细微差别以及基因调控网络的细节,对具有可控准确性、更少参数和阈值的算法的需求日益增长。阻碍这一目标的是,当关于生物学变异的真实情况未知时(即通常情况下),无法简单地从数据中提取scRNAseq的合适空分布。

结果

我们通过分析方法解决这个问题,假设scRNAseq数据仅反映细胞异质性(我们试图表征的内容)、转录噪声(随机分布在细胞间的时间波动)和采样误差(即泊松噪声)。我们在不进行归一化的情况下分析scRNAseq数据——归一化这一步骤会扭曲分布,特别是对于稀疏数据——并计算与关键统计量相关的p值。我们开发了一种改进方法,用于选择细胞聚类的特征以及识别正负基因-基因相关性。使用模拟数据,我们表明这种我们称为BigSur(来自未归一化读数的基本信息学和基因统计)的方法能够捕捉scRNAseq数据中即使是微弱但显著的相关结构。将BigSur应用于克隆人黑色素瘤细胞系的数据,我们识别出数千种相关性,在无监督的情况下将这些相关性聚类成基因群落时,它们与已知的细胞成分和生物学过程一致,并突出了潜在的新型细胞生物学关系。

结论

使用基于统计学的方法来识别基因-基因相关性,可以获得对功能相关基因调控网络的新见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/8a2a3bdf8a98/nihpp-2023.03.14.532643v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/af45fae2c30b/nihpp-2023.03.14.532643v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/1d76693a642f/nihpp-2023.03.14.532643v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/47a2fa751747/nihpp-2023.03.14.532643v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/e6c1bb643b21/nihpp-2023.03.14.532643v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/6caad9057824/nihpp-2023.03.14.532643v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/8a2a3bdf8a98/nihpp-2023.03.14.532643v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/af45fae2c30b/nihpp-2023.03.14.532643v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/1d76693a642f/nihpp-2023.03.14.532643v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/47a2fa751747/nihpp-2023.03.14.532643v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/e6c1bb643b21/nihpp-2023.03.14.532643v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/6caad9057824/nihpp-2023.03.14.532643v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d10/10621361/8a2a3bdf8a98/nihpp-2023.03.14.532643v2-f0006.jpg

相似文献

1
Leveraging gene correlations in single cell transcriptomic data.利用单细胞转录组数据中的基因相关性。
bioRxiv. 2023 Nov 1:2023.03.14.532643. doi: 10.1101/2023.03.14.532643.
2
Leveraging gene correlations in single cell transcriptomic data.利用单细胞转录组数据中的基因相关性。
BMC Bioinformatics. 2024 Sep 18;25(1):305. doi: 10.1186/s12859-024-05926-z.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Shrinkage estimation of gene interaction networks in single-cell RNA sequencing data.单细胞 RNA 测序数据中基因互作网络的收缩估计。
BMC Bioinformatics. 2024 Oct 26;25(1):339. doi: 10.1186/s12859-024-05946-9.
5
Gene regulatory network reconstruction using single-cell RNA sequencing of barcoded genotypes in diverse environments.使用在不同环境中进行条码基因型单细胞 RNA 测序进行基因调控网络重建。
Elife. 2020 Jan 27;9:e51254. doi: 10.7554/eLife.51254.
6
Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data.从单细胞 RNA-seq 数据中计算推断网络的方法的优势和劣势分析。
G3 (Bethesda). 2023 Mar 9;13(3). doi: 10.1093/g3journal/jkad004.
7
mAPC-GibbsOS: an integrated approach for robust identification of gene regulatory networks.mAPC-GibbsOS:一种用于稳健识别基因调控网络的综合方法。
BMC Syst Biol. 2013;7 Suppl 5(Suppl 5):S4. doi: 10.1186/1752-0509-7-S5-S4. Epub 2013 Dec 9.
8
FR-Match: robust matching of cell type clusters from single cell RNA sequencing data using the Friedman-Rafsky non-parametric test.FR-Match:使用 Friedman-Rafsky 非参数检验对单细胞 RNA 测序数据中的细胞类型簇进行稳健匹配。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa339.
9
TENET: gene network reconstruction using transfer entropy reveals key regulatory factors from single cell transcriptomic data.TENET:利用传递熵重建基因网络,从单细胞转录组数据中揭示关键调控因子。
Nucleic Acids Res. 2021 Jan 11;49(1):e1. doi: 10.1093/nar/gkaa1014.
10
CIPR: a web-based R/shiny app and R package to annotate cell clusters in single cell RNA sequencing experiments.CIPR:一个基于网络的 R/shiny 应用程序和 R 包,用于注释单细胞 RNA 测序实验中的细胞簇。
BMC Bioinformatics. 2020 May 15;21(1):191. doi: 10.1186/s12859-020-3538-2.

本文引用的文献

1
Anti-correlated feature selection prevents false discovery of subpopulations in scRNAseq.抗相关特征选择可防止 scRNAseq 中亚群的假发现。
Nat Commun. 2024 Jan 24;15(1):699. doi: 10.1038/s41467-023-43406-9.
2
Paralog Explorer: A resource for mining information about paralogs in common research organisms.旁系同源物浏览器:一个挖掘常见研究生物体中旁系同源物信息的资源库。
Comput Struct Biotechnol J. 2022 Nov 24;20:6570-6577. doi: 10.1016/j.csbj.2022.11.041. eCollection 2022.
3
spliceJAC: transition genes and state-specific gene regulation from single-cell transcriptome data.
拼接 JAC:从单细胞转录组数据中过渡基因和状态特异性基因调控。
Mol Syst Biol. 2022 Nov;18(11):e11176. doi: 10.15252/msb.202211176.
4
The Non Catalytic Protein ERG28 has a Functional Role in Cholesterol Synthesis and is Coregulated Transcriptionally.非催化蛋白 ERG28 在胆固醇合成中具有功能作用,并在转录水平上共同调节。
J Lipid Res. 2022 Dec;63(12):100295. doi: 10.1016/j.jlr.2022.100295. Epub 2022 Oct 8.
5
Inferring gene regulation from stochastic transcriptional variation across single cells at steady state.从稳态下单细胞的随机转录变化推断基因调控。
Proc Natl Acad Sci U S A. 2022 Aug 23;119(34):e2207392119. doi: 10.1073/pnas.2207392119. Epub 2022 Aug 15.
6
Learning discriminative and structural samples for rare cell types with deep generative model.利用深度生成模型学习罕见细胞类型的判别和结构样本。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac317.
7
Differential Expression Analysis of Single-Cell RNA-Seq Data: Current Statistical Approaches and Outstanding Challenges.单细胞RNA测序数据的差异表达分析:当前的统计方法与突出挑战
Entropy (Basel). 2022 Jul 18;24(7):995. doi: 10.3390/e24070995.
8
Benchmarking methods for detecting differential states between conditions from multi-subject single-cell RNA-seq data.用于检测多主体单细胞 RNA-seq 数据中条件间差异状态的基准方法。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac286.
9
Detecting critical transition signals from single-cell transcriptomes to infer lineage-determining transcription factors.从单细胞转录组中检测关键转变信号,以推断决定谱系的转录因子。
Nucleic Acids Res. 2022 Sep 9;50(16):e91. doi: 10.1093/nar/gkac452.
10
Comparison and evaluation of statistical error models for scRNA-seq.单细胞RNA测序(scRNA-seq)统计误差模型的比较与评估
Genome Biol. 2022 Jan 18;23(1):27. doi: 10.1186/s13059-021-02584-9.