• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单细胞 RNA 测序数据的捕获效率建模可提高对转录组范围爆发动力学的推断。

Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics.

机构信息

Department of Mathematics, Imperial College London, London SW7 2BX, United Kingdom.

I-X Centre for AI in Science, Imperial College London, White City Campus, London W12 0BZ, United Kingdom.

出版信息

Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad395.

DOI:10.1093/bioinformatics/btad395
PMID:37354494
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10318389/
Abstract

MOTIVATION

Gene expression is characterized by stochastic bursts of transcription that occur at brief and random periods of promoter activity. The kinetics of gene expression burstiness differs across the genome and is dependent on the promoter sequence, among other factors. Single-cell RNA sequencing (scRNA-seq) has made it possible to quantify the cell-to-cell variability in transcription at a global genome-wide level. However, scRNA-seq data are prone to technical variability, including low and variable capture efficiency of transcripts from individual cells.

RESULTS

Here, we propose a novel mathematical theory for the observed variability in scRNA-seq data. Our method captures burst kinetics and variability in both the cell size and capture efficiency, which allows us to propose several likelihood-based and simulation-based methods for the inference of burst kinetics from scRNA-seq data. Using both synthetic and real data, we show that the simulation-based methods provide an accurate, robust and flexible tool for inferring burst kinetics from scRNA-seq data. In particular, in a supervised manner, a simulation-based inference method based on neural networks proves to be accurate and useful when applied to both allele and nonallele-specific scRNA-seq data.

AVAILABILITY AND IMPLEMENTATION

The code for Neural Network and Approximate Bayesian Computation inference is available at https://github.com/WT215/nnRNA and https://github.com/WT215/Julia_ABC, respectively.

摘要

动机

基因表达的特点是转录的随机突发,这些突发发生在启动子活性的短暂和随机时期。基因表达突发的动力学在整个基因组中是不同的,并且取决于启动子序列等因素。单细胞 RNA 测序 (scRNA-seq) 使得在全基因组范围内量化转录的细胞间变异性成为可能。然而,scRNA-seq 数据容易受到技术变异性的影响,包括来自单个细胞的转录本的低捕获效率和可变捕获效率。

结果

在这里,我们提出了一种用于解释 scRNA-seq 数据中观察到的变异性的新数学理论。我们的方法捕获了细胞大小和捕获效率中的突发动力学和变异性,这使我们能够提出几种基于似然和基于模拟的方法,用于从 scRNA-seq 数据推断突发动力学。使用合成数据和真实数据,我们表明基于模拟的方法为从 scRNA-seq 数据推断突发动力学提供了一种准确、稳健和灵活的工具。特别是,在监督方式下,基于神经网络的基于模拟的推断方法在应用于等位基因和非等位基因特异性 scRNA-seq 数据时被证明是准确和有用的。

可用性和实现

基于神经网络和近似贝叶斯计算的推断的代码分别可在 https://github.com/WT215/nnRNA 和 https://github.com/WT215/Julia_ABC 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/1adb2ecf2297/btad395f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/2bb9eac1c649/btad395f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/84d42ace9d7d/btad395f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/b861fbb36506/btad395f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/4e7cb7d5c122/btad395f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/9c0193356777/btad395f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/1adb2ecf2297/btad395f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/2bb9eac1c649/btad395f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/84d42ace9d7d/btad395f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/b861fbb36506/btad395f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/4e7cb7d5c122/btad395f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/9c0193356777/btad395f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bff3/10318389/1adb2ecf2297/btad395f6.jpg

相似文献

1
Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics.单细胞 RNA 测序数据的捕获效率建模可提高对转录组范围爆发动力学的推断。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad395.
2
bayNorm: Bayesian gene expression recovery, imputation and normalization for single-cell RNA-sequencing data.bayNorm:用于单细胞 RNA-seq 数据的贝叶斯基因表达恢复、插补和标准化。
Bioinformatics. 2020 Feb 15;36(4):1174-1181. doi: 10.1093/bioinformatics/btz726.
3
Benchmarking imputation methods for network inference using a novel method of synthetic scRNA-seq data generation.基于新型合成 scRNA-seq 数据生成方法的网络推断中插补方法的基准测试。
BMC Bioinformatics. 2022 Jun 17;23(1):236. doi: 10.1186/s12859-022-04778-9.
4
Graph contrastive learning as a versatile foundation for advanced scRNA-seq data analysis.图对比学习作为高级 scRNA-seq 数据分析的多功能基础。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae558.
5
Machine learning and statistical methods for clustering single-cell RNA-sequencing data.机器学习和统计方法在单细胞 RNA 测序数据分析中的应用。
Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063.
6
Random forest based similarity learning for single cell RNA sequencing data.基于随机森林的单细胞 RNA 测序数据相似性学习。
Bioinformatics. 2018 Jul 1;34(13):i79-i88. doi: 10.1093/bioinformatics/bty260.
7
STGRNS: an interpretable transformer-based method for inferring gene regulatory networks from single-cell transcriptomic data.STGRNS:一种基于可解释转换器的方法,用于从单细胞转录组数据推断基因调控网络。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad165.
8
SCRIP: an accurate simulator for single-cell RNA sequencing data.SCRIP:单细胞 RNA 测序数据的精确模拟器。
Bioinformatics. 2022 Feb 7;38(5):1304-1311. doi: 10.1093/bioinformatics/btab824.
9
scBGEDA: deep single-cell clustering analysis via a dual denoising autoencoder with bipartite graph ensemble clustering.scBGEDA:基于双分图集成分聚类的对偶去噪自动编码器的单细胞聚类分析。
Bioinformatics. 2023 Feb 14;39(2). doi: 10.1093/bioinformatics/btad075.
10
scNPF: an integrative framework assisted by network propagation and network fusion for preprocessing of single-cell RNA-seq data.scNPF:一种基于网络传播和网络融合的综合框架,用于单细胞 RNA-seq 数据的预处理。
BMC Genomics. 2019 May 8;20(1):347. doi: 10.1186/s12864-019-5747-5.

引用本文的文献

1
Marker genes reveal dynamic features of cell evolving processes.标记基因揭示了细胞进化过程的动态特征。
Bioinform Adv. 2025 Aug 5;5(1):vbaf185. doi: 10.1093/bioadv/vbaf185. eCollection 2025.
2
A conserved coupling of transcriptional ON and OFF periods underlies bursting dynamics.转录开启和关闭时期的保守耦合是爆发动力学的基础。
Nat Struct Mol Biol. 2025 Jul 15. doi: 10.1038/s41594-025-01615-4.
3
Cell-cycle dependence of bursty gene expression: insights from fitting mechanistic models to single-cell RNA-seq data.爆发性基因表达的细胞周期依赖性:通过将机制模型拟合到单细胞RNA测序数据获得的见解

本文引用的文献

1
Multi-view data visualisation manifold learning.多视图数据可视化 流形学习
PeerJ Comput Sci. 2024 May 24;10:e1993. doi: 10.7717/peerj-cs.1993. eCollection 2024.
2
Genome-wide inference reveals that feedback regulations constrain promoter-dependent transcriptional burst kinetics.全基因组推断表明,反馈调控限制了依赖启动子的转录爆发动力学。
Nucleic Acids Res. 2023 Jan 11;51(1):68-83. doi: 10.1093/nar/gkac1204.
3
Selective advantage of epigenetically disrupted cancer cells via phenotypic inertia.通过表型惰性选择表观遗传失调的癌细胞的优势。
Nucleic Acids Res. 2025 Apr 10;53(7). doi: 10.1093/nar/gkaf295.
4
Intrinsic OASL expression licenses interferon induction during influenza A virus infection.内在的OASL表达在甲型流感病毒感染期间促进干扰素诱导。
bioRxiv. 2025 Mar 17:2025.03.14.643375. doi: 10.1101/2025.03.14.643375.
5
Batch correcting single-cell spatial transcriptomics count data with Crescendo improves visualization and detection of spatial gene patterns.使用Crescendo对单细胞空间转录组学计数数据进行批量校正可改善空间基因模式的可视化和检测。
Genome Biol. 2025 Feb 25;26(1):36. doi: 10.1186/s13059-025-03479-9.
6
Identification of molecular determinants of gene-specific bursting patterns by high-throughput imaging screens.通过高通量成像筛选鉴定基因特异性爆发模式的分子决定因素。
Mol Cell. 2025 Mar 6;85(5):913-928.e8. doi: 10.1016/j.molcel.2025.01.022. Epub 2025 Feb 19.
7
Transcriptional bursting dynamics in gene expression.基因表达中的转录爆发动力学。
Front Genet. 2024 Sep 13;15:1451461. doi: 10.3389/fgene.2024.1451461. eCollection 2024.
8
Transcriptional bursting: from fundamentals to novel insights.转录爆发:从基础到新的见解。
Biochem Soc Trans. 2024 Aug 28;52(4):1695-1702. doi: 10.1042/BST20231286.
9
Identification of molecular determinants of gene-specific bursting patterns by high-throughput imaging screens.通过高通量成像筛选鉴定基因特异性爆发模式的分子决定因素。
bioRxiv. 2024 Jun 8:2024.06.08.597999. doi: 10.1101/2024.06.08.597999.
10
Quantifying and correcting bias in transcriptional parameter inference from single-cell data.从单细胞数据中量化和纠正转录参数推断中的偏差。
Biophys J. 2024 Jan 2;123(1):4-30. doi: 10.1016/j.bpj.2023.10.021. Epub 2023 Oct 27.
Cancer Cell. 2023 Jan 9;41(1):70-87.e14. doi: 10.1016/j.ccell.2022.10.002. Epub 2022 Nov 3.
4
Quantifying how post-transcriptional noise and gene copy number variation bias transcriptional parameter inference from mRNA distributions.量化转录后噪声和基因拷贝数变异如何影响从 mRNA 分布推断转录参数。
Elife. 2022 Oct 17;11:e82493. doi: 10.7554/eLife.82493.
5
Efficient Bayesian inference for stochastic agent-based models.基于代理的随机模型的高效贝叶斯推断。
PLoS Comput Biol. 2022 Oct 5;18(10):e1009508. doi: 10.1371/journal.pcbi.1009508. eCollection 2022 Oct.
6
Airpart: interpretable statistical models for analyzing allelic imbalance in single-cell datasets.Airpart:用于分析单细胞数据集等位基因失衡的可解释统计模型。
Bioinformatics. 2022 May 13;38(10):2773-2780. doi: 10.1093/bioinformatics/btac212.
7
Parameter inference for stochastic biochemical models from perturbation experiments parallelised at the single cell level.从单细胞水平的扰动实验并行化推断随机生化模型的参数。
PLoS Comput Biol. 2022 Mar 18;18(3):e1009950. doi: 10.1371/journal.pcbi.1009950. eCollection 2022 Mar.
8
Mapping transcriptomic vector fields of single cells.单细胞转录组向量场映射。
Cell. 2022 Feb 17;185(4):690-711.e45. doi: 10.1016/j.cell.2021.12.045. Epub 2022 Feb 1.
9
Pathway dynamics can delineate the sources of transcriptional noise in gene expression.通路动力学可以描绘基因表达中转录噪声的来源。
Elife. 2021 Oct 12;10:e69324. doi: 10.7554/eLife.69324.
10
Intricacies of single-cell multi-omics data integration.单细胞多组学数据整合的复杂性。
Trends Genet. 2022 Feb;38(2):128-139. doi: 10.1016/j.tig.2021.08.012. Epub 2021 Sep 21.