• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于改进RNA测序数据分析的双参数广义泊松模型。

A two-parameter generalized Poisson model to improve the analysis of RNA-seq data.

作者信息

Srivastava Sudeep, Chen Liang

机构信息

Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA.

出版信息

Nucleic Acids Res. 2010 Sep;38(17):e170. doi: 10.1093/nar/gkq670. Epub 2010 Jul 29.

DOI:10.1093/nar/gkq670
PMID:20671027
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2943596/
Abstract

Deep sequencing of RNAs (RNA-seq) has been a useful tool to characterize and quantify transcriptomes. However, there are significant challenges in the analysis of RNA-seq data, such as how to separate signals from sequencing bias and how to perform reasonable normalization. Here, we focus on a fundamental question in RNA-seq analysis: the distribution of the position-level read counts. Specifically, we propose a two-parameter generalized Poisson (GP) model to the position-level read counts. We show that the GP model fits the data much better than the traditional Poisson model. Based on the GP model, we can better estimate gene or exon expression, perform a more reasonable normalization across different samples, and improve the identification of differentially expressed genes and the identification of differentially spliced exons. The usefulness of the GP model is demonstrated by applications to multiple RNA-seq data sets.

摘要

RNA的深度测序(RNA-seq)已成为表征和定量转录组的有用工具。然而,RNA-seq数据分析存在重大挑战,例如如何从测序偏差中分离信号以及如何进行合理的标准化。在此,我们关注RNA-seq分析中的一个基本问题:位置水平读取计数的分布。具体而言,我们针对位置水平读取计数提出了一种双参数广义泊松(GP)模型。我们表明,GP模型比传统泊松模型对数据的拟合要好得多。基于GP模型,我们可以更好地估计基因或外显子表达,在不同样本间进行更合理的标准化,并改善差异表达基因的识别以及差异剪接外显子的识别。通过将GP模型应用于多个RNA-seq数据集,证明了该模型的实用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b69c/2943596/005551bb027b/gkq670f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b69c/2943596/35fdc41572d4/gkq670f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b69c/2943596/5e955e511eab/gkq670f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b69c/2943596/8f81e61975e7/gkq670f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b69c/2943596/44cdae40e2bb/gkq670f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b69c/2943596/005551bb027b/gkq670f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b69c/2943596/35fdc41572d4/gkq670f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b69c/2943596/5e955e511eab/gkq670f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b69c/2943596/8f81e61975e7/gkq670f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b69c/2943596/44cdae40e2bb/gkq670f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b69c/2943596/005551bb027b/gkq670f5.jpg

相似文献

1
A two-parameter generalized Poisson model to improve the analysis of RNA-seq data.一种用于改进RNA测序数据分析的双参数广义泊松模型。
Nucleic Acids Res. 2010 Sep;38(17):e170. doi: 10.1093/nar/gkq670. Epub 2010 Jul 29.
2
Identifying differentially spliced genes from two groups of RNA-seq samples.从两组 RNA-seq 样本中鉴定差异剪接基因。
Gene. 2013 Apr 10;518(1):164-70. doi: 10.1016/j.gene.2012.11.045. Epub 2012 Dec 8.
3
deGPS is a powerful tool for detecting differential expression in RNA-sequencing studies.deGPS是一种用于在RNA测序研究中检测差异表达的强大工具。
BMC Genomics. 2015 Jun 13;16(1):455. doi: 10.1186/s12864-015-1676-0.
4
Differential expression analysis of RNA sequencing data by incorporating non-exonic mapped reads.通过纳入非外显子映射读数对RNA测序数据进行差异表达分析。
BMC Genomics. 2015;16 Suppl 7(Suppl 7):S14. doi: 10.1186/1471-2164-16-S7-S14. Epub 2015 Jun 11.
5
Challenges in estimating percent inclusion of alternatively spliced junctions from RNA-seq data.从 RNA-seq 数据估算选择性剪接连接百分比的挑战。
BMC Bioinformatics. 2012 Apr 19;13 Suppl 6(Suppl 6):S11. doi: 10.1186/1471-2105-13-S6-S11.
6
A multi-Poisson dynamic mixture model to cluster developmental patterns of gene expression by RNA-seq.一种用于通过RNA测序对基因表达发育模式进行聚类的多泊松动态混合模型。
Brief Bioinform. 2015 Mar;16(2):205-15. doi: 10.1093/bib/bbu013. Epub 2014 May 10.
7
BADGE: a novel Bayesian model for accurate abundance quantification and differential analysis of RNA-Seq data.标记:一种用于 RNA-Seq 数据精确丰度定量和差异分析的新型贝叶斯模型。
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S6. doi: 10.1186/1471-2105-15-S9-S6. Epub 2014 Sep 10.
8
Co-expression analysis of high-throughput transcriptome sequencing data with Poisson mixture models.高通量转录组测序数据的泊松混合模型共表达分析。
Bioinformatics. 2015 May 1;31(9):1420-7. doi: 10.1093/bioinformatics/btu845. Epub 2015 Jan 5.
9
CEDER: accurate detection of differentially expressed genes by combining significance of exons using RNA-Seq.CEDER:通过使用 RNA-Seq 组合外显子的显著性来准确检测差异表达基因。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Sep-Oct;9(5):1281-92. doi: 10.1109/TCBB.2012.83.
10
Improving RNA-Seq expression estimation by modeling isoform- and exon-specific read sequencing rate.通过对异构体和外显子特异性读段测序率进行建模来改进RNA测序表达估计。
BMC Bioinformatics. 2015 Oct 16;16:332. doi: 10.1186/s12859-015-0750-6.

引用本文的文献

1
Computational methods for alternative polyadenylation and splicing in post-transcriptional gene regulation.转录后基因调控中可变聚腺苷酸化和剪接的计算方法
Exp Mol Med. 2025 Aug 14. doi: 10.1038/s12276-025-01496-z.
2
Shiba: a versatile computational method for systematic identification of differential RNA splicing across platforms.柴犬:一种用于跨平台系统识别差异RNA剪接的通用计算方法。
Nucleic Acids Res. 2025 Feb 8;53(4). doi: 10.1093/nar/gkaf098.
3
Shiba: A versatile computational method for systematic identification of differential RNA splicing across platforms.

本文引用的文献

1
Modeling non-uniformity in short-read rates in RNA-Seq data.RNA-Seq 数据中短读率非均匀性建模。
Genome Biol. 2010;11(5):R50. doi: 10.1186/gb-2010-11-5-r50. Epub 2010 May 11.
2
Biases in Illumina transcriptome sequencing caused by random hexamer priming.Illumina 转录组测序中随机六聚体引物引起的偏倚。
Nucleic Acids Res. 2010 Jul;38(12):e131. doi: 10.1093/nar/gkq224. Epub 2010 Apr 14.
3
A scaling normalization method for differential expression analysis of RNA-seq data.RNA-seq 数据差异表达分析的缩放标准化方法。
柴犬:一种用于跨平台系统识别差异RNA剪接的通用计算方法。
bioRxiv. 2025 Jan 23:2024.05.30.596331. doi: 10.1101/2024.05.30.596331.
4
Exploring the fragmentation efficiency of proteins analyzed by MALDI-TOF-TOF tandem mass spectrometry using computational and statistical analyses.使用计算和统计分析方法探索基质辅助激光解吸电离飞行时间串联质谱(MALDI-TOF-TOF)分析蛋白质的碎片化效率。
PLoS One. 2024 May 3;19(5):e0299287. doi: 10.1371/journal.pone.0299287. eCollection 2024.
5
Artifacts and biases of the reverse transcription reaction in RNA sequencing.RNA 测序中反转录反应的假象和偏差。
RNA. 2023 Jul;29(7):889-897. doi: 10.1261/rna.079623.123. Epub 2023 Mar 29.
6
A Comprehensive Survey of Statistical Approaches for Differential Expression Analysis in Single-Cell RNA Sequencing Studies.单细胞 RNA 测序研究中差异表达分析的统计方法综合综述。
Genes (Basel). 2021 Dec 2;12(12):1947. doi: 10.3390/genes12121947.
7
A New -Regularized Log-Linear Poisson Graphical Model with Applications to RNA Sequencing Data.一种新的正则化对数线性泊松图模型及其在 RNA 测序数据中的应用。
8
Alternative splicing: Human disease and quantitative analysis from high-throughput sequencing.可变剪接:人类疾病与高通量测序的定量分析
Comput Struct Biotechnol J. 2020 Dec 24;19:183-195. doi: 10.1016/j.csbj.2020.12.009. eCollection 2021.
9
Statistical and Bioinformatics Analysis of Data from Bulk and Single-Cell RNA Sequencing Experiments.批量和单细胞 RNA 测序实验数据的统计和生物信息学分析。
Methods Mol Biol. 2021;2194:143-175. doi: 10.1007/978-1-0716-0849-4_9.
10
Zero-inflated Poisson factor model with application to microbiome read counts.零膨胀泊松因子模型及其在微生物组读频数中的应用。
Biometrics. 2021 Mar;77(1):91-101. doi: 10.1111/biom.13272. Epub 2020 May 4.
Genome Biol. 2010;11(3):R25. doi: 10.1186/gb-2010-11-3-r25. Epub 2010 Mar 2.
4
Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments.mRNA-Seq 实验中标准化和差异表达的统计方法评估。
BMC Bioinformatics. 2010 Feb 18;11:94. doi: 10.1186/1471-2105-11-94.
5
An abundance of ubiquitously expressed genes revealed by tissue transcriptome sequence data.组织转录组序列数据揭示了大量普遍表达的基因。
PLoS Comput Biol. 2009 Dec;5(12):e1000598. doi: 10.1371/journal.pcbi.1000598. Epub 2009 Dec 11.
6
Computation for ChIP-seq and RNA-seq studies.染色质免疫沉淀测序(ChIP-seq)和RNA测序(RNA-seq)研究的计算
Nat Methods. 2009 Nov;6(11 Suppl):S22-32. doi: 10.1038/nmeth.1371.
7
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.短DNA序列与人类基因组的超快速且内存高效比对。
Genome Biol. 2009;10(3):R25. doi: 10.1186/gb-2009-10-3-r25. Epub 2009 Mar 4.
8
Statistical inferences for isoform expression in RNA-Seq.RNA测序中异构体表达的统计推断。
Bioinformatics. 2009 Apr 15;25(8):1026-32. doi: 10.1093/bioinformatics/btp113. Epub 2009 Feb 25.
9
Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing.通过高通量测序对人类转录组中可变剪接复杂性进行深度研究。
Nat Genet. 2008 Dec;40(12):1413-5. doi: 10.1038/ng.259. Epub 2008 Nov 2.
10
Expression of 24,426 human alternative splicing events and predicted cis regulation in 48 tissues and cell lines.48种组织和细胞系中24,426个人类可变剪接事件的表达及预测的顺式调控
Nat Genet. 2008 Dec;40(12):1416-25. doi: 10.1038/ng.264. Epub 2008 Nov 2.