• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

baySeq:用于识别序列计数数据中差异表达的经验贝叶斯方法。

baySeq: empirical Bayesian methods for identifying differential expression in sequence count data.

机构信息

Department of Plant Sciences, University of Cambridge, Downing Street, Cambridge, UK.

出版信息

BMC Bioinformatics. 2010 Aug 10;11:422. doi: 10.1186/1471-2105-11-422.

DOI:10.1186/1471-2105-11-422
PMID:20698981
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2928208/
Abstract

BACKGROUND

High throughput sequencing has become an important technology for studying expression levels in many types of genomic, and particularly transcriptomic, data. One key way of analysing such data is to look for elements of the data which display particular patterns of differential expression in order to take these forward for further analysis and validation.

RESULTS

We propose a framework for defining patterns of differential expression and develop a novel algorithm, baySeq, which uses an empirical Bayes approach to detect these patterns of differential expression within a set of sequencing samples. The method assumes a negative binomial distribution for the data and derives an empirically determined prior distribution from the entire dataset. We examine the performance of the method on real and simulated data.

CONCLUSIONS

Our method performs at least as well, and often better, than existing methods for analyses of pairwise differential expression in both real and simulated data. When we compare methods for the analysis of data from experimental designs involving multiple sample groups, our method again shows substantial gains in performance. We believe that this approach thus represents an important step forward for the analysis of count data from sequencing experiments.

摘要

背景

高通量测序已成为研究基因组和特别是转录组数据中表达水平的重要技术。分析此类数据的一种关键方法是寻找数据中显示特定差异表达模式的元素,以便将这些元素进一步用于进一步的分析和验证。

结果

我们提出了一种定义差异表达模式的框架,并开发了一种新的算法 baySeq,该算法使用经验贝叶斯方法在一组测序样本中检测这些差异表达模式。该方法假设数据服从负二项分布,并从整个数据集推导出经验确定的先验分布。我们在真实数据和模拟数据上检查了该方法的性能。

结论

我们的方法在真实和模拟数据中分析成对差异表达时的性能至少与现有方法一样好,并且通常更好。当我们比较涉及多个样本组的实验设计数据的分析方法时,我们的方法再次显示出性能的显著提高。我们相信,这种方法代表了测序实验中计数数据分析的重要进展。

相似文献

1
baySeq: empirical Bayesian methods for identifying differential expression in sequence count data.baySeq:用于识别序列计数数据中差异表达的经验贝叶斯方法。
BMC Bioinformatics. 2010 Aug 10;11:422. doi: 10.1186/1471-2105-11-422.
2
Generalized empirical Bayesian methods for discovery of differential data in high-throughput biology.用于高通量生物学中差异数据发现的广义经验贝叶斯方法。
Bioinformatics. 2016 Jan 15;32(2):195-202. doi: 10.1093/bioinformatics/btv569. Epub 2015 Oct 1.
3
Empirical Bayesian analysis of paired high-throughput sequencing data with a beta-binomial distribution.基于贝塔二项式分布的高通量测序数据配对的经验贝叶斯分析。
BMC Bioinformatics. 2013 Apr 23;14:135. doi: 10.1186/1471-2105-14-135.
4
Methods for discovering genomic loci exhibiting complex patterns of differential methylation.发现呈现复杂差异甲基化模式的基因组位点的方法。
BMC Bioinformatics. 2017 Sep 18;18(1):416. doi: 10.1186/s12859-017-1836-0.
5
Covariate-dependent negative binomial factor analysis of RNA sequencing data.基于协变量的 RNA 测序数据负二项式因子分析。
Bioinformatics. 2018 Jul 1;34(13):i61-i69. doi: 10.1093/bioinformatics/bty237.
6
NPEBseq: nonparametric empirical bayesian-based procedure for differential expression analysis of RNA-seq data.NPEBseq:一种基于非参数经验贝叶斯的 RNA-seq 数据差异表达分析方法。
BMC Bioinformatics. 2013 Aug 27;14:262. doi: 10.1186/1471-2105-14-262.
7
A multi-model statistical approach for proteomic spectral count quantitation.一种用于蛋白质组学光谱计数定量的多模型统计方法。
J Proteomics. 2016 Jul 20;144:23-32. doi: 10.1016/j.jprot.2016.05.032. Epub 2016 May 31.
8
Differential expression analysis of RNA sequencing data by incorporating non-exonic mapped reads.通过纳入非外显子映射读数对RNA测序数据进行差异表达分析。
BMC Genomics. 2015;16 Suppl 7(Suppl 7):S14. doi: 10.1186/1471-2164-16-S7-S14. Epub 2015 Jun 11.
9
β-empirical Bayes inference and model diagnosis of microarray data.基于经验贝叶斯的微阵列数据分析推断和模型诊断。
BMC Bioinformatics. 2012 Jun 19;13:135. doi: 10.1186/1471-2105-13-135.
10
Confident difference criterion: a new Bayesian differentially expressed gene selection algorithm with applications.置信差异准则:一种新的贝叶斯差异表达基因选择算法及其应用
BMC Bioinformatics. 2015 Aug 7;16:245. doi: 10.1186/s12859-015-0664-3.

引用本文的文献

1
Poisson Beta Regression for Count Data With an Application to Hospital Length of Stay Data.用于计数数据的泊松贝塔回归及其在住院时间数据中的应用
Stat Med. 2025 Aug;44(18-19):e70217. doi: 10.1002/sim.70217.
2
Dysregulation of cell migration by matrix metalloproteinases in geleophysic dysplasia.基质金属蛋白酶在弹力纤维发育异常中对细胞迁移的调节异常
Sci Rep. 2025 Jun 6;15(1):19970. doi: 10.1038/s41598-025-04666-1.
3
Incorporating scale uncertainty in microbiome and gene expression analysis as an extension of normalization.将尺度不确定性纳入微生物组和基因表达分析,作为归一化的扩展。

本文引用的文献

1
Differential expression analysis for sequence count data.差异表达分析序列计数数据。
Genome Biol. 2010;11(10):R106. doi: 10.1186/gb-2010-11-10-r106. Epub 2010 Oct 27.
2
Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments.mRNA-Seq 实验中标准化和差异表达的统计方法评估。
BMC Bioinformatics. 2010 Feb 18;11:94. doi: 10.1186/1471-2105-11-94.
3
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.edgeR:一个用于数字基因表达数据差异表达分析的 Bioconductor 包。
Genome Biol. 2025 May 22;26(1):139. doi: 10.1186/s13059-025-03609-3.
4
Heterogeneity-preserving discriminative feature selection for disease-specific subtype discovery.用于疾病特异性亚型发现的保持异质性的判别特征选择
Nat Commun. 2025 Apr 16;16(1):3593. doi: 10.1038/s41467-025-58718-1.
5
Resources Modulate Developmental Shifts but Not Infection Tolerance Upon Co-Infection in an Insect System.在昆虫系统中,资源调节发育转变,但不调节共感染时的感染耐受性。
Mol Ecol. 2025 Mar 20:e17726. doi: 10.1111/mec.17726.
6
Resiquimod induces a mixed Th1 and Th2 response via STAT1 and STAT3 signalling in chickens.瑞喹莫德通过STAT1和STAT3信号通路在鸡体内诱导混合的Th1和Th2反应。
Biochem Biophys Rep. 2025 Feb 4;41:101941. doi: 10.1016/j.bbrep.2025.101941. eCollection 2025 Mar.
7
Tau mediates the reshaping of the transcriptional landscape toward intermediate Alzheimer's disease stages.Tau介导转录景观重塑,促使向阿尔茨海默病中期发展。
Front Cell Dev Biol. 2025 Jan 3;12:1459573. doi: 10.3389/fcell.2024.1459573. eCollection 2024.
8
Two H3K23 histone methyltransferases, SET-32 and SET-21, function synergistically to promote nuclear RNAi-mediated transgenerational epigenetic inheritance in Caenorhabditis elegans.两种H3K23组蛋白甲基转移酶SET-32和SET-21协同发挥作用,以促进秀丽隐杆线虫中由核RNA干扰介导的跨代表观遗传。
Genetics. 2025 Feb 5;229(2). doi: 10.1093/genetics/iyae206.
9
Two H3K23 histone methyltransferases, SET-32 and SET-21, function synergistically to promote nuclear RNAi-mediated transgenerational epigenetic inheritance in .两种H3K23组蛋白甲基转移酶SET-32和SET-21协同发挥作用,以促进核RNA干扰介导的跨代表观遗传。 (原文中未提及具体生物,此处翻译时补充完整句子使其逻辑更清晰)
bioRxiv. 2024 Nov 6:2024.11.05.622152. doi: 10.1101/2024.11.05.622152.
10
ML-GAP: machine learning-enhanced genomic analysis pipeline using autoencoders and data augmentation.ML-GAP:使用自动编码器和数据增强的机器学习增强基因组分析管道。
Front Genet. 2024 Sep 25;15:1442759. doi: 10.3389/fgene.2024.1442759. eCollection 2024.
Bioinformatics. 2010 Jan 1;26(1):139-40. doi: 10.1093/bioinformatics/btp616. Epub 2009 Nov 11.
4
DEGseq: an R package for identifying differentially expressed genes from RNA-seq data.DEGseq:一个用于从 RNA-seq 数据中识别差异表达基因的 R 包。
Bioinformatics. 2010 Jan 1;26(1):136-8. doi: 10.1093/bioinformatics/btp612. Epub 2009 Oct 24.
5
PatMaN: rapid alignment of short sequences to large databases.PatMaN:短序列与大型数据库的快速比对
Bioinformatics. 2008 Jul 1;24(13):1530-1. doi: 10.1093/bioinformatics/btn223. Epub 2008 May 8.
6
The impact of next-generation sequencing technology on genetics.下一代测序技术对遗传学的影响。
Trends Genet. 2008 Mar;24(3):133-41. doi: 10.1016/j.tig.2007.12.007. Epub 2008 Feb 11.
7
Next-generation sequencing transforms today's biology.新一代测序技术改变了当今的生物学。
Nat Methods. 2008 Jan;5(1):16-8. doi: 10.1038/nmeth1156. Epub 2007 Dec 19.
8
The Arabidopsis Information Resource (TAIR): gene structure and function annotation.拟南芥信息资源库(TAIR):基因结构与功能注释
Nucleic Acids Res. 2008 Jan;36(Database issue):D1009-14. doi: 10.1093/nar/gkm965. Epub 2007 Nov 5.
9
Moderated statistical tests for assessing differences in tag abundance.用于评估标签丰度差异的适度统计检验。
Bioinformatics. 2007 Nov 1;23(21):2881-7. doi: 10.1093/bioinformatics/btm453. Epub 2007 Sep 19.
10
Small-sample estimation of negative binomial dispersion, with applications to SAGE data.负二项分布离散度的小样本估计及其在SAGE数据中的应用
Biostatistics. 2008 Apr;9(2):321-32. doi: 10.1093/biostatistics/kxm030. Epub 2007 Aug 29.