• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于模型的基因聚类算法在 RNA-seq 数据中的差异表达分析。

Differential expression analysis using a model-based gene clustering algorithm for RNA-seq data.

机构信息

Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo, 113-8657, Japan.

Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo, 113-8657, Japan.

出版信息

BMC Bioinformatics. 2021 Oct 20;22(1):511. doi: 10.1186/s12859-021-04438-4.

DOI:10.1186/s12859-021-04438-4
PMID:34670485
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8527798/
Abstract

BACKGROUND

RNA-seq is a tool for measuring gene expression and is commonly used to identify differentially expressed genes (DEGs). Gene clustering is used to classify DEGs with similar expression patterns for the subsequent analyses of data from experiments such as time-courses or multi-group comparisons. However, gene clustering has rarely been used for analyzing simple two-group data or differential expression (DE). In this study, we report that a model-based clustering algorithm implemented in an R package, MBCluster.Seq, can also be used for DE analysis.

RESULTS

The input data originally used by MBCluster.Seq is DEGs, and the proposed method (called MBCdeg) uses all genes for the analysis. The method uses posterior probabilities of genes assigned to a cluster displaying non-DEG pattern for overall gene ranking. We compared the performance of MBCdeg with conventional R packages such as edgeR, DESeq2, and TCC that are specialized for DE analysis using simulated and real data. Our results showed that MBCdeg outperformed other methods when the proportion of DEG (P) was less than 50%. However, the DEG identification using MBCdeg was less consistent than with conventional methods. We compared the effects of different normalization algorithms using MBCdeg, and performed an analysis using MBCdeg in combination with a robust normalization algorithm (called DEGES) that was not implemented in MBCluster.Seq. The new analysis method showed greater stability than using the original MBCdeg with the default normalization algorithm.

CONCLUSIONS

MBCdeg with DEGES normalization can be used in the identification of DEGs when the P is relatively low. As the method is based on gene clustering, the DE result includes information on which expression pattern the gene belongs to. The new method may be useful for the analysis of time-course and multi-group data, where the classification of expression patterns is often required.

摘要

背景

RNA-seq 是一种测量基因表达的工具,常用于鉴定差异表达基因(DEGs)。基因聚类用于对具有相似表达模式的 DEGs 进行分类,以便对实验数据(如时间序列或多组比较)进行后续分析。然而,基因聚类很少用于分析简单的两组数据或差异表达(DE)。在本研究中,我们报告了一种基于模型的聚类算法,该算法在 R 包 MBCluster.Seq 中实现,也可用于 DE 分析。

结果

MBCluster.Seq 最初输入的数据是 DEGs,而我们提出的方法(称为 MBCdeg)则使用所有基因进行分析。该方法使用基因被分配到显示非 DEG 模式的聚类的后验概率对所有基因进行排名。我们使用模拟和真实数据,比较了 MBCdeg 与专门用于 DE 分析的常规 R 包 edgeR、DESeq2 和 TCC 的性能。我们的结果表明,当 DEG 的比例(P)小于 50%时,MBCdeg 的性能优于其他方法。然而,MBCdeg 的 DEG 鉴定结果不如常规方法一致。我们比较了使用 MBCdeg 的不同归一化算法的效果,并使用未在 MBCluster.Seq 中实现的稳健归一化算法(称为 DEGES)与 MBCdeg 联合进行了分析。新的分析方法比使用原始 MBCdeg 和默认归一化算法更稳定。

结论

当 P 相对较低时,使用 DEGES 归一化的 MBCdeg 可用于 DEGs 的鉴定。由于该方法基于基因聚类,DE 结果包含基因所属的表达模式信息。该新方法可能对时间序列和多组数据的分析有用,其中通常需要对表达模式进行分类。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/8527798/7d911b355417/12859_2021_4438_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/8527798/eb5786a6201f/12859_2021_4438_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/8527798/4494d97f16ff/12859_2021_4438_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/8527798/86aa78df53e2/12859_2021_4438_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/8527798/7d911b355417/12859_2021_4438_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/8527798/eb5786a6201f/12859_2021_4438_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/8527798/4494d97f16ff/12859_2021_4438_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/8527798/86aa78df53e2/12859_2021_4438_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/8527798/7d911b355417/12859_2021_4438_Fig4_HTML.jpg

相似文献

1
Differential expression analysis using a model-based gene clustering algorithm for RNA-seq data.基于模型的基因聚类算法在 RNA-seq 数据中的差异表达分析。
BMC Bioinformatics. 2021 Oct 20;22(1):511. doi: 10.1186/s12859-021-04438-4.
2
Enhanced clustering-based differential expression analysis method for RNA-seq data.用于RNA测序数据的基于增强聚类的差异表达分析方法
MethodsX. 2023 Dec 12;12:102518. doi: 10.1016/j.mex.2023.102518. eCollection 2024 Jun.
3
TCC: an R package for comparing tag count data with robust normalization strategies.TCC:一个用于比较标签计数数据的 R 包,带有稳健的归一化策略。
BMC Bioinformatics. 2013 Jul 9;14:219. doi: 10.1186/1471-2105-14-219.
4
Evaluation of methods for differential expression analysis on multi-group RNA-seq count data.多组RNA测序计数数据差异表达分析方法的评估
BMC Bioinformatics. 2015 Nov 4;16:361. doi: 10.1186/s12859-015-0794-7.
5
Identification of gene signatures from RNA-seq data using Pareto-optimal cluster algorithm.使用帕累托最优聚类算法从RNA测序数据中识别基因特征。
BMC Syst Biol. 2018 Dec 21;12(Suppl 8):126. doi: 10.1186/s12918-018-0650-2.
6
bestDEG: a web-based application automatically combines various tools to precisely predict differentially expressed genes (DEGs) from RNA-Seq data.bestDEG:一个基于网络的应用程序,可以自动结合各种工具,从 RNA-Seq 数据中准确预测差异表达基因(DEGs)。
PeerJ. 2022 Nov 10;10:e14344. doi: 10.7717/peerj.14344. eCollection 2022.
7
scMEB: a fast and clustering-independent method for detecting differentially expressed genes in single-cell RNA-seq data.scMEB:一种在单细胞 RNA-seq 数据中检测差异表达基因的快速且与聚类无关的方法。
BMC Genomics. 2023 May 25;24(1):280. doi: 10.1186/s12864-023-09374-6.
8
DeClUt: Decluttering differentially expressed genes through clustering of their expression profiles.DeClUt:通过聚类表达谱对差异表达基因进行去冗余。
Comput Methods Programs Biomed. 2024 Sep;254:108258. doi: 10.1016/j.cmpb.2024.108258. Epub 2024 May 31.
9
Robust identification of differentially expressed genes from RNA-seq data.从 RNA-seq 数据中稳健地识别差异表达基因。
Genomics. 2020 Mar;112(2):2000-2010. doi: 10.1016/j.ygeno.2019.11.012. Epub 2019 Nov 20.
10
Model-based clustering for RNA-seq data.基于模型的 RNA-seq 数据聚类。
Bioinformatics. 2014 Jan 15;30(2):197-205. doi: 10.1093/bioinformatics/btt632. Epub 2013 Nov 4.

引用本文的文献

1
Role of Endoplasmic Reticulum Stress-Associated Genes in Septic Neonatal Foals.内质网应激相关基因在新生败血症马驹中的作用
Antioxidants (Basel). 2025 Aug 21;14(8):1024. doi: 10.3390/antiox14081024.
2
DiCE: differential centrality-ensemble analysis based on gene expression profiles and protein-protein interaction network.DiCE:基于基因表达谱和蛋白质-蛋白质相互作用网络的差异中心性-集成分析
Nucleic Acids Res. 2025 Jul 8;53(13). doi: 10.1093/nar/gkaf609.
3
MBCdeg4: A modified clustering-based method for identifying differentially expressed genes from RNA-seq data.

本文引用的文献

1
Commentary: A Systematic Evaluation of Single Cell RNA-Seq Analysis Pipelines.评论:单细胞RNA测序分析流程的系统评估
Front Genet. 2020 Sep 4;11:941. doi: 10.3389/fgene.2020.00941. eCollection 2020.
2
rmRNAseq: differential expression analysis for repeated-measures RNA-seq data.rmRNAseq:重复测量 RNA-seq 数据的差异表达分析。
Bioinformatics. 2020 Aug 15;36(16):4432-4439. doi: 10.1093/bioinformatics/btaa525.
3
Power analysis for RNA-Seq differential expression studies using generalized linear mixed effects models.
MBCdeg4:一种基于聚类的改进方法,用于从RNA测序数据中识别差异表达基因。
MethodsX. 2024 Dec 30;14:103149. doi: 10.1016/j.mex.2024.103149. eCollection 2025 Jun.
4
Normalization of RNA-Seq data using adaptive trimmed mean with multi-reference.使用自适应修剪均值和多参考对 RNA-Seq 数据进行标准化。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae241.
5
Enhanced clustering-based differential expression analysis method for RNA-seq data.用于RNA测序数据的基于增强聚类的差异表达分析方法
MethodsX. 2023 Dec 12;12:102518. doi: 10.1016/j.mex.2023.102518. eCollection 2024 Jun.
6
Transcriptomics for Clinical and Experimental Biology Research: Hang on a Seq.临床与实验生物学研究中的转录组学:紧跟测序技术发展
Adv Genet (Hoboken). 2023 Jan 17;4(2):2200024. doi: 10.1002/ggn2.202200024. eCollection 2023 Jun.
7
Molecular docking and proteomics reveals the synergistic antibacterial mechanism of theaflavin with β-lactam antibiotics against MRSA.分子对接和蛋白质组学揭示了茶黄素与β-内酰胺类抗生素联合抗耐甲氧西林金黄色葡萄球菌的协同抗菌机制。
Front Microbiol. 2022 Nov 14;13:993430. doi: 10.3389/fmicb.2022.993430. eCollection 2022.
基于广义线性混合效应模型的 RNA-Seq 差异表达研究的功效分析。
BMC Bioinformatics. 2020 May 19;21(1):198. doi: 10.1186/s12859-020-3541-7.
4
Gene regulatory network and its constituent transcription factors that control nitrogen-deficiency responses in rice.控制水稻缺氮反应的基因调控网络及其组成转录因子。
New Phytol. 2020 Sep;227(5):1434-1452. doi: 10.1111/nph.16627. Epub 2020 May 29.
5
A graph-based algorithm for RNA-seq data normalization.基于图的算法用于 RNA-seq 数据标准化。
PLoS One. 2020 Jan 24;15(1):e0227760. doi: 10.1371/journal.pone.0227760. eCollection 2020.
6
Identification of Shoot Differentiation-Related Genes in Oliv.油橄榄 shoot 分化相关基因的鉴定
Genes (Basel). 2019 Dec 11;10(12):1034. doi: 10.3390/genes10121034.
7
Model-based clustering of multi-tissue gene expression data.基于模型的多组织基因表达数据聚类。
Bioinformatics. 2020 Mar 1;36(6):1807-1813. doi: 10.1093/bioinformatics/btz805.
8
GENAVi: a shiny web application for gene expression normalization, analysis and visualization.GENAVi:一个用于基因表达归一化、分析和可视化的闪亮网络应用程序。
BMC Genomics. 2019 Oct 16;20(1):745. doi: 10.1186/s12864-019-6073-7.
9
A systematic evaluation of single cell RNA-seq analysis pipelines.单细胞 RNA 测序分析流程的系统评价。
Nat Commun. 2019 Oct 11;10(1):4667. doi: 10.1038/s41467-019-12266-7.
10
Accurate Classification of Differential Expression Patterns in a Bayesian Framework With Robust Normalization for Multi-Group RNA-Seq Count Data.在具有稳健归一化的贝叶斯框架中对多组RNA-Seq计数数据的差异表达模式进行准确分类
Bioinform Biol Insights. 2019 Jul 8;13:1177932219860817. doi: 10.1177/1177932219860817. eCollection 2019.