• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于识别由环境信号诱导的基因表达差异模式的斯凯拉姆模型。

A skellam model to identify differential patterns of gene expression induced by environmental signals.

作者信息

Jiang Libo, Mao Ke, Wu Rongling

机构信息

Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China.

出版信息

BMC Genomics. 2014 Sep 8;15(1):772. doi: 10.1186/1471-2164-15-772.

DOI:10.1186/1471-2164-15-772
PMID:25199446
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4167515/
Abstract

BACKGROUND

RNA-seq, based on deep-sequencing techniques, has been widely employed to precisely measure levels of transcripts and their isoforms expressed under different conditions. However, robust statistical tools used to analyze these complex datasets are lacking. By grouping genes with similar expression profiles across treatments, cluster analysis provides insight into gene functions and networks that have become increasingly important.

RESULTS

We proposed and verified a cluster algorithm based on a skellam model for grouping genes into distinct groups based on the pattern of gene expression in response to changing conditions or in different tissues. This algorithm capitalizes on the skellam distribution to capture the count property of RNA-seq data and clusters genes in different environments. A two-stage hierarchical expectation-maximization (EM) algorithm was implemented to estimate the optimal number of groups and mean expression levels of each group across two environments. A procedure was formulated to test whether and how a given group shows a plastic response to environmental changes. The model was used to analyze an RNA-seq dataset measured from reciprocal crosses of early Arabidopsis thaliana embryos that respond differently based on the extent of maternal and paternal genome contributions, from which genes associated with maternal and paternal contributions were identified. Simulation studies were also performed to validate the statistical behavior of the model.

CONCLUSIONS

This model is a useful tool for clustering gene expression data by RNA-seq, thus facilitating our understanding of gene functions and networks.

摘要

背景

基于深度测序技术的RNA测序(RNA-seq)已被广泛用于精确测量在不同条件下表达的转录本及其异构体的水平。然而,用于分析这些复杂数据集的强大统计工具却很缺乏。通过对不同处理中具有相似表达谱的基因进行分组,聚类分析为了解基因功能和网络提供了见解,而这些功能和网络正变得越来越重要。

结果

我们提出并验证了一种基于斯凯拉姆模型的聚类算法,该算法可根据基因在应对变化条件或不同组织时的表达模式将基因分组为不同的类别。该算法利用斯凯拉姆分布来捕捉RNA-seq数据的计数特性,并在不同环境中对基因进行聚类。实施了两阶段分层期望最大化(EM)算法,以估计两个环境中最优的组数和每组的平均表达水平。制定了一个程序来测试给定的组是否以及如何对环境变化表现出可塑性反应。该模型用于分析从拟南芥早期胚胎的正反交中测得的RNA-seq数据集,这些胚胎根据母本和父本基因组贡献的程度表现出不同的反应,从中鉴定出与母本和父本贡献相关的基因。还进行了模拟研究以验证该模型的统计行为。

结论

该模型是通过RNA-seq对基因表达数据进行聚类的有用工具,从而有助于我们理解基因功能和网络。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/4167515/8360b41a5877/12864_2014_6446_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/4167515/88faea701a34/12864_2014_6446_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/4167515/867a8bbf908e/12864_2014_6446_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/4167515/eb8afcffe5a1/12864_2014_6446_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/4167515/2a53d6c05c63/12864_2014_6446_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/4167515/8360b41a5877/12864_2014_6446_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/4167515/88faea701a34/12864_2014_6446_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/4167515/867a8bbf908e/12864_2014_6446_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/4167515/eb8afcffe5a1/12864_2014_6446_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/4167515/2a53d6c05c63/12864_2014_6446_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aaf/4167515/8360b41a5877/12864_2014_6446_Fig5_HTML.jpg

相似文献

1
A skellam model to identify differential patterns of gene expression induced by environmental signals.一种用于识别由环境信号诱导的基因表达差异模式的斯凯拉姆模型。
BMC Genomics. 2014 Sep 8;15(1):772. doi: 10.1186/1471-2164-15-772.
2
A bi-Poisson model for clustering gene expression profiles by RNA-seq.一种用于通过RNA测序对基因表达谱进行聚类的双泊松模型。
Brief Bioinform. 2014 Jul;15(4):534-41. doi: 10.1093/bib/bbt029.
3
Model-based clustering for RNA-seq data.基于模型的 RNA-seq 数据聚类。
Bioinformatics. 2014 Jan 15;30(2):197-205. doi: 10.1093/bioinformatics/btt632. Epub 2013 Nov 4.
4
A multi-Poisson dynamic mixture model to cluster developmental patterns of gene expression by RNA-seq.一种用于通过RNA测序对基因表达发育模式进行聚类的多泊松动态混合模型。
Brief Bioinform. 2015 Mar;16(2):205-15. doi: 10.1093/bib/bbu013. Epub 2014 May 10.
5
EPIG-Seq: extracting patterns and identifying co-expressed genes from RNA-Seq data.EPIG-Seq:从RNA测序数据中提取模式并识别共表达基因。
BMC Genomics. 2016 Mar 22;17:255. doi: 10.1186/s12864-016-2584-7.
6
Differential expression analysis of RNA sequencing data by incorporating non-exonic mapped reads.通过纳入非外显子映射读数对RNA测序数据进行差异表达分析。
BMC Genomics. 2015;16 Suppl 7(Suppl 7):S14. doi: 10.1186/1471-2164-16-S7-S14. Epub 2015 Jun 11.
7
Gene expression atlas of embryo development in Arabidopsis.拟南芥胚胎发育的基因表达图谱。
Plant Reprod. 2019 Mar;32(1):93-104. doi: 10.1007/s00497-019-00364-x. Epub 2019 Feb 14.
8
A Protocol for Epigenetic Imprinting Analysis with RNA-Seq Data.一种利用RNA测序数据进行表观遗传印记分析的方案。
Methods Mol Biol. 2018;1751:199-208. doi: 10.1007/978-1-4939-7710-9_14.
9
Global scale transcriptome analysis of Arabidopsis embryogenesis in vitro.拟南芥体外胚胎发生的全球规模转录组分析。
BMC Genomics. 2015 Apr 16;16(1):301. doi: 10.1186/s12864-015-1504-6.
10
Identification of imprinted genes subject to parent-of-origin specific expression in Arabidopsis thaliana seeds.鉴定拟南芥种子中具有亲本来源特异性表达的印迹基因。
BMC Plant Biol. 2011 Aug 12;11:113. doi: 10.1186/1471-2229-11-113.

引用本文的文献

1
Transcriptional differentiation driving Cucumis sativus-Botrytis cinerea interactions based on the Skellam model and Bayesian networks.基于Skellam模型和贝叶斯网络驱动黄瓜-灰霉病菌相互作用的转录分化
AMB Express. 2021 Oct 20;11(1):138. doi: 10.1186/s13568-021-01296-4.
2
SEGN: Inferring real-time gene networks mediating phenotypic plasticity.SEGN:推断介导表型可塑性的实时基因网络。
Comput Struct Biotechnol J. 2020 Sep 5;18:2510-2521. doi: 10.1016/j.csbj.2020.08.029. eCollection 2020.
3
A block mixture model to map eQTLs for gene clustering and networking.

本文引用的文献

1
A bi-Poisson model for clustering gene expression profiles by RNA-seq.一种用于通过RNA测序对基因表达谱进行聚类的双泊松模型。
Brief Bioinform. 2014 Jul;15(4):534-41. doi: 10.1093/bib/bbt029.
2
Transcriptome profiles link environmental variation and physiological response of Mytilus californianus between Pacific tides.转录组图谱揭示了加州贻贝在太平洋潮汐间的环境变化与生理反应之间的联系。
Funct Ecol. 2012 Feb 1;26(1):144-155. doi: 10.1111/j.1365-2435.2011.01924.x. Epub 2011 Oct 13.
3
Phenotypic plasticity of the Drosophila transcriptome.果蝇转录组的表型可塑性。
一种用于绘制基因聚类和网络的eQTL图谱的块混合模型。
Sci Rep. 2016 Feb 19;6:21193. doi: 10.1038/srep21193.
4
Analyzing allele specific RNA expression using mixture models.使用混合模型分析等位基因特异性RNA表达。
BMC Genomics. 2015 Aug 1;16(1):566. doi: 10.1186/s12864-015-1749-0.
5
Modeling Expression Plasticity of Genes that Differentiate Drug-sensitive from Drug-resistant Cells to Chemotherapeutic Treatment.建立区分对化疗敏感和耐药细胞的基因表达可塑性模型。
Curr Genomics. 2014 Oct;15(5):349-56. doi: 10.2174/138920291505141106102854.
PLoS Genet. 2012;8(3):e1002593. doi: 10.1371/journal.pgen.1002593. Epub 2012 Mar 29.
4
A comparison of statistical methods for detecting differentially expressed genes from RNA-seq data.基于 RNA-seq 数据的差异表达基因检测的统计学方法比较。
Am J Bot. 2012 Feb;99(2):248-56. doi: 10.3732/ajb.1100340. Epub 2012 Jan 20.
5
Maternal and paternal genomes contribute equally to the transcriptome of early plant embryos.母本和父本基因组对等贡献于早期植物胚胎的转录组。
Nature. 2012 Jan 22;482(7383):94-7. doi: 10.1038/nature10756.
6
Differential expression in RNA-seq: a matter of depth.RNA-seq 中的差异表达:深度的问题。
Genome Res. 2011 Dec;21(12):2213-23. doi: 10.1101/gr.124321.111. Epub 2011 Sep 8.
7
How to cluster gene expression dynamics in response to environmental signals.如何对环境信号作出响应的基因表达动力学进行聚类。
Brief Bioinform. 2012 Mar;13(2):162-74. doi: 10.1093/bib/bbr032. Epub 2011 Jul 10.
8
From RNA-seq reads to differential expression results.从 RNA-seq 读取到差异表达结果。
Genome Biol. 2010;11(12):220. doi: 10.1186/gb-2010-11-12-220. Epub 2010 Dec 22.
9
Differential expression analysis for sequence count data.差异表达分析序列计数数据。
Genome Biol. 2010;11(10):R106. doi: 10.1186/gb-2010-11-10-r106. Epub 2010 Oct 27.
10
A two-parameter generalized Poisson model to improve the analysis of RNA-seq data.一种用于改进RNA测序数据分析的双参数广义泊松模型。
Nucleic Acids Res. 2010 Sep;38(17):e170. doi: 10.1093/nar/gkq670. Epub 2010 Jul 29.