• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

FunPat:基于功能的RNA测序时间序列数据分析模式分析

FunPat: function-based pattern analysis on RNA-seq time series data.

作者信息

Sanavia Tiziana, Finotello Francesca, Di Camillo Barbara

出版信息

BMC Genomics. 2015;16(Suppl 6):S2. doi: 10.1186/1471-2164-16-S6-S2. Epub 2015 Jun 1.

DOI:10.1186/1471-2164-16-S6-S2
PMID:26046293
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4460925/
Abstract

BACKGROUND

Dynamic expression data, nowadays obtained using high-throughput RNA sequencing, are essential to monitor transient gene expression changes and to study the dynamics of their transcriptional activity in the cell or response to stimuli. Several methods for data selection, clustering and functional analysis are available; however, these steps are usually performed independently, without exploiting and integrating the information derived from each step of the analysis.

METHODS

Here we present FunPat, an R package for time series RNA sequencing data that integrates gene selection, clustering and functional annotation into a single framework. FunPat exploits functional annotations by performing for each functional term, e.g. a Gene Ontology term, an integrated selection-clustering analysis to select differentially expressed genes that share, besides annotation, a common dynamic expression profile.

RESULTS

FunPat performance was assessed on both simulated and real data. With respect to a stand-alone selection step, the integration of the clustering step is able to improve the recall without altering the false discovery rate. FunPat also shows high precision and recall in detecting the correct temporal expression patterns; in particular, the recall is significantly higher than hierarchical, k-means and a model-based clustering approach specifically designed for RNA sequencing data. Moreover, when biological replicates are missing, FunPat is able to provide reproducible lists of significant genes. The application to real time series expression data shows the ability of FunPat to select differentially expressed genes with high reproducibility, indirectly confirming high precision and recall in gene selection. Moreover, the expression patterns obtained as output allow an easy interpretation of the results.

CONCLUSIONS

A novel analysis pipeline was developed to search the main temporal patterns in classes of genes similarly annotated, improving the sensitivity of gene selection by integrating the statistical evidence of differential expression with the information on temporal profiles and the functional annotations. Significant genes are associated to both the most informative functional terms, avoiding redundancy of information, and the most representative temporal patterns, thus improving the readability of the results. FunPat package is provided in R/Bioconductor at link: http://sysbiobig.dei.unipd.it/?q=node/79.

摘要

背景

如今通过高通量RNA测序获得的动态表达数据对于监测瞬时基因表达变化以及研究其在细胞中的转录活性动态或对刺激的反应至关重要。有几种数据选择、聚类和功能分析的方法;然而,这些步骤通常是独立执行的,没有利用和整合分析每个步骤中获得的信息。

方法

在此我们展示了FunPat,一个用于时间序列RNA测序数据的R包,它将基因选择、聚类和功能注释整合到一个单一框架中。FunPat通过对每个功能术语(例如基因本体术语)执行综合选择 - 聚类分析来利用功能注释,以选择除注释外还共享共同动态表达谱的差异表达基因。

结果

在模拟数据和真实数据上评估了FunPat的性能。相对于单独的选择步骤,聚类步骤的整合能够在不改变错误发现率的情况下提高召回率。FunPat在检测正确的时间表达模式方面也显示出高精度和召回率;特别是,召回率显著高于层次聚类、k均值聚类以及专门为RNA测序数据设计的基于模型的聚类方法。此外,当缺少生物学重复时,FunPat能够提供可重复的显著基因列表。对实时序列表达数据的应用表明FunPat能够以高重现性选择差异表达基因,间接证实了基因选择中的高精度和召回率。此外,作为输出获得的表达模式便于结果的解释。

结论

开发了一种新颖的分析流程来搜索注释相似的基因类别中的主要时间模式,通过将差异表达的统计证据与时间谱信息和功能注释相结合来提高基因选择的敏感性。显著基因与最具信息性的功能术语以及最具代表性的时间模式相关联,从而避免了信息冗余,提高了结果的可读性。FunPat包可在R/Bioconductor中通过以下链接获取:http://sysbiobig.dei.unipd.it/?q=node/79 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/03eb5cbe3459/1471-2164-16-S6-S2-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/f7d65da8d8a2/1471-2164-16-S6-S2-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/fd75d84034a4/1471-2164-16-S6-S2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/53ac62e965f6/1471-2164-16-S6-S2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/33a19768dfea/1471-2164-16-S6-S2-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/f31e8be53204/1471-2164-16-S6-S2-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/1dc3eead876b/1471-2164-16-S6-S2-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/03eb5cbe3459/1471-2164-16-S6-S2-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/f7d65da8d8a2/1471-2164-16-S6-S2-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/fd75d84034a4/1471-2164-16-S6-S2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/53ac62e965f6/1471-2164-16-S6-S2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/33a19768dfea/1471-2164-16-S6-S2-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/f31e8be53204/1471-2164-16-S6-S2-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/1dc3eead876b/1471-2164-16-S6-S2-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/911c/4460925/03eb5cbe3459/1471-2164-16-S6-S2-7.jpg

相似文献

1
FunPat: function-based pattern analysis on RNA-seq time series data.FunPat:基于功能的RNA测序时间序列数据分析模式分析
BMC Genomics. 2015;16(Suppl 6):S2. doi: 10.1186/1471-2164-16-S6-S2. Epub 2015 Jun 1.
2
MILANO--custom annotation of microarray results using automatic literature searches.米兰——使用自动文献检索对微阵列结果进行定制注释。
BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12.
3
A multi-Poisson dynamic mixture model to cluster developmental patterns of gene expression by RNA-seq.一种用于通过RNA测序对基因表达发育模式进行聚类的多泊松动态混合模型。
Brief Bioinform. 2015 Mar;16(2):205-15. doi: 10.1093/bib/bbu013. Epub 2014 May 10.
4
GFINDer: Genome Function INtegrated Discoverer through dynamic annotation, statistical analysis, and mining.GFINDer:通过动态注释、统计分析和挖掘实现的基因组功能综合发现工具。
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W293-300. doi: 10.1093/nar/gkh432.
5
Search Engine for Antimicrobial Resistance: A Cloud Compatible Pipeline and Web Interface for Rapidly Detecting Antimicrobial Resistance Genes Directly from Sequence Data.抗菌药物耐药性搜索引擎:一种与云兼容的流程和网络界面,用于直接从序列数据中快速检测抗菌药物耐药基因。
PLoS One. 2015 Jul 21;10(7):e0133492. doi: 10.1371/journal.pone.0133492. eCollection 2015.
6
GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data.GOexpress:一个用于通过对基因表达数据进行监督学习来识别和可视化稳健基因本体特征的R/Bioconductor软件包。
BMC Bioinformatics. 2016 Mar 11;17:126. doi: 10.1186/s12859-016-0971-3.
7
Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA.转录器:一种用于注释组装读段和识别非编码RNA的自动化计算流程。
PLoS One. 2015 Nov 18;10(11):e0140268. doi: 10.1371/journal.pone.0140268. eCollection 2015.
8
Clustering short time series gene expression data.聚类短时间序列基因表达数据。
Bioinformatics. 2005 Jun;21 Suppl 1:i159-68. doi: 10.1093/bioinformatics/bti1022.
9
A model selection criterion for model-based clustering of annotated gene expression data.一种用于带注释基因表达数据的基于模型聚类的模型选择标准。
Stat Appl Genet Mol Biol. 2015 Nov;14(5):413-28. doi: 10.1515/sagmb-2014-0095.
10
clusterExperiment and RSEC: A Bioconductor package and framework for clustering of single-cell and other large gene expression datasets.clusterExperiment 和 RSEC:一个用于单细胞和其他大型基因表达数据集聚类的 Bioconductor 包和框架。
PLoS Comput Biol. 2018 Sep 4;14(9):e1006378. doi: 10.1371/journal.pcbi.1006378. eCollection 2018 Sep.

引用本文的文献

1
Temporal profiling of host transcriptome highlights time- and tissue-dependent Interferon pathway activation in NNV-infected European sea bass.宿主转录组的时间分析突显了在感染神经坏死病毒的欧洲海鲈中,干扰素途径的激活具有时间和组织依赖性。
Sci Rep. 2025 Jul 10;15(1):24941. doi: 10.1038/s41598-025-09705-5.
2
Learning perturbation-inducible cell states from observability analysis of transcriptome dynamics.从转录组动力学的可观察性分析中学习可干扰的细胞状态。
Nat Commun. 2023 May 31;14(1):3148. doi: 10.1038/s41467-023-37897-9.
3
Role of the Extracytoplasmic Function Sigma Factor SigE in the Stringent Response of Mycobacterium tuberculosis.

本文引用的文献

1
Measuring differential gene expression with RNA-seq: challenges and strategies for data analysis.利用RNA测序技术测量差异基因表达:数据分析面临的挑战与策略
Brief Funct Genomics. 2015 Mar;14(2):130-42. doi: 10.1093/bfgp/elu035. Epub 2014 Sep 18.
2
Next maSigPro: updating maSigPro bioconductor package for RNA-seq time series.Next maSigPro:更新用于RNA测序时间序列的maSigPro生物导体包。
Bioinformatics. 2014 Sep 15;30(18):2598-602. doi: 10.1093/bioinformatics/btu333. Epub 2014 Jun 3.
3
The analytical landscape of static and temporal dynamics in transcriptome data.
胞质外功能σ因子SigE在结核分枝杆菌严谨反应中的作用
Microbiol Spectr. 2023 Mar 22;11(2):e0294422. doi: 10.1128/spectrum.02944-22.
4
The Pancreatic ß-cell Response to Secretory Demands and Adaption to Stress.胰腺β细胞对分泌需求的反应和对应激的适应。
Endocrinology. 2021 Nov 1;162(11). doi: 10.1210/endocr/bqab173.
5
Temporal Transcriptome Analysis Reveals Dynamic Gene Expression Patterns Driving β-Cell Maturation.时间转录组分析揭示驱动β细胞成熟的动态基因表达模式。
Front Cell Dev Biol. 2021 May 4;9:648791. doi: 10.3389/fcell.2021.648791. eCollection 2021.
6
Temporal Dynamic Methods for Bulk RNA-Seq Time Series Data.批量 RNA-Seq 时间序列数据的时间动态方法。
Genes (Basel). 2021 Feb 27;12(3):352. doi: 10.3390/genes12030352.
7
State-Transition Analysis of Time-Sequential Gene Expression Identifies Critical Points That Predict Development of Acute Myeloid Leukemia.基于时间序列的基因表达状态转换分析鉴定出预测急性髓系白血病发生的关键节点。
Cancer Res. 2020 Aug 1;80(15):3157-3169. doi: 10.1158/0008-5472.CAN-20-0354. Epub 2020 May 15.
8
Improving culture medium by a comparative transcriptome method.通过比较转录组方法改进培养基
J Vet Sci. 2020 Mar;21(2):e30. doi: 10.4142/jvs.2020.21.e30.
9
Trajectory-based differential expression analysis for single-cell sequencing data.基于轨迹的单细胞测序数据分析。
Nat Commun. 2020 Mar 5;11(1):1201. doi: 10.1038/s41467-020-14766-3.
10
TimeMeter assesses temporal gene expression similarity and identifies differentially progressing genes.TimeMeter 评估基因表达的时间相似性,并识别差异表达基因。
Nucleic Acids Res. 2020 May 21;48(9):e51. doi: 10.1093/nar/gkaa142.
转录组数据中静态和时间动态的分析格局。
Front Genet. 2014 Feb 20;5:35. doi: 10.3389/fgene.2014.00035. eCollection 2014.
4
Reducing bias in RNA sequencing data: a novel approach to compute counts.降低 RNA 测序数据中的偏差:一种计算计数的新方法。
BMC Bioinformatics. 2014;15 Suppl 1(Suppl 1):S7. doi: 10.1186/1471-2105-15-S1-S7. Epub 2014 Jan 10.
5
SeqGSEA: a Bioconductor package for gene set enrichment analysis of RNA-Seq data integrating differential expression and splicing.SeqGSEA:一个用于 RNA-Seq 数据基因集富集分析的 Bioconductor 软件包,集成了差异表达和剪接分析。
Bioinformatics. 2014 Jun 15;30(12):1777-9. doi: 10.1093/bioinformatics/btu090. Epub 2014 Feb 17.
6
Model-based clustering for RNA-seq data.基于模型的 RNA-seq 数据聚类。
Bioinformatics. 2014 Jan 15;30(2):197-205. doi: 10.1093/bioinformatics/btt632. Epub 2013 Nov 4.
7
High-resolution temporal response patterns to influenza vaccine reveal a distinct human plasma cell gene signature.高时间分辨率的流感疫苗反应模式揭示了独特的人类浆细胞基因特征。
Sci Rep. 2013;3:2327. doi: 10.1038/srep02327.
8
More powerful significant testing for time course gene expression data using functional principal component analysis approaches.基于功能主成分分析方法的时间序列基因表达数据更强大的显著检验。
BMC Bioinformatics. 2013 Jan 16;14:6. doi: 10.1186/1471-2105-14-6.
9
Dynamic chromatin remodeling mediated by polycomb proteins orchestrates pancreatic differentiation of human embryonic stem cells.多梳蛋白介导的动态染色质重塑协调人类胚胎干细胞的胰腺分化。
Cell Stem Cell. 2013 Feb 7;12(2):224-37. doi: 10.1016/j.stem.2012.11.023. Epub 2013 Jan 11.
10
A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis.Illumina 高通量 RNA 测序数据分析中标准化方法的综合评估。
Brief Bioinform. 2013 Nov;14(6):671-83. doi: 10.1093/bib/bbs046. Epub 2012 Sep 17.