• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蓝藻如何给旧方法带来新问题:微阵列时间序列分析面临的挑战。

How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis.

机构信息

Institute for Theoretical Biology, Humboldt University Berlin, Invalidenstraße 43, D-10115 Berlin, Germany.

出版信息

BMC Bioinformatics. 2013 Apr 21;14:133. doi: 10.1186/1471-2105-14-133.

DOI:10.1186/1471-2105-14-133
PMID:23601192
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3679775/
Abstract

BACKGROUND

The transcriptomes of several cyanobacterial strains have been shown to exhibit diurnal oscillation patterns, reflecting the diurnal phototrophic lifestyle of the organisms. The analysis of such genome-wide transcriptional oscillations is often facilitated by the use of clustering algorithms in conjunction with a number of pre-processing steps. Biological interpretation is usually focussed on the time and phase of expression of the resulting groups of genes. However, the use of microarray technology in such studies requires the normalization of pre-processing data, with unclear impact on the qualitative and quantitative features of the derived information on the number of oscillating transcripts and their respective phases.

RESULTS

A microarray based evaluation of diurnal expression in the cyanobacterium Synechocystis sp. PCC 6803 is presented. As expected, the temporal expression patterns reveal strong oscillations in transcript abundance. We compare the Fourier transformation-based expression phase before and after the application of quantile normalization, median polishing, cyclical LOESS, and least oscillating set (LOS) normalization. Whereas LOS normalization mostly preserves the phases of the raw data, the remaining methods introduce systematic biases. In particular, quantile-normalization is found to introduce a phase-shift of 180°, effectively changing night-expressed genes into day-expressed ones. Comparison of a large number of clustering results of differently normalized data shows that the normalization method determines the result. Subsequent steps, such as the choice of data transformation, similarity measure, and clustering algorithm, only play minor roles. We find that the standardization and the DTF transformation are favorable for the clustering of time series in contrast to the 12 m transformation. We use the cluster-wise functional enrichment of a clustering derived by LOS normalization, clustering using flowClust, and DFT transformation to derive the diurnal biological program of Synechocystis sp..

CONCLUSION

Application of quantile normalization, median polishing, and also cyclic LOESS normalization of the presented cyanobacterial dataset lead to increased numbers of oscillating genes and the systematic shift of the expression phase. The LOS normalization minimizes the observed detrimental effects. As previous analyses employed a variety of different normalization methods, a direct comparison of results must be treated with caution.

摘要

背景

已经证明,几种蓝藻菌株的转录组表现出昼夜振荡模式,反映了生物体的昼夜光合作用生活方式。通过使用聚类算法结合许多预处理步骤,通常可以分析这种全基因组转录振荡。生物解释通常集中在产生的基因组的表达时间和相位上。然而,在这些研究中使用微阵列技术需要对预处理数据进行标准化,这对衍生信息中振荡转录本的数量及其各自相位的定性和定量特征的影响尚不清楚。

结果

呈现了基于微阵列的蓝藻集胞藻 PCC 6803 昼夜表达的评估。正如预期的那样,时间表达模式显示转录物丰度的强烈振荡。我们比较了应用分位数归一化、中位数抛光、循环 LOESS 和最少振荡集 (LOS) 归一化前后基于傅里叶变换的表达相位。虽然 LOS 归一化主要保留了原始数据的相位,但其余方法会引入系统偏差。特别是,分位数归一化被发现会引入 180°的相移,有效地将夜间表达的基因转变为白天表达的基因。比较大量不同归一化数据的聚类结果表明,归一化方法决定了结果。随后的步骤,如数据转换、相似性度量和聚类算法的选择,只起次要作用。我们发现与 12 m 转换相比,标准化和 DTF 转换有利于时间序列的聚类。我们使用 LOS 归一化、使用 flowClust 的聚类和 DFT 转换的聚类中每类的功能富集,推导出集胞藻昼夜生物程序。

结论

对所提供的蓝藻数据集应用分位数归一化、中位数抛光和循环 LOESS 归一化会导致振荡基因数量的增加和表达相位的系统移位。LOS 归一化将观察到的不利影响降到最低。由于以前的分析使用了多种不同的归一化方法,因此必须谨慎对待结果的直接比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ca7/3679775/06cc5a6f6905/1471-2105-14-133-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ca7/3679775/ba18ac6c0022/1471-2105-14-133-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ca7/3679775/68cc2a653088/1471-2105-14-133-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ca7/3679775/94e2c5d87896/1471-2105-14-133-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ca7/3679775/f9f87beaafab/1471-2105-14-133-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ca7/3679775/06cc5a6f6905/1471-2105-14-133-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ca7/3679775/ba18ac6c0022/1471-2105-14-133-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ca7/3679775/68cc2a653088/1471-2105-14-133-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ca7/3679775/94e2c5d87896/1471-2105-14-133-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ca7/3679775/f9f87beaafab/1471-2105-14-133-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ca7/3679775/06cc5a6f6905/1471-2105-14-133-5.jpg

相似文献

1
How cyanobacteria pose new problems to old methods: challenges in microarray time series analysis.蓝藻如何给旧方法带来新问题:微阵列时间序列分析面临的挑战。
BMC Bioinformatics. 2013 Apr 21;14:133. doi: 10.1186/1471-2105-14-133.
2
Effect of data normalization on fuzzy clustering of DNA microarray data.数据归一化对DNA微阵列数据模糊聚类的影响。
BMC Bioinformatics. 2006 Mar 14;7:134. doi: 10.1186/1471-2105-7-134.
3
Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering.微阵列分类发现中的挑战:归一化、基因选择和聚类的综合考察。
BMC Bioinformatics. 2010 Oct 11;11:503. doi: 10.1186/1471-2105-11-503.
4
Diurnal Regulation of Cellular Processes in the Cyanobacterium Synechocystis sp. Strain PCC 6803: Insights from Transcriptomic, Fluxomic, and Physiological Analyses.集胞藻6803菌株中细胞过程的昼夜调节:转录组学、通量组学和生理学分析的见解
mBio. 2016 May 3;7(3):e00464-16. doi: 10.1128/mBio.00464-16.
5
Impact of DNA microarray data transformation on gene expression analysis - comparison of two normalization methods.DNA微阵列数据转换对基因表达分析的影响——两种标准化方法的比较
Acta Biochim Pol. 2011;58(4):573-80. Epub 2011 Dec 20.
6
Effect of various normalization methods on Applied Biosystems expression array system data.各种标准化方法对Applied Biosystems表达阵列系统数据的影响。
BMC Bioinformatics. 2006 Dec 15;7:533. doi: 10.1186/1471-2105-7-533.
7
Identification and modeling of genes with diurnal oscillations from microarray time series data.从微阵列时间序列数据中识别和建模具有昼夜节律波动的基因。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Jan-Mar;8(1):108-21. doi: 10.1109/TCBB.2009.37.
8
Faster cyclic loess: normalizing RNA arrays via linear models.更快的循环黄土算法:通过线性模型对RNA阵列进行归一化
Bioinformatics. 2004 Nov 1;20(16):2778-86. doi: 10.1093/bioinformatics/bth327. Epub 2004 May 27.
9
Using generalized procrustes analysis (GPA) for normalization of cDNA microarray data.使用广义普罗克汝斯分析(GPA)对cDNA微阵列数据进行标准化处理。
BMC Bioinformatics. 2008 Jan 16;9:25. doi: 10.1186/1471-2105-9-25.
10
Multiple gene expression profile alignment for microarray time-series data clustering.微阵列时间序列数据聚类的多个基因表达谱比对。
Bioinformatics. 2010 Sep 15;26(18):2281-8. doi: 10.1093/bioinformatics/btq422. Epub 2010 Jul 16.

引用本文的文献

1
Improved RNA stability estimation indicates that transcriptional interference is frequent in diverse bacteria.改进的 RNA 稳定性估计表明,转录干扰在多种细菌中很常见。
Commun Biol. 2023 Jul 15;6(1):732. doi: 10.1038/s42003-023-05097-2.
2
Manipulation of topoisomerase expression inhibits cell division but not growth and reveals a distinctive promoter structure in Synechocystis.拓扑异构酶表达的调控抑制细胞分裂但不影响生长,并揭示了集胞藻中独特的启动子结构。
Nucleic Acids Res. 2022 Dec 9;50(22):12790-12808. doi: 10.1093/nar/gkac1132.
3
Model-Based Clustering With Data Correction For Removing Artifacts In Gene Expression Data.

本文引用的文献

1
Revisiting global gene expression analysis.重新审视全球基因表达分析。
Cell. 2012 Oct 26;151(3):476-82. doi: 10.1016/j.cell.2012.10.012.
2
The yin and yang of yeast transcription: elements of a global feedback system between metabolism and chromatin.酵母转录的阴阳两面:代谢和染色质之间的全球反馈系统的要素。
PLoS One. 2012;7(6):e37906. doi: 10.1371/journal.pone.0037906. Epub 2012 Jun 7.
3
A high resolution map of a cyanobacterial transcriptome.一张蓝藻转录组的高分辨率图谱。
基于模型的聚类与数据校正以去除基因表达数据中的伪迹
Ann Appl Stat. 2016 Feb;11(4):1998-2026. doi: 10.1214/17-AOAS1051. Epub 2017 Dec 28.
4
Similarity-Based Segmentation of Multi-Dimensional Signals.基于相似性的多维信号分割
Sci Rep. 2017 Sep 27;7(1):12355. doi: 10.1038/s41598-017-12401-8.
5
Minimal tool set for a prokaryotic circadian clock.原核生物钟的最小工具集。
BMC Evol Biol. 2017 Jul 21;17(1):169. doi: 10.1186/s12862-017-0999-7.
6
Cellular trade-offs and optimal resource allocation during cyanobacterial diurnal growth.蓝藻昼夜生长过程中的细胞权衡与最优资源分配
Proc Natl Acad Sci U S A. 2017 Aug 1;114(31):E6457-E6465. doi: 10.1073/pnas.1617508114. Epub 2017 Jul 18.
7
Time resolved DNA occupancy dynamics during the respiratory oscillation uncover a global reset point in the yeast growth program.呼吸振荡过程中时间分辨的DNA占据动态揭示了酵母生长程序中的一个全局重置点。
Microb Cell. 2014 Sep 1;1(9):279-288. doi: 10.15698/mic2014.09.166.
8
Toward Multiscale Models of Cyanobacterial Growth: A Modular Approach.迈向蓝藻生长的多尺度模型:一种模块化方法。
Front Bioeng Biotechnol. 2016 Dec 26;4:95. doi: 10.3389/fbioe.2016.00095. eCollection 2016.
9
Elucidating temporal resource allocation and diurnal dynamics in phototrophic metabolism using conditional FBA.使用条件通量平衡分析阐明光合营养代谢中的时间资源分配和昼夜动态。
Sci Rep. 2015 Oct 26;5:15247. doi: 10.1038/srep15247.
10
Determination of Temporal Order among the Components of an Oscillatory System.振荡系统各组成部分之间时间顺序的确定。
PLoS One. 2015 Jul 7;10(7):e0124842. doi: 10.1371/journal.pone.0124842. eCollection 2015.
Genome Biol. 2011;12(5):R47. doi: 10.1186/gb-2011-12-5-r47. Epub 2011 May 25.
4
Hierarchical signature clustering for time series microarray data.层次签名聚类分析用于时间序列基因芯片数据。
Adv Exp Med Biol. 2011;696:57-65. doi: 10.1007/978-1-4419-7046-6_6.
5
A day in the life of microcystis aeruginosa strain PCC 7806 as revealed by a transcriptomic analysis.微囊藻 PCC 7806 菌株的转录组分析揭示了其生活的一天。
PLoS One. 2011 Jan 19;6(1):e16208. doi: 10.1371/journal.pone.0016208.
6
Algorithm-driven artifacts in median polish summarization of microarray data.算法驱动的中值平滑微阵列数据汇总中的伪影。
BMC Bioinformatics. 2010 Nov 11;11:553. doi: 10.1186/1471-2105-11-553.
7
Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering.微阵列分类发现中的挑战:归一化、基因选择和聚类的综合考察。
BMC Bioinformatics. 2010 Oct 11;11:503. doi: 10.1186/1471-2105-11-503.
8
G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration.G 栈调制探针在表达谱芯片上的强度 - 序列修正和信号校准。
BMC Bioinformatics. 2010 Apr 27;11:207. doi: 10.1186/1471-2105-11-207.
9
Oscillations in supercoiling drive circadian gene expression in cyanobacteria.超螺旋的波动驱动蓝细菌的生物钟基因表达。
Proc Natl Acad Sci U S A. 2009 Dec 29;106(52):22564-8. doi: 10.1073/pnas.0912673106. Epub 2009 Dec 14.
10
CyanoBase: the cyanobacteria genome database update 2010.蓝藻基因组数据库更新 2010 版(CyanoBase:the cyanobacteria genome database update 2010)
Nucleic Acids Res. 2010 Jan;38(Database issue):D379-81. doi: 10.1093/nar/gkp915. Epub 2009 Oct 30.