• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

公开:使用功能注释和假定转录因子结合位点对通过一系列转录组数据获得的簇进行剖析。

DISCLOSE : DISsection of CLusters Obtained by SEries of transcriptome data using functional annotations and putative transcription factor binding sites.

作者信息

Blom Evert-Jan, van Hijum Sacha A F T, Hofstede Klaas J, Silvis Remko, Roerdink Jos B T M, Kuipers Oscar P

机构信息

Molecular Genetics, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, The Netherlands.

出版信息

BMC Bioinformatics. 2008 Dec 16;9:535. doi: 10.1186/1471-2105-9-535.

DOI:10.1186/1471-2105-9-535
PMID:19087282
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2661003/
Abstract

BACKGROUND

A typical step in the analysis of gene expression data is the determination of clusters of genes that exhibit similar expression patterns. Researchers are confronted with the seemingly arbitrary choice between numerous algorithms to perform cluster analysis.

RESULTS

We developed an exploratory application that benchmarks the results of clustering methods using functional annotations. In addition, a de novo DNA motif discovery algorithm is integrated in our program which identifies overrepresented DNA binding sites in the upstream DNA sequences of genes from the clusters that are indicative of sites of transcriptional control. The performance of our program was evaluated by comparing the original results of a time course experiment with the findings of our application.

CONCLUSION

DISCLOSE assists researchers in the prokaryotic research community in systematically evaluating results of the application of a range of clustering algorithms to transcriptome data. Different performance measures allow to quickly and comprehensively determine the best suited clustering approach for a given dataset.

摘要

背景

基因表达数据分析中的一个典型步骤是确定呈现相似表达模式的基因簇。研究人员在众多用于进行聚类分析的算法之间面临看似随意的选择。

结果

我们开发了一个探索性应用程序,该程序使用功能注释对聚类方法的结果进行基准测试。此外,我们的程序集成了一种从头DNA基序发现算法,该算法可识别来自聚类的基因上游DNA序列中过度表达的DNA结合位点,这些位点指示转录控制位点。通过将时间进程实验的原始结果与我们应用程序的发现进行比较,对我们程序的性能进行了评估。

结论

DISCLOSE有助于原核生物研究领域的研究人员系统地评估一系列聚类算法应用于转录组数据的结果。不同的性能指标有助于快速、全面地确定给定数据集最适合的聚类方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b3/2661003/507915fb5b6c/1471-2105-9-535-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b3/2661003/343548eeb90c/1471-2105-9-535-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b3/2661003/70f94e7387c6/1471-2105-9-535-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b3/2661003/af361c7b656a/1471-2105-9-535-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b3/2661003/507915fb5b6c/1471-2105-9-535-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b3/2661003/343548eeb90c/1471-2105-9-535-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b3/2661003/70f94e7387c6/1471-2105-9-535-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b3/2661003/af361c7b656a/1471-2105-9-535-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b3/2661003/507915fb5b6c/1471-2105-9-535-4.jpg

相似文献

1
DISCLOSE : DISsection of CLusters Obtained by SEries of transcriptome data using functional annotations and putative transcription factor binding sites.公开:使用功能注释和假定转录因子结合位点对通过一系列转录组数据获得的簇进行剖析。
BMC Bioinformatics. 2008 Dec 16;9:535. doi: 10.1186/1471-2105-9-535.
2
A mathematical and computational framework for quantitative comparison and integration of large-scale gene expression data.用于大规模基因表达数据定量比较与整合的数学和计算框架。
Nucleic Acids Res. 2005 May 10;33(8):2580-94. doi: 10.1093/nar/gki536. Print 2005.
3
TimesVector: a vectorized clustering approach to the analysis of time series transcriptome data from multiple phenotypes.TimesVector:一种用于分析来自多种表型的时间序列转录组数据的向量化聚类方法。
Bioinformatics. 2017 Dec 1;33(23):3827-3835. doi: 10.1093/bioinformatics/btw780.
4
Clustering and re-clustering for pattern discovery in gene expression data.用于基因表达数据中模式发现的聚类和再聚类。
J Bioinform Comput Biol. 2005 Apr;3(2):281-301. doi: 10.1142/s0219720005001053.
5
Ab initio identification of putative human transcription factor binding sites by comparative genomics.通过比较基因组学从头鉴定假定的人类转录因子结合位点
BMC Bioinformatics. 2005 May 2;6:110. doi: 10.1186/1471-2105-6-110.
6
Combining sequence and time series expression data to learn transcriptional modules.结合序列和时间序列表达数据以学习转录模块。
IEEE/ACM Trans Comput Biol Bioinform. 2005 Jul-Sep;2(3):194-202. doi: 10.1109/TCBB.2005.34.
7
Detecting clusters of different geometrical shapes in microarray gene expression data.在微阵列基因表达数据中检测不同几何形状的聚类。
Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12.
8
Clustering of gene expression data: performance and similarity analysis.基因表达数据的聚类:性能与相似性分析
BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S19. doi: 10.1186/1471-2105-7-S4-S19.
9
Tandem machine learning for the identification of genes regulated by transcription factors.用于识别受转录因子调控基因的串联机器学习
BMC Bioinformatics. 2005 Aug 22;6:204. doi: 10.1186/1471-2105-6-204.
10
CUBIC: identification of regulatory binding sites through data clustering.CUBIC:通过数据聚类识别调控结合位点。
J Bioinform Comput Biol. 2003 Apr;1(1):21-40. doi: 10.1142/s0219720003000162.

引用本文的文献

1
Advances in human papillomavirus detection for cervical cancer screening and diagnosis: challenges of conventional methods and opportunities for emergent tools.用于宫颈癌筛查和诊断的人乳头瘤病毒检测进展:传统方法面临的挑战与新兴工具带来的机遇
Anal Methods. 2025 Feb 13;17(7):1428-1450. doi: 10.1039/d4ay01921k.
2
A Fast and Reliable Pipeline for Bacterial Transcriptome Analysis Case study: Serine-dependent Gene Regulation in Streptococcus pneumoniae.一种用于细菌转录组分析的快速可靠流程 案例研究:肺炎链球菌中丝氨酸依赖性基因调控
J Vis Exp. 2015 Apr 25(98):52649. doi: 10.3791/52649.
3
The transcriptional and gene regulatory network of Lactococcus lactis MG1363 during growth in milk.

本文引用的文献

1
Inferring biological functions and associated transcriptional regulators using gene set expression coherence analysis.使用基因集表达一致性分析推断生物学功能和相关转录调节因子。
BMC Bioinformatics. 2007 Nov 17;8:453. doi: 10.1186/1471-2105-8-453.
2
A novel ensemble learning method for de novo computational identification of DNA binding sites.一种用于从头计算识别DNA结合位点的新型集成学习方法。
BMC Bioinformatics. 2007 Jul 12;8:249. doi: 10.1186/1471-2105-8-249.
3
Analysis of temporal gene expression during Bacillus subtilis spore germination and outgrowth.
乳球菌 MG1363 在牛奶中生长过程中的转录和基因调控网络。
PLoS One. 2013;8(1):e53085. doi: 10.1371/journal.pone.0053085. Epub 2013 Jan 17.
4
EfaR is a major regulator of Enterococcus faecalis manganese transporters and influences processes involved in host colonization and infection.EfaR 是粪肠球菌锰转运蛋白的主要调控因子,影响宿主定植和感染相关的过程。
Infect Immun. 2013 Mar;81(3):935-44. doi: 10.1128/IAI.06377-11. Epub 2013 Jan 7.
5
Time-resolved transcriptomics and bioinformatic analyses reveal intrinsic stress responses during batch culture of Bacillus subtilis.时间分辨转录组学和生物信息学分析揭示了枯草芽孢杆菌分批培养过程中的固有应激反应。
PLoS One. 2011;6(11):e27160. doi: 10.1371/journal.pone.0027160. Epub 2011 Nov 8.
6
Using SCOPE to identify potential regulatory motifs in coregulated genes.使用SCOPE鉴定共调控基因中的潜在调控基序。
J Vis Exp. 2011 May 31(51):2703. doi: 10.3791/2703.
枯草芽孢杆菌孢子萌发和生长过程中的时间基因表达分析。
J Bacteriol. 2007 May;189(9):3624-34. doi: 10.1128/JB.01736-06. Epub 2007 Feb 23.
4
FIVA: Functional Information Viewer and Analyzer extracting biological knowledge from transcriptome data of prokaryotes.FIVA:从原核生物转录组数据中提取生物学知识的功能信息查看器和分析器。
Bioinformatics. 2007 May 1;23(9):1161-3. doi: 10.1093/bioinformatics/btl658. Epub 2007 Jan 19.
5
Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes.使用功能类别参考集评估基因表达数据聚类算法的方法。
BMC Bioinformatics. 2006 Aug 31;7:397. doi: 10.1186/1471-2105-7-397.
6
JProGO: a novel tool for the functional interpretation of prokaryotic microarray data using Gene Ontology information.JProGO:一种利用基因本体论信息对原核生物微阵列数据进行功能解读的新型工具。
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W510-5. doi: 10.1093/nar/gkl329.
7
RegulonDB (version 5.0): Escherichia coli K-12 transcriptional regulatory network, operon organization, and growth conditions.RegulonDB(版本5.0):大肠杆菌K-12转录调控网络、操纵子组织及生长条件
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D394-7. doi: 10.1093/nar/gkj156.
8
EXPANDER--an integrative program suite for microarray data analysis.EXPANDER——一个用于微阵列数据分析的综合程序套件。
BMC Bioinformatics. 2005 Sep 21;6:232. doi: 10.1186/1471-2105-6-232.
9
WebLogo: a sequence logo generator.WebLogo:一个序列图生成器。
Genome Res. 2004 Jun;14(6):1188-90. doi: 10.1101/gr.849004.
10
Selection of informative clusters from hierarchical cluster tree with gene classes.从带有基因类别的层次聚类树中选择信息性聚类
BMC Bioinformatics. 2004 Mar 25;5:32. doi: 10.1186/1471-2105-5-32.