• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过关联规则发现进行基因表达的综合分析。

Integrated analysis of gene expression by Association Rules Discovery.

作者信息

Carmona-Saez Pedro, Chagoyen Monica, Rodriguez Andres, Trelles Oswaldo, Carazo Jose M, Pascual-Montano Alberto

机构信息

BioComputing Unit, National Center for Biotechnology (CNB-CSIC), Cantoblanco, 28049, Madrid, Spain.

出版信息

BMC Bioinformatics. 2006 Feb 7;7:54. doi: 10.1186/1471-2105-7-54.

DOI:10.1186/1471-2105-7-54
PMID:16464256
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1386712/
Abstract

BACKGROUND

Microarray technology is generating huge amounts of data about the expression level of thousands of genes, or even whole genomes, across different experimental conditions. To extract biological knowledge, and to fully understand such datasets, it is essential to include external biological information about genes and gene products to the analysis of expression data. However, most of the current approaches to analyze microarray datasets are mainly focused on the analysis of experimental data, and external biological information is incorporated as a posterior process.

RESULTS

In this study we present a method for the integrative analysis of microarray data based on the Association Rules Discovery data mining technique. The approach integrates gene annotations and expression data to discover intrinsic associations among both data sources based on co-occurrence patterns. We applied the proposed methodology to the analysis of gene expression datasets in which genes were annotated with metabolic pathways, transcriptional regulators and Gene Ontology categories. Automatically extracted associations revealed significant relationships among these gene attributes and expression patterns, where many of them are clearly supported by recently reported work.

CONCLUSION

The integration of external biological information and gene expression data can provide insights about the biological processes associated to gene expression programs. In this paper we show that the proposed methodology is able to integrate multiple gene annotations and expression data in the same analytic framework and extract meaningful associations among heterogeneous sources of data. An implementation of the method is included in the Engene software package.

摘要

背景

微阵列技术正在生成大量关于数千个基因甚至整个基因组在不同实验条件下的表达水平的数据。为了提取生物学知识并充分理解此类数据集,在表达数据分析中纳入有关基因和基因产物的外部生物学信息至关重要。然而,当前大多数分析微阵列数据集的方法主要集中在实验数据的分析上,外部生物学信息是作为后续过程纳入的。

结果

在本研究中,我们提出了一种基于关联规则发现数据挖掘技术的微阵列数据综合分析方法。该方法整合基因注释和表达数据,以基于共现模式发现两个数据源之间的内在关联。我们将所提出的方法应用于基因表达数据集的分析,其中基因用代谢途径、转录调节因子和基因本体类别进行注释。自动提取的关联揭示了这些基因属性与表达模式之间的显著关系,其中许多关系得到了最近报道的工作的明确支持。

结论

外部生物学信息与基因表达数据的整合可以提供有关与基因表达程序相关的生物学过程的见解。在本文中,我们表明所提出的方法能够在同一分析框架中整合多个基因注释和表达数据,并从异质数据源中提取有意义的关联。该方法的一个实现包含在Engene软件包中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/51b96be42362/1471-2105-7-54-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/62d7232b3026/1471-2105-7-54-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/71cff79138f2/1471-2105-7-54-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/33372d7bcc95/1471-2105-7-54-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/e85050ee8a79/1471-2105-7-54-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/f1bf903c8601/1471-2105-7-54-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/51b96be42362/1471-2105-7-54-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/62d7232b3026/1471-2105-7-54-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/71cff79138f2/1471-2105-7-54-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/33372d7bcc95/1471-2105-7-54-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/e85050ee8a79/1471-2105-7-54-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/f1bf903c8601/1471-2105-7-54-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ced1/1386712/51b96be42362/1471-2105-7-54-6.jpg

相似文献

1
Integrated analysis of gene expression by Association Rules Discovery.通过关联规则发现进行基因表达的综合分析。
BMC Bioinformatics. 2006 Feb 7;7:54. doi: 10.1186/1471-2105-7-54.
2
Integration of GO annotations in Correspondence Analysis: facilitating the interpretation of microarray data.在对应分析中整合基因本体论注释:助力微阵列数据的解读
Bioinformatics. 2005 May 15;21(10):2424-9. doi: 10.1093/bioinformatics/bti367. Epub 2005 Mar 3.
3
Using pre-existing microarray datasets to increase experimental power: application to insulin resistance.利用已有微阵列数据集增加实验功效:以胰岛素抵抗为例。
PLoS Comput Biol. 2010 Mar 26;6(3):e1000718. doi: 10.1371/journal.pcbi.1000718.
4
Knowledge guided analysis of microarray data.基于知识的微阵列数据分析
J Biomed Inform. 2006 Aug;39(4):401-11. doi: 10.1016/j.jbi.2005.08.004. Epub 2005 Sep 15.
5
Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes.从微阵列数据中提取基因表达模式并识别共表达基因,可揭示生物响应过程。
BMC Bioinformatics. 2007 Nov 2;8:427. doi: 10.1186/1471-2105-8-427.
6
Attribute clustering for grouping, selection, and classification of gene expression data.用于基因表达数据分组、选择和分类的属性聚类
IEEE/ACM Trans Comput Biol Bioinform. 2005 Apr-Jun;2(2):83-101. doi: 10.1109/TCBB.2005.17.
7
ADGO: analysis of differentially expressed gene sets using composite GO annotation.ADGO:使用复合基因本体注释分析差异表达基因集
Bioinformatics. 2006 Sep 15;22(18):2249-53. doi: 10.1093/bioinformatics/btl378. Epub 2006 Jul 12.
8
Biclustering of gene expression data by Non-smooth Non-negative Matrix Factorization.基于非光滑非负矩阵分解的基因表达数据双聚类分析
BMC Bioinformatics. 2006 Feb 17;7:78. doi: 10.1186/1471-2105-7-78.
9
Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes.使用功能类别参考集评估基因表达数据聚类算法的方法。
BMC Bioinformatics. 2006 Aug 31;7:397. doi: 10.1186/1471-2105-7-397.
10
THEA: ontology-driven analysis of microarray data.THEA:微阵列数据的本体驱动分析。
Bioinformatics. 2004 Nov 1;20(16):2636-43. doi: 10.1093/bioinformatics/bth295. Epub 2004 May 6.

引用本文的文献

1
Just in case: undergraduate students identifying and mitigating barriers to their sexual and reproductive health needs.以防万一:本科学生识别和减轻其性健康和生殖健康需求的障碍。
BMC Womens Health. 2024 Feb 6;24(1):96. doi: 10.1186/s12905-023-02854-7.
2
Extending Association Rule Mining to Microbiome Pattern Analysis: Tools and Guidelines to Support Real Applications.将关联规则挖掘扩展到微生物组模式分析:支持实际应用的工具和指南
Front Bioinform. 2022 Jan 10;1:794547. doi: 10.3389/fbinf.2021.794547. eCollection 2021.
3
Data- and expert-driven rule induction and filtering framework for functional interpretation and description of gene sets.

本文引用的文献

1
Analyzing microarray data using quantitative association rules.使用定量关联规则分析微阵列数据。
Bioinformatics. 2005 Sep 1;21 Suppl 2:ii123-9. doi: 10.1093/bioinformatics/bti1121.
2
Applications of DNA microarrays in biology.DNA微阵列在生物学中的应用。
Annu Rev Biochem. 2005;74:53-82. doi: 10.1146/annurev.biochem.74.082803.133212.
3
Effects of threshold choice on biological conclusions reached during analysis of gene expression by DNA microarrays.阈值选择对通过DNA微阵列分析基因表达时得出的生物学结论的影响。
用于基因集功能解释和描述的数据驱动与专家驱动规则归纳及筛选框架
J Biomed Semantics. 2017 Jun 26;8(1):23. doi: 10.1186/s13326-017-0129-x.
4
BicPAM: Pattern-based biclustering for biomedical data analysis.BicPAM:用于生物医学数据分析的基于模式的双聚类
Algorithms Mol Biol. 2014 Dec 16;9(1):27. doi: 10.1186/s13015-014-0027-z. eCollection 2014.
5
Mining TCGA data using Boolean implications.使用布尔蕴涵挖掘癌症基因组图谱(TCGA)数据。
PLoS One. 2014 Jul 23;9(7):e102119. doi: 10.1371/journal.pone.0102119. eCollection 2014.
6
BicSPAM: flexible biclustering using sequential patterns.BicSPAM:使用序列模式的灵活双聚类
BMC Bioinformatics. 2014 May 6;15:130. doi: 10.1186/1471-2105-15-130.
7
Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain.用于在骨骼发育异常领域发现关联规则的语义趣味性度量。
J Biomed Semantics. 2014 Feb 5;5(1):8. doi: 10.1186/2041-1480-5-8.
8
Mining rare associations between biological ontologies.挖掘生物本体之间的罕见关联。
PLoS One. 2014 Jan 3;9(1):e84475. doi: 10.1371/journal.pone.0084475. eCollection 2014.
9
A Bayesian extension of the hypergeometric test for functional enrichment analysis.用于功能富集分析的超几何检验的贝叶斯扩展。
Biometrics. 2014 Mar;70(1):84-94. doi: 10.1111/biom.12122. Epub 2013 Dec 9.
10
A primer to frequent itemset mining for bioinformatics.生物信息学频繁项集挖掘入门
Brief Bioinform. 2015 Mar;16(2):216-31. doi: 10.1093/bib/bbt074. Epub 2013 Oct 26.
Proc Natl Acad Sci U S A. 2005 Jun 21;102(25):8961-5. doi: 10.1073/pnas.0502674102. Epub 2005 Jun 10.
4
Putting microarrays in a context: integrated analysis of diverse biological data.将微阵列置于特定情境中:对多种生物数据的综合分析。
Brief Bioinform. 2005 Mar;6(1):34-43. doi: 10.1093/bib/6.1.34.
5
TOR regulates ribosomal protein gene expression via PKA and the Forkhead transcription factor FHL1.TOR通过蛋白激酶A(PKA)和叉头转录因子FHL1调节核糖体蛋白基因的表达。
Cell. 2004 Dec 29;119(7):969-79. doi: 10.1016/j.cell.2004.11.047.
6
Ribosome biogenesis: giant steps for a giant problem.核糖体生物发生:一个重大问题的巨大进展。
Cell. 2004 Dec 29;119(7):901-2. doi: 10.1016/j.cell.2004.12.011.
7
Growth-regulated recruitment of the essential yeast ribosomal protein gene activator Ifh1.生长调节的必需酵母核糖体蛋白基因激活因子Ifh1的募集
Nature. 2004 Dec 23;432(7020):1058-61. doi: 10.1038/nature03200.
8
The transcription factor Ifh1 is a key regulator of yeast ribosomal protein genes.转录因子Ifh1是酵母核糖体蛋白基因的关键调节因子。
Nature. 2004 Dec 23;432(7020):1054-8. doi: 10.1038/nature03175.
9
Sfp1 is a stress- and nutrient-sensitive regulator of ribosomal protein gene expression.Sfp1是核糖体蛋白基因表达的应激和营养敏感调节因子。
Proc Natl Acad Sci U S A. 2004 Oct 5;101(40):14315-22. doi: 10.1073/pnas.0405353101. Epub 2004 Sep 7.
10
GO::TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes.GO::TermFinder——用于访问基因本体论信息并查找与基因列表相关的显著富集基因本体论术语的开源软件。
Bioinformatics. 2004 Dec 12;20(18):3710-5. doi: 10.1093/bioinformatics/bth456. Epub 2004 Aug 5.