• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LeFE算法:在解读微阵列数据时接纳基因表达的复杂性。

The LeFE algorithm: embracing the complexity of gene expression in the interpretation of microarray data.

作者信息

Eichler Gabriel S, Reimers Mark, Kane David, Weinstein John N

机构信息

Genomics and Bioinformatics Groups, Laboratory of Molecular Pharmacology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA.

出版信息

Genome Biol. 2007;8(9):R187. doi: 10.1186/gb-2007-8-9-r187.

DOI:10.1186/gb-2007-8-9-r187
PMID:17845722
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2375025/
Abstract

Interpretation of microarray data remains a challenge, and most methods fail to consider the complex, nonlinear regulation of gene expression. To address that limitation, we introduce Learner of Functional Enrichment (LeFE), a statistical/machine learning algorithm based on Random Forest, and demonstrate it on several diverse datasets: smoker/never smoker, breast cancer classification, and cancer drug sensitivity. We also compare it with previously published algorithms, including Gene Set Enrichment Analysis. LeFE regularly identifies statistically significant functional themes consistent with known biology.

摘要

微阵列数据的解读仍然是一项挑战,并且大多数方法未能考虑基因表达的复杂非线性调控。为解决这一局限性,我们引入了功能富集学习器(LeFE),这是一种基于随机森林的统计/机器学习算法,并在几个不同的数据集上进行了验证:吸烟者/从不吸烟者、乳腺癌分类以及癌症药物敏感性。我们还将其与先前发表的算法进行了比较,包括基因集富集分析。LeFE能够定期识别出与已知生物学一致的具有统计学意义的功能主题。

相似文献

1
The LeFE algorithm: embracing the complexity of gene expression in the interpretation of microarray data.LeFE算法:在解读微阵列数据时接纳基因表达的复杂性。
Genome Biol. 2007;8(9):R187. doi: 10.1186/gb-2007-8-9-r187.
2
An entropy-based gene selection method for cancer classification using microarray data.一种基于熵的利用微阵列数据进行癌症分类的基因选择方法。
BMC Bioinformatics. 2005 Mar 24;6:76. doi: 10.1186/1471-2105-6-76.
3
Feature selection and nearest centroid classification for protein mass spectrometry.蛋白质质谱的特征选择与最近质心分类
BMC Bioinformatics. 2005 Mar 23;6:68. doi: 10.1186/1471-2105-6-68.
4
Integration of RNA-Seq data with heterogeneous microarray data for breast cancer profiling.整合RNA测序数据与异质性微阵列数据用于乳腺癌分析。
BMC Bioinformatics. 2017 Nov 21;18(1):506. doi: 10.1186/s12859-017-1925-0.
5
Bioinformatics/biostatistics: microarray analysis.生物信息学/生物统计学:微阵列分析
Methods Mol Biol. 2012;823:347-58. doi: 10.1007/978-1-60327-216-2_22.
6
Regularized Least Squares Cancer classifiers from DNA microarray data.基于DNA微阵列数据的正则化最小二乘癌症分类器。
BMC Bioinformatics. 2005 Dec 1;6 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-6-S4-S2.
7
Cancer DNA microarray analysis considering multi-subclass with graph-based clustering method.基于图形聚类方法的多亚类癌症DNA微阵列分析
J Biosci Bioeng. 2008 Nov;106(5):442-8. doi: 10.1263/jbb.106.442.
8
Redefinition of Affymetrix probe sets by sequence overlap with cDNA microarray probes reduces cross-platform inconsistencies in cancer-associated gene expression measurements.通过与cDNA微阵列探针的序列重叠来重新定义Affymetrix探针集,可减少癌症相关基因表达测量中跨平台的不一致性。
BMC Bioinformatics. 2005 Apr 25;6:107. doi: 10.1186/1471-2105-6-107.
9
Towards precise classification of cancers based on robust gene functional expression profiles.基于稳健的基因功能表达谱实现癌症的精准分类
BMC Bioinformatics. 2005 Mar 17;6:58. doi: 10.1186/1471-2105-6-58.
10
Fuzzy logic selection as a new reliable tool to identify molecular grade signatures in breast cancer--the INNODIAG study.模糊逻辑选择作为识别乳腺癌分子分级特征的一种新型可靠工具——INNODIAG研究
BMC Med Genomics. 2015 Feb 7;8:3. doi: 10.1186/s12920-015-0077-1.

引用本文的文献

1
Risk Prediction of Cardiovascular Events by Exploration of Molecular Data with Explainable Artificial Intelligence.利用可解释人工智能探索分子数据预测心血管事件风险。
Int J Mol Sci. 2021 Sep 24;22(19):10291. doi: 10.3390/ijms221910291.
2
Integrating biological knowledge and gene expression data using pathway-guided random forests: a benchmarking study.使用通路引导的随机森林整合生物学知识和基因表达数据:一项基准研究
Bioinformatics. 2020 Aug 1;36(15):4301-4308. doi: 10.1093/bioinformatics/btaa483.
3
Application of random forest based approaches to surface-enhanced Raman scattering data.

本文引用的文献

1
Smoking and cancer-related gene expression in bronchial epithelium and non-small-cell lung cancers.吸烟与支气管上皮及非小细胞肺癌中癌症相关基因的表达
J Pathol. 2006 Oct;210(2):192-204. doi: 10.1002/path.2039.
2
Coregulation of estrogen receptor by ERBB4/HER4 establishes a growth-promoting autocrine signal in breast tumor cells.ERBB4/HER4对雌激素受体的共同调节在乳腺肿瘤细胞中建立了一种促进生长的自分泌信号。
Cancer Res. 2006 Aug 15;66(16):7991-8. doi: 10.1158/0008-5472.CAN-05-4397.
3
Baseline gene expression predicts sensitivity to gefitinib in non-small cell lung cancer cell lines.
基于随机森林方法在表面增强拉曼散射数据中的应用。
Sci Rep. 2020 Mar 25;10(1):5436. doi: 10.1038/s41598-020-62338-8.
4
A system-level pathway-phenotype association analysis using synthetic feature random forest.基于合成特征随机森林的系统水平通路-表型关联分析。
Genet Epidemiol. 2014 Apr;38(3):209-19. doi: 10.1002/gepi.21794. Epub 2014 Feb 17.
5
Random forests for genomic data analysis.随机森林在基因组数据分析中的应用。
Genomics. 2012 Jun;99(6):323-9. doi: 10.1016/j.ygeno.2012.04.003. Epub 2012 Apr 21.
6
Investigating the effect of paralogs on microarray gene-set analysis.研究旁系同源基因对基因芯片基因集分析的影响。
BMC Bioinformatics. 2011 Jan 24;12:29. doi: 10.1186/1471-2105-12-29.
7
Identification of functional modules that correlate with phenotypic difference: the influence of network topology.鉴定与表型差异相关的功能模块:网络拓扑结构的影响。
Genome Biol. 2010;11(2):R23. doi: 10.1186/gb-2010-11-2-r23. Epub 2010 Feb 26.
8
Gene expression meta-analysis supports existence of molecular apocrine breast cancer with a role for androgen receptor and implies interactions with ErbB family.基因表达荟萃分析支持存在具有雄激素受体作用的分子大汗腺癌,并提示其与表皮生长因子受体(ErbB)家族存在相互作用。
BMC Med Genomics. 2009 Sep 11;2:59. doi: 10.1186/1755-8794-2-59.
9
Human urinary metabolomic profile of PPARalpha induced fatty acid beta-oxidation.过氧化物酶体增殖物激活受体α诱导脂肪酸β氧化的人尿代谢组学特征
J Proteome Res. 2009 Sep;8(9):4293-300. doi: 10.1021/pr9004103.
10
Machine learning approach identifies new pathways associated with demyelination in a viral model of multiple sclerosis.机器学习方法鉴定出多发性硬化病毒模型中与脱髓鞘相关的新途径。
J Cell Mol Med. 2010 Jan;14(1-2):434-48. doi: 10.1111/j.1582-4934.2008.00646.x. Epub 2009 Jan 14.
基线基因表达可预测非小细胞肺癌细胞系对吉非替尼的敏感性。
Mol Cancer Res. 2006 Aug;4(8):521-8. doi: 10.1158/1541-7786.MCR-06-0095.
4
Strategies to enhance epidermal growth factor inhibition: targeting the mevalonate pathway.增强表皮生长因子抑制作用的策略:靶向甲羟戊酸途径。
Clin Cancer Res. 2006 Jul 15;12(14 Pt 2):4426s-4431s. doi: 10.1158/1078-0432.CCR-06-0089.
5
Pathway analysis using random forests classification and regression.使用随机森林分类和回归的通路分析
Bioinformatics. 2006 Aug 15;22(16):2028-36. doi: 10.1093/bioinformatics/btl344. Epub 2006 Jun 29.
6
Cigarette smoke condensate induces cytochromes P450 and aldo-keto reductases in oral cancer cells.香烟烟雾冷凝物可诱导口腔癌细胞中的细胞色素P450和醛酮还原酶。
Toxicol Lett. 2006 Aug 20;165(2):182-94. doi: 10.1016/j.toxlet.2006.03.008. Epub 2006 Apr 18.
7
Comparison of methods for sequential screening of large compound sets.大型化合物集序贯筛选方法的比较
Comb Chem High Throughput Screen. 2006 Feb;9(2):115-22. doi: 10.2174/138620706775541882.
8
Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.基因集富集分析:一种基于知识的方法用于解读全基因组表达谱。
Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50. doi: 10.1073/pnas.0506580102. Epub 2005 Sep 30.
9
High-Throughput GoMiner, an 'industrial-strength' integrative gene ontology tool for interpretation of multiple-microarray experiments, with application to studies of Common Variable Immune Deficiency (CVID).高通量GoMiner,一种用于解释多微阵列实验的“工业强度”综合基因本体工具,应用于常见变异免疫缺陷(CVID)研究。
BMC Bioinformatics. 2005 Jul 5;6:168. doi: 10.1186/1471-2105-6-168.
10
Identification of molecular apocrine breast tumours by microarray analysis.通过微阵列分析鉴定分子顶泌乳腺肿瘤
Oncogene. 2005 Jul 7;24(29):4660-71. doi: 10.1038/sj.onc.1208561.