Clinically driven semi-supervised class discovery in gene expression data.

Authors

Steinfeld Israel, Navon Roy, Ardigò Diego, Zavaroni Ivana, Yakhini Zohar

Affiliation

Agilent Laboratories, Tel Aviv, Israel.

Publication details

Bioinformatics. 2008 Aug 15;24(16):i90-7. doi: 10.1093/bioinformatics/btn279.

DOI:10.1093/bioinformatics/btn279
PMID:18689846
Abstract

MOTIVATION

Unsupervised class discovery in gene expression data relies on the statistical signals in the data to exclusively drive the results. It is often the case, however, that one is interested in constraining the search space to respect certain biological prior knowledge while still allowing a flexible search within these boundaries.

RESULTS

We develop an approach to semi-supervised class discovery. One component of our approach uses clinical sample information to constrain the search space and guide the class discovery process to yield biologically relevant partitions. A second component consists of using known biological annotation of genes to drive the search, seeking partitions that manifest strong differential expression in specific sets of genes. We develop efficient algorithmics for these tasks, implementing both approaches and combinations thereof. We show that our method is robust enough to detect known clinical parameters in accordance with expected clinical values. We also use our method to elucidate cardiovascular disease (CVD) putative risk factors.

AVAILABILITY

MonoClaD (Monotone Class Discovery). See http://bioinfo.cs.technion.ac.il/people/zohar/MonoClad/.

SUPPLEMENTARY INFORMATION

Supplementary data is available at http://bioinfo.cs.technion.ac.il/people/zohar/MonoClad/software.html.
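
ILLUSTRATIVE SKETCH

The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea under stated assumptions, not the MonoClaD implementation. It assumes that "constraining the search space" with a clinical parameter means considering only partitions obtained by thresholding that parameter, and that a partition is scored by the number of genes whose Welch t statistic exceeds a cutoff. The function names, the scoring rule, the `gene_indices` option (a stand-in for restricting the score to an annotated gene set) and the default cutoff are all hypothetical.

```python
import math
from statistics import mean, stdev


def welch_t(a, b):
    """Welch's t statistic for two samples (each needs at least 2 values)."""
    diff = mean(a) - mean(b)
    denom = math.sqrt(stdev(a) ** 2 / len(a) + stdev(b) ** 2 / len(b))
    if denom == 0:
        # Both groups are constant: no signal if equal, unbounded if not.
        return 0.0 if diff == 0 else math.copysign(math.inf, diff)
    return diff / denom


def best_threshold_partition(clinical, expression, gene_indices=None,
                             min_size=2, t_cutoff=5.0):
    """Search only partitions that are monotone in a clinical parameter.

    clinical     -- one clinical value per sample
    expression   -- list of genes, each a list of per-sample values
    gene_indices -- optional subset of genes to score (assumed gene set)
    Returns (score, threshold, low_samples, high_samples), or None if no
    admissible cut exists.
    """
    if min_size < 2:
        raise ValueError("min_size must be at least 2")
    genes = expression if gene_indices is None else [
        expression[g] for g in gene_indices]
    # Sort sample indices by clinical value; each cut point is a candidate.
    order = sorted(range(len(clinical)), key=lambda i: clinical[i])
    best = None
    for k in range(min_size, len(order) - min_size + 1):
        lo_val, hi_val = clinical[order[k - 1]], clinical[order[k]]
        if lo_val == hi_val:
            continue  # never split samples with tied clinical values
        low, high = order[:k], order[k:]
        # Score: number of genes strongly differential between the groups.
        score = sum(
            1 for gene in genes
            if abs(welch_t([gene[i] for i in low],
                           [gene[i] for i in high])) >= t_cutoff
        )
        if best is None or score > best[0]:
            best = (score, (lo_val + hi_val) / 2, low, high)
    return best
```

Restricting candidates to cut points of the sorted clinical values reduces the search from all 2^n sample bipartitions to at most n - 1, which is what makes an exhaustive scan feasible in this sketch.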

Similar articles

1
Clinically driven semi-supervised class discovery in gene expression data.
Bioinformatics. 2008 Aug 15;24(16):i90-7. doi: 10.1093/bioinformatics/btn279.
2
Induction of comprehensible models for gene expression datasets by subgroup discovery methodology.
J Biomed Inform. 2004 Aug;37(4):269-84. doi: 10.1016/j.jbi.2004.07.007.
3
Graph-based consensus clustering for class discovery from gene expression data.
Bioinformatics. 2007 Nov 1;23(21):2888-96. doi: 10.1093/bioinformatics/btm463. Epub 2007 Sep 14.
4
Hierarchical tree snipping: clustering guided by prior knowledge.
Bioinformatics. 2007 Dec 15;23(24):3335-42. doi: 10.1093/bioinformatics/btm526. Epub 2007 Nov 7.
5
A supervised approach for identifying discriminating genotype patterns and its application to breast cancer data.
Bioinformatics. 2007 Jan 15;23(2):e91-8. doi: 10.1093/bioinformatics/btl298.
6
Biomarker discovery across annotated and unannotated microarray datasets using semi-supervised learning.
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S7. doi: 10.1186/1471-2164-9-S2-S7.
7
I/NI-calls for the exclusion of non-informative genes: a highly effective filtering tool for microarray data.
Bioinformatics. 2007 Nov 1;23(21):2897-902. doi: 10.1093/bioinformatics/btm478. Epub 2007 Oct 5.
8
A method for detection of differential gene expression in the presence of inter-individual variability in response.
Bioinformatics. 2005 Nov 1;21(21):3990-2. doi: 10.1093/bioinformatics/bti667. Epub 2005 Sep 13.
9
A pattern recognition approach to infer time-lagged genetic interactions.
Bioinformatics. 2008 May 1;24(9):1183-90. doi: 10.1093/bioinformatics/btn098. Epub 2008 Mar 12.
10
Annotation-based distance measures for patient subgroup discovery in clinical microarray studies.
Bioinformatics. 2007 Sep 1;23(17):2256-64. doi: 10.1093/bioinformatics/btm322. Epub 2007 Jun 22.

Cited by

1
Detecting significant expression patterns in single-cell and spatial transcriptomics with a flexible computational approach.
Sci Rep. 2024 Oct 30;14(1):26121. doi: 10.1038/s41598-024-75314-3.
2
Efficient gene expression signature for a breast cancer immuno-subtype.
PLoS One. 2021 Jan 12;16(1):e0245215. doi: 10.1371/journal.pone.0245215. eCollection 2021.
3
Molecular harvesting with electroporation for tissue profiling.
Sci Rep. 2019 Oct 31;9(1):15750. doi: 10.1038/s41598-019-51634-7.
4
Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning.
IEEE J Transl Eng Health Med. 2014 Dec 2;2:4300211. doi: 10.1109/JTEHM.2014.2375820. eCollection 2014.
5
Aberrant DNA methylation in ES cells.
PLoS One. 2014 May 22;9(5):e96090. doi: 10.1371/journal.pone.0096090. eCollection 2014.
6
Mutual enrichment in ranked lists and the statistical assessment of position weight matrix motifs.
Algorithms Mol Biol. 2014 Apr 5;9(1):11. doi: 10.1186/1748-7188-9-11.
7
Improving MEME via a two-tiered significance analysis.
Bioinformatics. 2014 Jul 15;30(14):1965-73. doi: 10.1093/bioinformatics/btu163. Epub 2014 Mar 24.
8
DRIMust: a web server for discovering rank imbalanced motifs using suffix trees.
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W174-9. doi: 10.1093/nar/gkt407. Epub 2013 May 17.
9
miRNA target enrichment analysis reveals directly active miRNAs in health and disease.
Nucleic Acids Res. 2013 Feb 1;41(3):e45. doi: 10.1093/nar/gks1142. Epub 2012 Dec 2.
10
A unified computational model for revealing and predicting subtle subtypes of cancers.
BMC Bioinformatics. 2012 May 1;13:70. doi: 10.1186/1471-2105-13-70.