Zipf's law in importance of genes for cancer classification using microarray data.

Authors

Li Wentian, Yang Yaning

Affiliation

Center for Genomics and Human Genetics, North Shore LIJ Research Institute, 350 Community Drive, Manhasset, NY 11030, USA.

Publication information

J Theor Biol. 2002 Dec 21;219(4):539-51. doi: 10.1006/jtbi.2002.3145.

DOI: 10.1006/jtbi.2002.3145
PMID: 12425984
Abstract

Using a measure of how differentially expressed a gene is in two biochemically/phenotypically different conditions, we can rank all genes in a microarray dataset. We have shown that the fall-off of this measure (the normalized maximum likelihood in a classification model such as logistic regression) as a function of rank is typically a power-law function. In other, similar ranked plots this power-law function is known as Zipf's law, which is observed in many natural and social phenomena. The presence of this power-law function rules out an intrinsic cutoff point between the "important" genes and the "irrelevant" genes. We have shown that similar power-law functions are also present in permuted datasets, and we provide an explanation based on the well-known chi-squared (χ²) distribution of likelihood ratios. We discuss the implications of this Zipf's law for gene selection in microarray data analysis, as well as other characterizations of the ranked likelihood plots, such as the rate of fall-off of the likelihood.
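The ranking procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration on synthetic data, not the authors' implementation: the simulated expression matrix, the function names (`max_log_likelihood`, `rank_genes`, `loglog_slope`), the Newton solver with a small ridge term, and the choice of per-sample geometric-mean likelihood as the "normalized" score are all assumptions made here for illustration. Each gene is fitted on its own in a one-variable logistic regression, genes are sorted by the resulting score, and a straight line is fitted to log(score) against log(rank). The same steps are then repeated with permuted class labels.

```python
import numpy as np

def max_log_likelihood(x, y, n_iter=25, ridge=1e-3):
    """Maximized log-likelihood of a one-gene logistic regression y ~ x.

    Fitted by Newton's method; the small ridge term is only a numerical
    stabilizer (an assumption of this sketch, not part of the source).
    """
    x = (x - x.mean()) / x.std()                 # standardize for stability
    X = np.column_stack([np.ones_like(x), x])    # intercept + expression
    beta = np.zeros(2)
    for _ in range(n_iter):
        eta = np.clip(X @ beta, -30, 30)
        p = 1.0 / (1.0 + np.exp(-eta))
        grad = X.T @ (y - p) - ridge * beta
        hess = (X * (p * (1 - p))[:, None]).T @ X + ridge * np.eye(2)
        beta += np.linalg.solve(hess, grad)
    eta = np.clip(X @ beta, -30, 30)
    p = np.clip(1.0 / (1.0 + np.exp(-eta)), 1e-12, 1 - 1e-12)
    return np.sum(y * np.log(p) + (1 - y) * np.log(1 - p))

def rank_genes(expr, labels):
    """Return gene indices and scores sorted from best to worst.

    Score = exp(logL / n), the per-sample geometric-mean likelihood
    (assumed normalization; 0.5 corresponds to an uninformative gene
    when the two classes are balanced).
    """
    n = len(labels)
    scores = np.array([np.exp(max_log_likelihood(g, labels) / n) for g in expr])
    order = np.argsort(scores)[::-1]
    return order, scores[order]

def loglog_slope(ranked_scores):
    """Least-squares slope of log(score) versus log(rank)."""
    ranks = np.arange(1, len(ranked_scores) + 1)
    slope, _intercept = np.polyfit(np.log(ranks), np.log(ranked_scores), 1)
    return slope

# Synthetic stand-in for a microarray: 500 genes x 40 samples, two classes.
rng = np.random.default_rng(0)
labels = np.repeat([0.0, 1.0], 20)
expr = rng.normal(size=(500, 40))
expr[:20, labels == 1] += 1.0                    # 20 genes carry a real signal

order, ranked = rank_genes(expr, labels)
slope = loglog_slope(ranked)

# Same analysis with permuted labels, as in the abstract's null comparison.
perm_labels = rng.permutation(labels)
order_perm, ranked_perm = rank_genes(expr, perm_labels)
slope_perm = loglog_slope(ranked_perm)

print("log-log slope, original labels:", slope)
print("log-log slope, permuted labels:", slope_perm)
```

Plotting `ranked` and `ranked_perm` against rank on log-log axes is the natural next step. A roughly straight line in both plots would be in keeping with the abstract's point that the ranked curve has no built-in break separating "important" from "irrelevant" genes.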

Similar articles

1
Zipf's law in importance of genes for cancer classification using microarray data.
J Theor Biol. 2002 Dec 21;219(4):539-51. doi: 10.1006/jtbi.2002.3145.
2
Can Zipf's law be adapted to normalize microarrays?
BMC Bioinformatics. 2005 Feb 23;6:37. doi: 10.1186/1471-2105-6-37.
3
Zipf's Law Arises Naturally When There Are Underlying, Unobserved Variables.
PLoS Comput Biol. 2016 Dec 20;12(12):e1005110. doi: 10.1371/journal.pcbi.1005110. eCollection 2016 Dec.
4
Beyond Zipf's Law: The Lavalette Rank Function and Its Properties.
PLoS One. 2016 Sep 22;11(9):e0163241. doi: 10.1371/journal.pone.0163241. eCollection 2016.
5
Zipf's law revisited: Spoken dialog, linguistic units, parameters, and the principle of least effort.
Psychon Bull Rev. 2023 Feb;30(1):77-101. doi: 10.3758/s13423-022-02142-9. Epub 2022 Jul 15.
6
Stochastic model of Zipf's law and the universality of the power-law exponent.
Phys Rev E Stat Nonlin Soft Matter Phys. 2014 Apr;89(4):042115. doi: 10.1103/PhysRevE.89.042115. Epub 2014 Apr 8.
7
Zipf's law leads to Heaps' law: analyzing their relation in finite-size systems.
PLoS One. 2010 Dec 2;5(12):e14139. doi: 10.1371/journal.pone.0014139.
8
Applicability of Zipf's Law in Traditional Chinese Medicine Prescriptions.
Chin Med Sci J. 2022 Sep 30;37(3):195-200. doi: 10.24920/004133.
9
The languages of health in general practice electronic patient records: a Zipf's law analysis.
J Biomed Semantics. 2014 Jan 10;5(1):2. doi: 10.1186/2041-1480-5-2.
10
Zipf's law in gene expression.
Phys Rev Lett. 2003 Feb 28;90(8):088102. doi: 10.1103/PhysRevLett.90.088102. Epub 2003 Feb 26.

Cited by

1
Use of 6 Nucleotide Length Words to Study the Complexity of Gene Sequences from Different Organisms.
Entropy (Basel). 2022 Apr 30;24(5):632. doi: 10.3390/e24050632.
2
Empirical evidence for concerted evolution in the 18S rDNA region of the planktonic diatom genus Chaetoceros.
Sci Rep. 2021 Jan 12;11(1):807. doi: 10.1038/s41598-020-80829-6.
3
RNA-Seq-Based Breast Cancer Subtypes Classification Using Machine Learning Approaches.
Comput Intell Neurosci. 2020 Oct 29;2020:4737969. doi: 10.1155/2020/4737969. eCollection 2020.
4
Asymptotic structural properties of quasi-random saturated structures of RNA.
Algorithms Mol Biol. 2013 Oct 25;8(1):24. doi: 10.1186/1748-7188-8-24.
5
Cisplatin for small cell lung cancer: Associated publications in Science Citation Index Expanded.
Oncol Lett. 2013 Feb;5(2):684-688. doi: 10.3892/ol.2012.1029. Epub 2012 Nov 15.
6
Word decoding of protein amino Acid sequences with availability analysis: a linguistic approach.
PLoS One. 2012;7(11):e50039. doi: 10.1371/journal.pone.0050039. Epub 2012 Nov 21.
7
Cia5d regulates a new fibroblast-like synoviocyte invasion-associated gene expression signature.
Arthritis Res Ther. 2008;10(4):R92. doi: 10.1186/ar2476. Epub 2008 Aug 15.
8
A simple method to combine multiple molecular biomarkers for dichotomous diagnostic classification.
BMC Bioinformatics. 2006 Oct 10;7:442. doi: 10.1186/1471-2105-7-442.
9
Microarray analyses of peripheral blood cells identifies unique gene expression signature in psoriatic arthritis.
Mol Med. 2005 Jan-Dec;11(1-12):21-9. doi: 10.2119/2006-00003.Gulko.
10
Entropy-based gene ranking without selection bias for the predictive classification of microarray data.
BMC Bioinformatics. 2003 Nov 6;4:54. doi: 10.1186/1471-2105-4-54.